Custom Voice
POST
/kling/v1/general/custom-voices
Create a custom voice for use in later speech synthesis and related features.
Authorizations
bearer
Type
HTTP (bearer)
Request Body
application/json
voice_name
string
Required
Voice name
voice_url
string
URL to fetch the voice source file.
Supports .mp3, .wav, .mp4, and .mov audio/video files.
The recording must contain a single clean human voice with no background noise, between 5 and 30 seconds long.
video_id
string
Historical work ID; reference a past generation to supply audio material.
Only videos that meet all of the following may be used for custom voice training:
- Generated with model version V2.6 and
soundset toon - Created via the digital human API
- Created via the lip-sync API
The audio must contain a single clean human voice with no background noise, between 5 and 30 seconds long.
callback_url
string
Callback URL when the task completes.
external_task_id
string
Caller-defined task ID for reconciliation and lookup.
Responses
OK
application/json
object