Multimodal Video

POST
/kling/v1/videos/multi-elements

Edits a video with a multimodal operation (adding, swapping, or removing an element), based on an existing session and the selections made in it.

Authorizations

bearer
Type: HTTP (bearer)

Request Body

application/json
object

Model name. Enum: kling-v1-6

Session ID generated by the video init task; unchanged by selection edits

Operation type.
Enum: addition, swap, removal
addition: add an element.
swap: replace an element.
removal: remove an element.

List of cropped reference images. Type: string[]

Positive text prompt

Negative text prompt

Video generation mode.
Enum: std, pro
std: Standard mode — balanced quality and cost.
pro: Pro mode — higher quality output.

Video duration in seconds.
Enum: 5, 10
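
The request body described above can be sketched in Python as follows. This is a minimal sketch, not the official client: the JSON field names (model_name, session_id, edit_mode, image_list, prompt, negative_prompt, mode, duration) do not appear on this page and are assumptions, as are the base URL, the helper name build_request, the choice of which fields are required, and sending duration as a number. Only the path, the HTTP method, the bearer scheme, and the enum values come from the reference.

```python
import json
import urllib.request

# Enum values as listed in the reference above.
MODELS = {"kling-v1-6"}
OPERATIONS = {"addition", "swap", "removal"}
MODES = {"std", "pro"}
DURATIONS = {5, 10}


def build_request(base_url, token, *, session_id, operation, prompt,
                  images=None, negative_prompt=None,
                  model="kling-v1-6", mode="std", duration=5):
    """Build (but do not send) a POST request for the multi-elements endpoint.

    All JSON keys below are hypothetical names chosen for illustration.
    """
    if model not in MODELS:
        raise ValueError(f"unsupported model: {model!r}")
    if operation not in OPERATIONS:
        raise ValueError(f"unsupported operation: {operation!r}")
    if mode not in MODES:
        raise ValueError(f"unsupported mode: {mode!r}")
    if duration not in DURATIONS:
        raise ValueError(f"unsupported duration: {duration!r}")

    body = {
        "model_name": model,        # assumed key
        "session_id": session_id,   # assumed key
        "edit_mode": operation,     # assumed key
        "prompt": prompt,           # assumed key
        "mode": mode,               # assumed key
        "duration": duration,       # assumed key and type
    }
    if images:
        body["image_list"] = list(images)          # assumed key
    if negative_prompt:
        body["negative_prompt"] = negative_prompt  # assumed key

    return urllib.request.Request(
        base_url.rstrip("/") + "/kling/v1/videos/multi-elements",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

To send the request, pass the returned object to urllib.request.urlopen. Validating the enums on the client side keeps obviously invalid values from reaching the server, though the server remains the authority on what it accepts.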

Responses

OK

application/json
object

Task ID

Task status
