text-to-video

Vidu Q3 Text-To-Video

Vidu Q3 Text-to-Video turns text prompts into high-quality videos with exceptional visual fidelity and diverse motion. It features multiple styles, resolutions up to 1080p, flexible duration, audio generation, and motion control.

Use one of our client libraries to get started quickly.

Vidu Q3 Text-To-Video

Vidu Q3 Text-to-Video turns text prompts into high-quality videos with exceptional visual fidelity and diverse motion. It features multiple styles, resolutions up to 1080p, flexible duration, audio generation, and motion control.. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Video output~5-15sfrom $0.42/run

1. Calling the API

Submit a request

Send a POST request to start generation. The API returns immediately with a prediction ID for polling.

curl -X POST "https://api.vibedream.ai/api/v1/models/vidu-q3-t2v/generate" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $VIBEDREAM_API_KEY" \
  -d '{
    "prompt": "A beautiful sunset over mountains with golden light",
    "style": "general",
    "resolution": "540p",
    "duration": 1,
    "aspect_ratio": "16:9",
    "movement_amplitude": "auto",
    "generate_audio": false,
    "bgm": false,
    "seed": -1
}'

2. Authentication

The API uses an API Key for authentication.

Get your API Key

Get your API key from vibedream.ai/models/api-keys.

Environment variable

export VIBEDREAM_API_KEY="your-api-key"

3. Queue & Results

Generation requests are queued and processed asynchronously. Poll the prediction endpoint until status is completed or failed.

Submit request

curl -X POST "https://api.vibedream.ai/api/v1/models/vidu-q3-t2v/generate" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $VIBEDREAM_API_KEY" \
  -d '{
    "prompt": "A beautiful sunset over mountains with golden light",
    "style": "general",
    "resolution": "540p",
    "duration": 1,
    "aspect_ratio": "16:9",
    "movement_amplitude": "auto",
    "generate_audio": false,
    "bgm": false,
    "seed": -1
}'

Response

Returns immediately with a prediction ID. Use id to poll for results.

{
  "id": "550e8400-e29b-41d4-a716-446655440000",
  "status": "processing",
  "statusUrl": "https://api.vibedream.ai/api/v1/predictions/550e8400-e29b-41d4-a716-446655440000",
  "estimatedTime": "5-15s",
  "costCents": 42,
  "createdAt": "2025-01-15T12:00:00.000Z"
}

Get the result

Polling

# Replace YOUR_PREDICTION_ID with the id from the submit response
curl "https://api.vibedream.ai/api/v1/predictions/YOUR_PREDICTION_ID" \
  -H "Authorization: Bearer $VIBEDREAM_API_KEY"

# Poll every 1-2s until status is "completed" or "failed"

Completed response

outputs is string[] — an array of direct download URLs hosted on assets.vibedream.ai.

{
  "id": "550e8400-e29b-41d4-a716-446655440000",
  "modelId": "vidu-q3-t2v",
  "modelName": "Vidu Q3 Text-To-Video",
  "status": "completed",
  "outputs": [
    "https://assets.vibedream.ai/outputs/550e8400-e29b-41d4-a716-446655440000/1736942400000-0.mp4"
  ],
  "error": null,
  "createdAt": "2025-01-15T12:00:00.000Z",
  "completedAt": "2025-01-15T12:00:30.000Z"
}

Failed response

{
  "id": "550e8400-e29b-41d4-a716-446655440000",
  "modelId": "vidu-q3-t2v",
  "modelName": "Vidu Q3 Text-To-Video",
  "status": "failed",
  "outputs": null,
  "error": "Your request was flagged by content moderation. Please modify your prompt.",
  "createdAt": "2025-01-15T12:00:00.000Z",
  "completedAt": "2025-01-15T12:00:05.000Z"
}
FieldTypeDescription
idstringUnique prediction ID (UUID).
modelIdstringID of the model used for generation.
modelNamestringHuman-readable model name.
statusstringCurrent status. One of: processing, completed, failed.
outputsstring[] | nullArray of output URLs. Each URL is a direct download link to the generated file on assets.vibedream.ai. null when still processing or failed.
errorstring | nullError message if the generation failed. null on success.
createdAtstringISO 8601 timestamp when the request was submitted.
completedAtstring | nullISO 8601 timestamp when generation finished. null while processing.

4. Schema

Input

ParameterTypeRequiredDefaultRangeDescription
promptstringYes----Text description of the video scene and action
stylestringNo--general, animeChoose between general realistic style or anime aesthetic.
resolutionstringNo--540p, 720p, 1080pOutput quality for the video.
durationnumberNo--1 – 16 (step 1)Video length in seconds (1-16).
aspect_ratiostringNo--16:9, 4:3, 9:16Output ratio for the video.
movement_amplitudestringNo--auto, small, medium, largeMotion intensity level.
generate_audiobooleanNo----Whether to generate synchronized audio.
bgmbooleanNo----Add background music to the output.
seednumberNo---1 – -- (step 1)Random seed for reproducibility (-1 for random).

Example request

{
    "prompt": "A beautiful sunset over mountains with golden light",
    "style": "general",
    "resolution": "540p",
    "duration": 1,
    "aspect_ratio": "16:9",
    "movement_amplitude": "auto",
    "generate_audio": false,
    "bgm": false,
    "seed": -1
}

Output

{
  "id": "550e8400-e29b-41d4-a716-446655440000",
  "modelId": "vidu-q3-t2v",
  "modelName": "Vidu Q3 Text-To-Video",
  "status": "completed",
  "outputs": [
    "https://assets.vibedream.ai/outputs/550e8400-e29b-41d4-a716-446655440000/1736942400000-0.mp4"
  ],
  "error": null,
  "createdAt": "2025-01-15T12:00:00.000Z",
  "completedAt": "2025-01-15T12:00:30.000Z"
}