Vidu Q1

Overview

Vidu Q1 is the next-generation video generation model from Vidu, designed for high-quality video creation. It consistently outputs 5-second, 24-frame, 1080P video clips. Optimized for visual clarity, Vidu Q1 improves image quality and reduces artifacts such as hand distortion and frame jitter. The model renders photorealistic scenes that closely resemble the real world while maintaining stylistic accuracy in 2D animation. Transitions between the first and last frames are smooth, making Vidu Q1 well suited to demanding creative work in film, advertising, and animated short productions.

viduq1-image
viduq1-start-end
viduq1-text

Price

$0.4 / video
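
As a rough illustration of the per-video pricing, the sketch below estimates spend for a batch of generations. It assumes a flat charge of $0.4 for every generated video, with no discounts or extra fees; `estimate_cost` is a hypothetical helper written for this page, not part of any SDK.

```python
from decimal import Decimal

# Listed price per generated video, in USD (taken from the Price section).
PRICE_PER_VIDEO_USD = Decimal("0.4")


def estimate_cost(num_videos: int) -> Decimal:
    """Estimate the cost in USD of generating `num_videos` clips.

    Assumption: every video is billed at the same flat price.
    """
    if num_videos < 0:
        raise ValueError("num_videos must be non-negative")
    return PRICE_PER_VIDEO_USD * num_videos
```

For example, `estimate_cost(25)` multiplies the listed price by 25 clips. `Decimal` is used instead of `float` so that currency amounts are not affected by binary rounding.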

Capability: Image-to-Video Generation
Duration: 5 seconds
Clarity: 1080P

Capability Description

Image-to-Video Generation

Generate a video by providing a starting frame or both starting and ending frames along with corresponding text descriptions.

Start and End Frame

Supports two input images: the first uploaded image is treated as the starting frame and the second as the ending frame. The model generates the video from both images.

Text-to-Video Generation

Generate a video from a text prompt; currently supports both a general style and an anime style optimized for animation.
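
The three capabilities above correspond to the three model names listed in the Overview. A minimal sketch of choosing the model from the number of input images follows; `select_model` is a hypothetical helper, and the mapping assumes that each model name matches the capability its suffix suggests.

```python
from typing import Optional, Sequence, Union


def select_model(image_urls: Optional[Union[str, Sequence[str]]] = None) -> str:
    """Pick a Vidu Q1 model name from the number of input images.

    No image   -> text-to-video
    One image  -> image-to-video (the image is the starting frame)
    Two images -> start and end frame (first = start, second = end)
    """
    # Treat a bare string as a single image rather than a sequence of characters.
    if isinstance(image_urls, str):
        image_urls = [image_urls]
    count = len(image_urls) if image_urls else 0
    if count == 0:
        return "viduq1-text"
    if count == 1:
        return "viduq1-image"
    if count == 2:
        return "viduq1-start-end"
    raise ValueError("at most two images are supported (start and end frame)")
```

Keeping this choice in one place avoids sending a two-image list to the single-image model by accident.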

The URL of a generated video is valid for one day. Save the video as soon as possible if you need to keep it.
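
Because the link expires, it can help to copy the file to your own storage as soon as the URL is available. The sketch below uses only the Python standard library; `save_video` is a hypothetical helper, and how you obtain the video URL from the API response is not covered here.

```python
import shutil
import urllib.request
from pathlib import Path


def save_video(url: str, destination: str, timeout: float = 60.0) -> Path:
    """Download the file at `url` and write it to `destination`.

    The response is streamed to disk in chunks, so the whole video
    is never held in memory at once.
    """
    dest = Path(destination)
    # Create the target directory if it does not exist yet.
    dest.parent.mkdir(parents=True, exist_ok=True)
    with urllib.request.urlopen(url, timeout=timeout) as response, open(dest, "wb") as out:
        shutil.copyfileobj(response, out)
    return dest
```

A call such as `save_video(video_url, "videos/clip.mp4")` would store the clip locally, where `video_url` stands for the link returned for your generation task.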

Usage

Film Generation

By inputting script excerpts, concept art, and other materials, users can generate promotional videos, visual effects shots, and auxiliary film assets
Delivers theatrical-level clarity and visual quality with complete frame details
Provides professional-grade video transitions with natural scene flow

Anime Production

Input character designs and storyboard scripts to quickly generate 2D animated sequences and stylized anime shorts
Supports styles such as Chinese animation and Japanese anime
Enables storyline extension and creative regeneration of classic IPs

Short Drama Production

Automatically generate short videos or micro-dramas from novel chapters or scripted scenes
Covers diverse genres such as romance, mystery, and historical drama
Optimized for multi-platform distribution needs

Advertising & Marketing

Quickly generate highly engaging brand ads, e-commerce product videos, and interactive ads (e.g., virtual try-on) based on product images and feature descriptions
Supports adaptation to various platform dimensions and creative formats

Resources

API Documentation: Learn how to call the API.

Introducing Vidu Q1

Cinematic-Level Visual Clarity

The model delivers a comprehensive upgrade in visual detail restoration.

Precise Resolution of Visual Artifacts

Movements are smooth and natural: hand gestures during product demonstrations in e-commerce livestreams are accurately rendered and compliant. Dynamic frame interpolation minimizes visual jitter, keeping footage fluid and stable even in motion-heavy scenes such as running shots or vehicle perspectives.

Multi-Style Artistic Expression

The realistic style aims for lifelike visuals—urban landscapes and character portraits in city promos are rendered with striking realism. The animated style focuses on authenticity, accurately capturing everything from the hand-drawn lines of Japanese anime to the saturated colors of Western cartoons. By inputting anime character designs, the model generates dynamic story segments that closely match the original IP’s visual style, boosting the efficiency of derivative content creation.


Industry-Leading Transition Smoothness

The start and end frame transition technology reaches a new level, using dynamic frame prediction and style fusion algorithms to overcome the limitations of “mechanical stitching” in video transitions.

Quick Start

1. Text-to-Video Generation

Curl

curl --location --request POST 'https://api.z.ai/api/paas/v4/videos/generations' \
--header 'Authorization: Bearer {your apikey}' \
--header 'Content-Type: application/json' \
--data-raw '{
    "model": "viduq1-text",
    "style": "anime",
    "prompt": "Peter Rabbit drives a small car along the road, his face filled with joy and happiness.",
    "duration": 5,
    "aspect_ratio": "16:9",
    "size": "1920x1080",
    "movement_amplitude": "auto"
}'
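
For readers who prefer Python, the same request can be sketched with the standard library. The endpoint, headers, and body fields mirror the curl command above. The helper names are hypothetical, and `send` assumes that the API replies with a JSON body; the response fields are not described on this page, so the sketch does not interpret them.

```python
import json
import urllib.request

API_URL = "https://api.z.ai/api/paas/v4/videos/generations"


def build_text_to_video_request(
    api_key: str, prompt: str, style: str = "anime"
) -> urllib.request.Request:
    """Build (but do not send) a text-to-video request for viduq1-text."""
    payload = {
        "model": "viduq1-text",
        "style": style,
        "prompt": prompt,
        "duration": 5,
        "aspect_ratio": "16:9",
        "size": "1920x1080",
        "movement_amplitude": "auto",
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 30.0) -> dict:
    """Send the request and decode the reply (assumed to be JSON)."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Separating request construction from sending makes it possible to inspect or log the payload before any network call is made.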

2. Image-to-Video Generation

Curl

curl --location --request POST 'https://api.z.ai/api/paas/v4/videos/generations' \
--header 'Authorization: Bearer {your apikey}' \
--header 'Content-Type: application/json' \
--data-raw '{
    "model":"viduq1-image",
    "image_url":"https://example.com/path/to/your/image.jpg",
    "prompt":"Peter Rabbit drives a small car along the road, his face filled with joy and happiness.",
    "duration":5,
    "size":"1920x1080",
    "movement_amplitude":"auto"
}'

3. Start and End Frame

Curl

curl --location --request POST 'https://api.z.ai/api/paas/v4/videos/generations' \
--header 'Authorization: Bearer {your apikey}' \
--header 'Content-Type: application/json' \
--data-raw '{
    "model":"viduq1-start-end",
    "image_url":["https://example.com/path/to/your/image.jpg","https://example.com/path/to/your/image1.jpg"],
    "prompt":"Peter Rabbit drives a small car along the road, his face filled with joy and happiness.",
    "duration":5,
    "size":"1920x1080",
    "movement_amplitude":"auto"
}'
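
Since the order of the two images determines which one is the starting frame, it can be convenient to build the request body from named arguments. The sketch below produces the same JSON body as the curl command above; `build_start_end_payload` is a hypothetical helper, and the non-empty check is a local safeguard rather than a documented API rule.

```python
import json


def build_start_end_payload(start_image_url: str, end_image_url: str, prompt: str) -> dict:
    """Build the JSON body for a viduq1-start-end request.

    The first entry of `image_url` is the starting frame and the
    second is the ending frame, as described in Capability Description.
    """
    for name, url in (("start_image_url", start_image_url), ("end_image_url", end_image_url)):
        if not isinstance(url, str) or not url.strip():
            raise ValueError(f"{name} must be a non-empty string")
    return {
        "model": "viduq1-start-end",
        "image_url": [start_image_url, end_image_url],
        "prompt": prompt,
        "duration": 5,
        "size": "1920x1080",
        "movement_amplitude": "auto",
    }


def to_json(payload: dict) -> str:
    """Serialize the payload for use as an HTTP request body."""
    return json.dumps(payload)
```

Naming the arguments `start_image_url` and `end_image_url` makes the frame order explicit at the call site instead of relying on list position.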
