Vidu Q1 is the next-generation video generation model from Vidu, designed for high-quality video creation. It consistently outputs 5-second, 24-frame, 1080P video clips. Through advanced optimization of visual clarity, Vidu Q1 delivers significantly enhanced image quality with notable improvements in issues such as hand distortion and frame jitter.The model achieves photorealistic rendering that closely resembles real-world scenes, while maintaining stylistic accuracy in 2D animation. Transitions between the first and last frames are exceptionally smooth, making Vidu Q1 well-suited for demanding creative applications in film, advertising, and animated short productions.
Generate a video by providing a starting frame or both starting and ending frames along with corresponding text descriptions.
Start and End Frame
Support input of two images: the first uploaded image is treated as the starting frame, and the second as the ending frame. The model uses these images as input parameters to generate the video.
Text-to-Video Generation
Generate a video from a text prompt; currently supports both a general style and an anime style optimized for animation.
The URL link for the video generated by the model is valid for one day. Please save it as soon as possible if needed.
The model delivers a comprehensive upgrade in visual detail restoration.
2
Precise Resolution of Visual Artifacts
Movements are smooth and natural—hand gestures during product demonstrations in e-commerce livestreams are accurately rendered and compliant. Visual jitter is minimized through dynamic frame interpolation technology, ensuring fluid and stable footage even in motion-heavy scenes such as running shots or vehicle perspectives.
3
Multi-Style Artistic Expression
The realistic style aims for lifelike visuals—urban landscapes and character portraits in city promos are rendered with striking realism. The animated style focuses on authenticity, accurately capturing everything from the hand-drawn lines of Japanese anime to the saturated colors of Western cartoons. By inputting anime character designs, the model generates dynamic story segments that closely match the original IP’s visual style, boosting the efficiency of derivative content creation.
4
Industry-Leading Transition Smoothness
The start and end frame transition technology reaches a new level, using dynamic frame prediction and style fusion algorithms to overcome the limitations of “mechanical stitching” in video transitions.
curl --location --request POST 'https://api.z.ai/api/paas/v4/videos/generations' \--header 'Authorization: Bearer {your apikey}' \--header 'Content-Type: application/json' \--data-raw '{ "model": "viduq1-text", "style": "anime", "prompt": "Peter Rabbit drives a small car along the road, his face filled with joy and happiness.", "duration": 5, "aspect_ratio": "16:9", "size": "1920x1080", "movement_amplitude": "auto"}'
curl --location --request POST 'https://api.z.ai/api/paas/v4/videos/generations' \--header 'Authorization: Bearer {your apikey}' \--header 'Content-Type: application/json' \--data-raw '{ "model":"viduq1-image", "image_url":"https://example.com/path/to/your/image.jpg", "prompt":"Peter Rabbit drives a small car along the road, his face filled with joy and happiness.", "duration":5, "size":"1920x1080", "movement_amplitude":"auto"}'
curl --location --request POST 'https://api.z.ai/api/paas/v4/videos/generations' \--header 'Authorization: Bearer {your apikey}' \--header 'Content-Type: application/json' \--data-raw '{ "model":"viduq1-start-end", "image_url":["https://example.com/path/to/your/image.jpg","https://example.com/path/to/your/image1.jpg"], "prompt":"Peter Rabbit drives a small car along the road, his face filled with joy and happiness.", "duration":5, "size":"1920x1080", "movement_amplitude":"auto"}'