CogVideoX-2 is Z.AI’s next-generation large-scale video generation model, with a 38% improvement in text-to-video capabilities. It achieves significant optimizations in large-scale motion, frame stability, instruction compliance, artistic style, and visual aesthetics.
Price | Input Modality | Output Modality |
---|---|---|
$0.1 / video | Image/Text | Video |
Prompt | Video |
---|---|
Peter Rabbit (main subject) drives a small car (subject action), wandering along the road (environment description), with a joyful and delighted expression on his face (atmosphere setting). | |
A journey across the desert, a caravan of camels walks over golden sand dunes, the setting sun paints the sky red, creating a magnificent and tranquil scene. | |
Close-up shot (camera description), bathed in the soft light of dusk (lighting), a parrot stands on the balcony railing, with purple feathers and a pink beak (subject description), set against a backdrop of city skyscrapers (environment description). |
Prompt | Video |
---|---|
![]() | |
![]() | |
![]() a slice of pork curls up into a massive wave. A tiny figure bravely surfs on this “wave,” with the surfboard kicking up delicate splashes. |