Our state-of-the-art video generation model
Google DeepMind's state-of-the-art video generation model family, presented on the official Veo model page.
Public pricing is tiered. Explicitly stated rates for veo-3.1-generate-001 include $0.20/sec (text, 720p/1080p), $0.40/sec (text, 4K), $0.40/sec (audio, 720p/1080p), and $0.60/sec (audio, 4K).
Google's official Veo docs identify Veo as the Vertex AI video-generation model and direct readers to the Veo section of the official Vertex AI pricing page. The explicit public pricing I found for the current Veo 3.1 model (veo-3.1-generate-001) is tiered by prompt type and resolution; this finding records the standard text-prompt rate of $0.20 per second for 720p/1080p output. The same cited source also lists $0.40/sec for text prompts at 4K, $0.40/sec for audio prompts at 720p/1080p, and $0.60/sec for audio prompts at 4K.
A filmmaker shared a seven-step pipeline that uses Gemini for research, Nano Banana Pro for consistent scenes, Kling for image-to-video, Veo for speaking shots, and CapCut for finish. The sequence is useful if you want research, references, motion, and sound separated into controllable stages.
A creator claims Calico can turn listing photos into $15 renovation reels, alongside AI ad formats like fake podcast clips, styled product grids, and surreal brand posters. Use the approach when you need many low-cost variations built from one repeatable concept.