APIPod
HomeModelsPricingBlogDocs
APIPod

Enterprise-grade AI API aggregation platform. Access top global models in one place.

Product

  • Pricing
  • Models

Resources

  • Documentation
  • Changelog
  • Blog

Company

  • About Us
  • Contact

Legal

  • Privacy Policy
  • Terms of Service

© 2026 APIPod. All rights reserved.

Sitemap
/
/
  1. Home
  2. Models
  3. Veo 3.1
Back
Google

Veo 3.1

5 models available
Provider: Google
Updated: Recently
Google DeepMind's upgraded AI video model offers realistic motion generation, extended video duration, multi-image reference control, and synchronized native audio output, supporting 1080p image quality.

Available Models

Google

Veo 3.1 Quality 4K

veo3-1-quality-4k

Veo 3.1 Quality's 4K version.

Input type
Price
$1.00 / request
Context
4k
Output type
Video
Max Output
4k
Google

Veo 3.1 Fast 4K

veo3-1-fast-4k

Veo 3.1 Fast's 4K version

Input type
Price
$0.150 / request
Output type
Video
Google

VEO 3.1 Fast Reference

veo3-1-fast-ref

You can provide different reference images (up to 3) to shape character design, lighting style, or color tone, ensuring that the generated video maintains visual consistency in each shot.

Input type
Price
$0.070 / request
Output type
Video
Google

Veo 3.1 Quality

veo3-1-quality

Create high-quality, 8-second videos with sound using Gooles's state-of-the-art video generation model.

Input type
Price
$0.500 / request
Output type
Video
Google

Veo 3.1 Fast

veo3-1-fast

Veo 3.1 Fast for Generation transforms creative ideas into compelling video narratives using Google's advanced video generation model. Veo is capable of generating videos with audio from text prompts, or animating images with textual guidance.

Input type
Price
$0.070 / request
Output type
Video
Everything you need in AI Video

Veo 3.1 API:
Fidelity Refined.

The ultimate multimodal architecture for cinematic video, native audio, unmatched consistency.

View Documentation
8s+
Clip Duration
1080p
HD Resolution
24fps
Cinematic Framerate

Multimodal Prompt

A cinematic drone shot of a futuristic city with neon lights, cyberpunk style, 8k resolution...

Three Modes for Every Vision

Whether you need rapid prototyping or cinematic perfection, Veo 3.1 provides the right engine for the task.

Flagship

Quality Mode

The gold standard for AI video. Using advanced diffusion sampling, it produces HD clips with breathtaking textures and physical realism.

  • Cinematic Light & Shadows
  • 8s+ High Fidelity Renders
  • Temporal Stability
Efficient

Fast Mode

5x faster generation times. Designed for social media, live interactions, and rapid creative exploration without compromising the core architecture.

  • Near Real-time Previews
  • Low Latency API Calls
  • Optimized Tokens Usage
Control

Reference Engine

Unlock true visual consistency. Supply up to 3 images to anchor your characters, art styles, and environments across multiple clips.

  • 3-Image Anchor System
  • Character Persistence
  • Style Guide Matching
Audio Visual Harmony

Hear Your Imagination.

Veo 3.1 is the first model to truly master native audio. Whether it's the roar of an engine or the subtle rustle of leaves, our architecture generates synchronized soundscapes that breathe life into every frame.

Lip Sync

Natural dialogue synchronization with character motion.

Spatial Audio

Sound that follows the camera through 3D space.

Visual Continuity, Redefined.

Maintaining consistency in AI video used to be impossible. With Veo 3.1's Multi-Image Reference, you provide the ingredients—Character, Style, and Scene—and the model does the rest.

  • Character Persistence across different camera angles
  • Brand-fixed color palettes and lighting styles
  • Exact location referencing for episodic content
Char Ref
Style Ref
Scene Ref

Scale with the Veo 3.1 API

Integrate Veo 3.1 directly into your SaaS, game engine, or creative tool. Our API handles the heavy lifting while you build the future.

Clean REST API
Standardized Python, JS and Go SDKs
Real-time Polling
Efficient task management & queueing
Get API Access
# Initializing Veo 3.1 API
payload = {
  "model": "veo-3-1-quality", # or "veo-3-1-fast"
  "prompt": "Cinematic landscape, 8k",
  "image_urls": [
    "https://cdn.ai/char_1.jpg",
    "https://cdn.ai/style.jpg"
  ],
  "aspect_ratio": "16:9"
}

response = requests.post(
  "https://api.apipod.ai/v1/videos/generations",
  json=payload,
  headers={"Authorization": "Bearer Key"}
)

Questions & Answers

How many reference images can I upload?
Veo 3.1 supports up to 3 reference images in a single request. This allows you to define a character, an environment, and a specific lighting style all at once, ensuring perfect subject consistency.
Does Veo 3.1 generate audio for all videos?
Yes! Native audio generation is a core feature of the Veo 3.1 architecture. Every video generated includes a synchronized audio track that matches the scene's content perfectly.
What is the difference between Fast and Quality modes?
Quality mode prioritizes maximum visual fidelity and temporal stability (ideal for final renders). Fast mode is optimized for speed, delivering usable results up to 5x faster for prototyping.
Which aspect ratios are supported?
Veo 3.1 natively supports multiple aspect ratios: cinematic 16:9, vertical 9:16 (perfect for mobile content), square 1:1, and traditional 4:3.
Can I control the camera movement?
Absolutely. Veo 3.1 understands cinematic language. You can use terms like "Dolly Zoom", "Orbit", "Panning", or "Crane Shot" in your prompt, and the model will execute the camera motion with precision.
Is there a commercial license for APIpod generated content?
All content generated via our API is protected by our safety filters. For commercial ownership, it typically belongs to the account holder who generated the content. Please refer to our Terms of Service for full details.

Create the Next Masterpiece.

Veo 3.1 is now available for production workloads via the APIpod Platform.

Talk to us