Gemini Omni Flash Video Generator

A unified video engine covering four modes under one model: text-to-video, image-to-video, reference-to-video with up to four images, and video editing from an existing clip. Generates 16:9 or 9:16 clips from 3 to 10 seconds long.

Key Features

Four modes in one model: text-to-video, image-to-video, reference-to-video, and video editing

Reference-to-video supports up to four images for consistent characters or objects

Video editing takes a source clip and a prompt, with output length matching the source

16:9 or 9:16 aspect ratio

3 to 10 second durations for generation modes, defaulting to 8 seconds

Per-second pricing: 18 gold for text/image-to-video, 20 gold for reference-to-video or editing

Gemini Omni Flash Specifications

Model NameGemini Omni Flash
Input ModesText, Image, Multiple Reference Images (up to 4), Video Edit
Aspect Ratios16:9, 9:16
Durations3-10s (default 8s); edit mode inherits source duration
Pricing18 gold/s (text/image-to-video), 20 gold/s (reference-to-video, video edit)
AccessSubscriber

Gemini Omni Flash vs Kling O1

Gemini Omni Flash

  • 16:9 or 9:16 only, 3-10 second durations.
  • Up to 4 reference images in reference-to-video mode.
  • Per-second pricing: 18 gold (text/image-to-video), 20 gold (reference-to-video, edit).

Kling O1

  • 1:1, 16:9, or 9:16, with an optional end frame.
  • Up to 7 reference images (4 with a video reference).
  • 55 gold per 5 seconds, flat regardless of mode.