Gemini Flash
FamilyGoogle · Gemini
Google's fast Gemini line, now led by Gemini 3.5 Flash for stable high-speed multimodal and agentic workloads.
Overview
This is a model family overview. For version-specific details, see the individual model entries linked below.
Gemini Flash is Google’s speed-and-cost tier, designed for tasks where throughput, latency, and price matter alongside strong multimodal reasoning. The family now spans stable production models, preview fast-model experiments, Live/TTS variants, and adjacent browser-control or agent surfaces. The center of gravity moved in May 2026 with Gemini 3.5 Flash, Google’s stable fast frontier model for coding, multimodal understanding, and long-horizon agent workflows.
Current Latest
Gemini 3.5 Flash is the current stable fast/agentic route in the Gemini API. Google’s current catalog also keeps Gemini 3 Flash as a preview entry, Gemini 3.1 Flash-Lite as the efficient Flash-Lite route, and Gemini 3.1 Flash Live/TTS as adjacent audio surfaces. Older Gemini 2.5 Flash entries remain useful compatibility and cost baselines, but they are no longer the newest stable Flash lane.
Strengths
- Very fast inference for latency-sensitive applications
- Stable Gemini 3.5 Flash route for agentic coding and long-horizon workflows
- Competitive pricing relative to Pro tiers
- Full multimodal support across text, image, video, audio, and PDFs
- 1M-token context windows on stable Flash and Flash-Lite
- Flash-Lite variant for the most cost-sensitive workloads
- Preview-tier Gemini 3 Flash and 3.1 variants for teams tracking newer fast-model direction
When to Choose Gemini Flash
- High-volume processing where cost per request matters
- Real-time applications requiring low latency
- Bulk document analysis and extraction pipelines
- Development prototyping before escalating to Pro or managed-agent routes
- Applications where multimodal support is needed at scale
- Teams that want a stable 3.5 Flash production lane while evaluating preview variants
Access
- Google AI Studio
- Google Vertex AI
- Google Gemini consumer products
- Third-party integrations via API
Model Versions
Gemini 3.5 Flash
currentGoogle's stable Gemini 3.5 Flash model for fast frontier multimodal, coding, and long-horizon agent workflows.
Gemini 3.1 Flash TTS Preview
previewGoogle's expressive Gemini 3.1 Flash TTS preview, an API-first text-to-speech model spanning 70+ languages.
Gemini 3.1 Flash Live Preview
previewGoogle's low-latency Gemini 3.1 live model for realtime audio-to-audio and multimodal dialogue.
Gemini 3.1 Flash-Lite
previewGoogle's newer low-cost Gemini preview tier for high-throughput multimodal assistant and automation workloads.
Gemini 2.5 Flash Live Preview
previewGoogle's stable 2.5-era native-audio Live API model for realtime multimodal voice agents.
Gemini 3 Flash
previewGoogle's older Gemini 3 preview Flash route, now superseded by stable Gemini 3.5 Flash for most new fast-model work.
Gemini 2.5 Flash-Lite
currentStable budget Gemini 2.5 tier for large-scale assistant and automation workloads.
Gemini 2.5 Flash
currentStable Gemini 2.5 Flash route balancing multimodal capability, latency, and production cost.