Gemini 3 Flash
Google · Gemini 3
Google's older preview Flash model in the Gemini 3 line, now superseded by the stable Gemini 3.5 Flash for most new fast-model work.
Overview
Freshness note: Model capabilities, limits, and pricing can change quickly. This profile is a point-in-time snapshot last verified on May 24, 2026.
Gemini 3 Flash is Google’s older preview-tier fast model in the Gemini API catalog. It originally gave teams a way to evaluate the Gemini 3 fast-model direction above the stable 2.5 Flash line. As of Google’s May 2026 Gemini 3.5 launch, new production-oriented fast/agentic work should usually start with Gemini 3.5 Flash, which is listed as stable.
This entry remains useful for teams that already evaluated or integrated the preview model and need to understand how it fits in the current Flash family.
Capabilities
Gemini 3 Flash is meant for the same broad class of work that made Flash useful in the first place: responsive assistants, extraction flows, multimodal chat, and agentic workloads that need good price-performance rather than maximum reasoning depth.
The main difference between the two models today is lifecycle status. Gemini 3 Flash remains in preview, while Gemini 3.5 Flash is the newer stable model, positioned for sustained frontier performance on agentic and coding tasks.
Technical Details
Google’s current Gemini API model catalog lists Gemini 3 Flash with:
- 1,048,576 token context window
- 65,536 max output tokens
- multimodal input support across text, images, audio, video, and files
Because the same catalog lists Gemini 3 Flash as preview and Gemini 3.5 Flash as stable, Gemini 3 Flash is best treated as an evaluation or compatibility route rather than the default choice for new builds.
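The listed limits can be turned into a simple pre-flight check before a request is sent. The sketch below is illustrative only: `fits_limits` is a hypothetical helper, not part of any Google SDK, and it makes the conservative assumption that input and output tokens share the 1,048,576-token context window. Token counts are assumed to come from whatever counting method the team already uses.

```python
# Limits as listed in this profile; verify against the live catalog before relying on them.
CONTEXT_WINDOW_TOKENS = 1_048_576
MAX_OUTPUT_TOKENS = 65_536


def fits_limits(input_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if a request stays inside the listed limits.

    Assumption (not confirmed by the catalog text above): input and output
    tokens are counted together against the context window. If the window
    applies to input only, this check is stricter than necessary.
    """
    if input_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # The output cap applies regardless of how small the input is.
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    return input_tokens + requested_output_tokens <= CONTEXT_WINDOW_TOKENS
```

For example, a 1,000,000-token input with an 8,192-token output budget passes this check, while any request asking for more than 65,536 output tokens fails it whatever the input size.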
Pricing & Access
Google’s current Gemini API pricing still lists Gemini 3 Flash at:
- Input (text, image, or video): $0.50 per 1M tokens
- Input (audio): $1.00 per 1M tokens
- Output: $3.00 per 1M tokens
Audio input is billed at twice the rate of text, image, or video input. Access is through the Gemini API and through the Google AI Studio and Vertex AI surfaces where preview models are enabled.
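To make the rates above concrete, here is a minimal cost-estimation sketch. The function name `estimate_cost_usd` and the rate-table keys are hypothetical, chosen only for illustration. The rates are the ones listed in this profile and may have changed since it was verified. The sketch also assumes flat per-token billing with no caching, batch, or tier adjustments.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the list prices in this profile.
RATES_USD_PER_MILLION = {
    "standard_input": Decimal("0.50"),  # text, image, or video input
    "audio_input": Decimal("1.00"),
    "output": Decimal("3.00"),
}
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(
    standard_input_tokens: int,
    audio_input_tokens: int,
    output_tokens: int,
) -> Decimal:
    """Estimate the list-price cost of one request in USD.

    Assumption: flat per-token pricing with no discounts or surcharges.
    """
    usage = {
        "standard_input": standard_input_tokens,
        "audio_input": audio_input_tokens,
        "output": output_tokens,
    }
    total = Decimal(0)
    for kind, tokens in usage.items():
        if tokens < 0:
            raise ValueError(f"{kind} token count must be non-negative")
        total += Decimal(tokens) * RATES_USD_PER_MILLION[kind] / TOKENS_PER_MILLION
    return total


# 200k text tokens + 50k audio tokens in, 20k tokens out:
# 0.10 + 0.05 + 0.06 = 0.21 USD
print(estimate_cost_usd(200_000, 50_000, 20_000))
```

`Decimal` is used rather than floats so that small per-request amounts add up without rounding drift when many requests are summed.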
Best Use Cases
Use Gemini 3 Flash for compatibility with existing preview-model tests, regression comparisons against the newer Gemini 3.5 Flash route, and controlled evaluation where preview behavior is acceptable.
For most new high-speed Gemini model selection, start with Gemini 3.5 Flash instead.
Comparisons
- Gemini 3.5 Flash (Google): Newer stable fast/agentic route and the better default for new work.
- Gemini 2.5 Flash (Google): Older stable production route for compatibility and cost comparisons.
- Gemini 3.1 Flash-Lite (Google): More cost-focused preview variant with a lower quality ceiling on difficult tasks.
- GPT (OpenAI): An alternative model family with fast production options. The main tradeoff is usually platform strategy and multimodal tooling.