Gemini 2.5 Flash

Google · Gemini 2.5

Fast Gemini tier balancing multimodal capability, latency, and cost for production assistants.

Part of Gemini Flash family · Other versions: Gemini 2.5 Flash-Lite
Type
multimodal
Context
1M tokens
Max Output
66K tokens
Status
current
Input
$0.3/1M tok
Output
$2.5/1M tok
API Access
Yes
License
proprietary
fast multimodal cost-efficient assistant production
Released April 2025 · Updated March 6, 2026

Overview

Freshness note: Model capabilities, limits, and pricing can change quickly. This profile is a point-in-time snapshot last verified on February 15, 2026.

Gemini 2.5 Flash is Google’s best price-performance model for teams that need responsive output at scale. Google’s model docs position it as the balanced default for large-scale processing, low-latency tasks that still need thinking, and agentic use cases.

Capabilities

Flash handles summarization, extraction, content transformation, and many coding-adjacent tasks with strong latency characteristics. It is useful for applications where user experience depends on fast model response without dropping multimodal support or tool-use capability.

Technical Details

Google’s current model docs list Gemini 2.5 Flash with a 1,048,576 token input window and a 65,536 token output limit. It supports text, image, video, and audio inputs, plus code execution, file search, function calling, structured outputs, search grounding, and URL context.

Pricing & Access

Current Gemini API pricing lists Gemini 2.5 Flash at 0.30per1Mtext/image/videoinputtokensand0.30 per 1M text/image/video input tokens and 2.50 per 1M output tokens, with audio input priced higher. Access is available through Google AI Studio and Vertex AI.

Best Use Cases

Strong choice for customer support assistants, internal copilots, UI-driven chat tools, and automation tasks requiring fast response with good quality.

Comparisons

Compared with Gemini 2.5 Pro, Flash is cheaper and faster but less capable on the hardest reasoning tasks. Compared with GPT-5.3, both are strong production defaults with different ecosystem advantages. Compared with Claude Haiku 4.5, Flash usually offers broader multimodal and tool support at a different cost profile.