GPT-5 nano
OpenAI · GPT-5
Ultra-low-cost GPT-5 tier for high-throughput automation and lightweight reasoning tasks.
Overview
Freshness note: Model capabilities, limits, and pricing can change quickly. This profile is a point-in-time snapshot last verified on April 4, 2026.
GPT-5 nano is a budget-first GPT-5 family tier intended for large-scale operational workloads. It is useful when latency and per-request cost dominate, and tasks are structured enough to avoid needing deep multi-step reasoning. OpenAI still lists it as an active model, but its current docs now recommend GPT-5.4 nano for most new speed- and cost-sensitive builds.
Capabilities
The model works well for classification, routing, normalization, lightweight extraction, and short transformation tasks. It can also support simple coding and templated content tasks when instructions are precise.
Technical Details
GPT-5 nano sits at the high-throughput edge of the GPT-5 lineup with a 400K context window, 128K max output, image input, and support for common Responses API tools. Teams should design prompts and guardrails for determinism and brevity to maximize cost-performance.
Pricing & Access
Available through OpenAI API channels where GPT-5 variants are exposed. Published text-token pricing remains 0.40 per 1M output tokens. For new deployments, GPT-5 nano is best treated as the older ultra-cheap baseline that now sits below GPT-5.4 nano.
Best Use Cases
Ideal for preprocessing pipelines, event labeling, ticket triage, content normalization, and background automation where very high request volume is expected.
Comparisons
Compared with GPT-5.4 nano, GPT-5 nano is cheaper but less capable on current coding and tool-heavy workloads. Compared with GPT-5 mini, GPT-5 nano prioritizes cost and throughput over richer reasoning depth. Compared with Gemini 2.5 Flash-Lite, tradeoffs are primarily ecosystem, latency, and output-style preferences.