Models

DeepSeek V4 Pro

DeepSeek · DeepSeek V4

DeepSeek's stronger V4 route for million-token reasoning, agentic coding, and higher-end open-weight workloads.

language1M ctxOpen

GLM-5

Zhipu AI · GLM

Zhipu's GLM long-context model with strong coding ability and open-weight plus API access.

language200K ctxOpen

Apr 4, 2026

GLM-5.1

Z.ai · GLM

Z.ai's GLM-5.1, a 744B-parameter MoE open-weight model with strong autonomous coding and tool-use behavior.

language203K ctxOpen

GLM-5.2

Z.ai · GLM

Z.ai's 1M-context MIT-licensed open-weight flagship for coding and long-horizon agent work.

language1M ctxOpen

GPT-5

OpenAI · GPT-5

Original GPT-5 release entry, now superseded in OpenAI guidance by GPT-5.5 and newer GPT-5.4 generation variants.

GPT-5 mini

OpenAI · GPT-5

Older cost-efficient GPT-5 variant now mainly useful for compatibility after GPT-5.4 mini became the recommended fresh-build route.

GPT-5 nano

OpenAI · GPT-5

Older ultra-low-cost GPT-5 tier now mainly useful for compatibility after GPT-5.4 nano became the recommended fresh-build route.

GPT-5-Codex

OpenAI · GPT-5

Earlier GPT-5 Codex release entry kept as a historical baseline in OpenAI's Codex model lineage.

GPT-5.2

OpenAI · GPT-5

Previous GPT-5.2 frontier route now mainly useful for compatibility after GPT-5.5 became OpenAI's recommended fresh-build model.

GPT-5.2 Pro

OpenAI · GPT-5

Previous premium GPT-5.2 reasoning tier now mainly useful for compatibility after GPT-5.5 Pro became OpenAI's newer premium route.

GPT-5.3-Codex

OpenAI · GPT-5

API-published GPT-5.3 Codex route for pinned integrations, deprecated in Codex when using ChatGPT sign-in.

language400K ctx

GPT-5.4

OpenAI · GPT-5

Current GPT-5.4 premium API model for difficult professional work, tool use, and computer-assisted tasks.

GPT-5.4 mini

OpenAI · GPT-5

OpenAI's strongest mini model for coding, computer use, and fast high-volume agent workloads.

language400K ctx

GPT-5.4 nano

OpenAI · GPT-5

OpenAI's cheapest GPT-5.4 route for fast classification, extraction, and lightweight coding subagents.

language400K ctx

GPT-5.4 Pro

OpenAI · GPT-5

Premium GPT-5.4 escalation tier for decision-ready analysis and demanding professional workflows.

GPT-5.5

OpenAI · GPT-5

Still-available GPT-5.5 API route for pinned agentic coding, professional work, research, and migration baselines.

GPT-5.5 Pro

OpenAI · GPT-5

Still-available premium GPT-5.5 API route for pinned high-compute reasoning and long-running professional workflows.

GPT-5.6 Luna

OpenAI · GPT-5.6

OpenAI's lowest-cost GPT-5.6 model for high-volume agents, coding support, extraction, and efficient professional workflows.

GPT-5.6 Sol

OpenAI · GPT-5.6

OpenAI's flagship GPT-5.6 model for complex coding, professional work, science, design, and tool-heavy agents.

GPT-5.6 Terra

OpenAI · GPT-5.6

Balanced GPT-5.6 model for production coding, knowledge work, agents, and long-context workflows at half Sol's token price.

GPT-Rosalind

OpenAI · GPT

OpenAI's life-sciences research preview model for biology, drug discovery, and tool-heavy scientific workflows.

languageN/APreview

Apr 18, 2026

Kimi K2.5

Moonshot AI · Kimi

Moonshot's Kimi K2.5 is an open-weight long-context model focused on agentic reasoning and tool use.

language256K ctxOpen

Apr 4, 2026

MiniMax M2.5

MiniMax · MiniMax M

MiniMax's M2.5, a fast and inexpensive proprietary model for agentic coding, tool use, and high-volume production.

language200K ctx

May 1, 2026

MiniMax M2.7

MiniMax · MiniMax M

MiniMax's M2.7 agentic productivity model for coding, office workflows, tool use, and low-cost long-context execution.

language205K ctx

MiniMax-M2.5

MiniMax · MiniMax M

MiniMax's still-active M-series model for coding, tool use, and office-style agent workflows.

language205K ctx

Mistral Small 4

Mistral AI · Mistral Small

Mistral's new open-weight small model for efficient long-context assistants and coding support.

language256K ctxOpen

Apr 13, 2026

o3

OpenAI · o-series

Still-available OpenAI reasoning model for difficult analysis, retained as a legacy reference after GPT-5 became the default recommendation.

language200K ctxLegacy

o3-deep-research

OpenAI · o-series

OpenAI's highest-capability deep research model for long, source-heavy investigations over web and private data.

language200K ctx

Apr 8, 2026

o4-mini

OpenAI · o-series

Cost-efficient OpenAI reasoning model retained as a legacy API reference after GPT-5 mini became the newer default direction.

language200K ctxLegacy

language128K ctxOpenPreview

o4-mini-deep-research

OpenAI · o-series

Lower-cost OpenAI deep research model for source-heavy investigations when throughput and budget matter.

language200K ctx

Apr 8, 2026

OpenAI Privacy Filter

OpenAI · OpenAI Privacy Filter

OpenAI's Apache 2.0 open-weight model for local PII detection and redaction workflows.

Qwen 3.6 Max Preview

Alibaba · Qwen3

Alibaba's older Qwen 3.6 Max Preview for agentic coding, tool use, and 262K-context reasoning.

language262K ctxPreview

Qwen3-Max

Alibaba · Qwen3

Alibaba's older 262K-context Qwen API tier, scheduled to retire in favor of Qwen3.7 Max.

language262K ctx

Qwen3.5

Alibaba · Qwen

Alibaba's Qwen3.5 generation extends the Qwen line with stronger open-weight reasoning and coding performance.

language262K ctxOpen

Apr 4, 2026

Qwen3.7 Max

Alibaba · Qwen3

Alibaba's proprietary 1M-context Qwen flagship for reasoning, coding, productivity, and long-horizon agents.

Multimodal Models

Claude Fable 5

Anthropic · Claude 5

Anthropic's most capable widely released Claude model for long-running agents and high-stakes software work.

Claude Haiku 4.5

Anthropic · Claude 4

Fast and efficient Claude tier for latency-sensitive assistant and automation workloads.

multimodal200K ctx

Claude Mythos 5

Anthropic · Claude Mythos

Anthropic's limited-availability Mythos model for approved Project Glasswing defensive cybersecurity workflows.

Claude Mythos Preview

Anthropic · Claude Mythos

Anthropic's gated research-preview frontier model for defensive cybersecurity, autonomous coding, and long-running agents.

Claude Opus 4.6

Anthropic · Claude 4

Superseded Claude 4.6 snapshot for high-difficulty reasoning, coding, and long-running agent workflows.

Claude Opus 4.7

Anthropic · Claude 4

Anthropic's April 2026 premium Claude model, now superseded by Opus 4.8 but still useful for pinned deployments.

Claude Opus 4.8

Anthropic · Claude 4

Anthropic's May 2026 premium Claude model for long-horizon agentic coding, complex reasoning, and high-autonomy work.

Claude Sonnet 4.5

Anthropic · Claude 4

Previous balanced Claude Sonnet snapshot, now mainly useful for pinned deployments and historical comparison.

multimodal200K ctxLegacy

Claude Sonnet 4.6

Anthropic · Claude 4

Anthropic's previous balanced Sonnet model, retained for pinned deployments and behavior comparison.

Claude Sonnet 5

Anthropic · Claude 5

Anthropic's balanced Sonnet model for agentic coding, tool use, and production assistants.

Computer Use Preview

OpenAI · Computer Use

OpenAI's Responses API computer-use model for supervised browser and interface-control workflows.

multimodal8K ctxPreview

Gemini 2.5 Flash

Google · Gemini 2.5

Stable Gemini 2.5 Flash route balancing multimodal capability, latency, and production cost.

multimodal131K ctxPreview

Gemini 2.5 Flash Live Preview

Google · Gemini 2.5

Google's stable 2.5-era native-audio Live API model for realtime multimodal voice agents.

Gemini 2.5 Flash-Lite

Google · Gemini 2.5

Stable budget Gemini 2.5 tier for large-scale assistant and automation workloads.

Gemini 3 Flash

Google · Gemini 3

Google's older Gemini 3 preview Flash route, now superseded by stable Gemini 3.5 Flash for most new fast-model work.

multimodal131K ctxPreview

Gemini 3.1 Flash Live Preview

Google · Gemini 3.1

Google's low-latency Gemini 3.1 live model for realtime audio-to-audio and multimodal dialogue.

Gemini 3.1 Flash-Lite

Google · Gemini 3.1

Google's stable low-cost Gemini 3.1 Flash-Lite tier for high-throughput multimodal assistant and automation workloads.

Gemini 3.1 Pro Preview

Google · Gemini 3.1

Google's current premium Gemini 3.1 preview model for multimodal reasoning, coding, long-context analysis, and agentic workflows.

Gemini 3.5 Flash

Google · Gemini 3.5

Google's stable Gemini 3.5 Flash model for fast frontier multimodal, coding, and long-horizon agent workflows.

multimodal131K ctxPreview

Gemini Robotics-ER 1.6

Google · Gemini Robotics

Google DeepMind's robotics-tuned Gemini for embodied reasoning, spatial planning, and physical agent tasks.

GPT-4.1

OpenAI · GPT-4.1

Long-context multimodal model retained as a legacy reference after retirement from ChatGPT defaults.

GPT-4o

OpenAI · GPT-4o

Widely deployed multimodal model kept as a legacy reference after retirement from ChatGPT defaults.

multimodal128K ctxLegacy

GPT-4o mini

OpenAI · GPT-4o

Lower-cost GPT-4o API tier for high-volume text-plus-image assistant and automation workloads.

multimodal128K ctx

Grok 4.20

xAI · Grok

xAI's 1M-context multimodal preview lane for research and multi-agent experiments alongside the Grok 4.5 flagship.

Grok 4.3

xAI · Grok

xAI's lower-cost 1M-context route for general reasoning and agentic work, now sitting behind the Grok 4.5 flagship.

Grok 4.5

xAI · Grok

xAI's current flagship for coding, agentic knowledge work, and tool-using reasoning, with a 500K context window.

multimodal500K ctx

multimodal256K ctxPreview

Grok Build 0.1

xAI · Grok

xAI's lower-cost public-beta coding API model for fast agentic development, now separate from Grok Build's Grok 4.5 default.

Kimi K2.6

Moonshot AI · Kimi

Moonshot AI's Kimi K2.6, a 1T-parameter MoE open-weight model for long-horizon coding and agentic workflows.

multimodal262K ctxOpen

Kimi K2.7 Code

Moonshot AI · Kimi

Moonshot AI's thinking-only multimodal open-weight model for long-horizon coding agents and tool use.

multimodal262K ctxOpen

Llama 4 Maverick

Meta · Llama 4

Meta's larger open-weight Llama 4 MoE model for multimodal assistants and controlled deployments.

multimodal1M ctxOpen

Llama 4 Scout

Meta · Llama 4

Meta's efficiency-focused Llama 4 MoE model with a headline 10M-token context window.

multimodal10M ctxOpen

MiniMax M3

MiniMax · MiniMax M

MiniMax's 1M-context native multimodal open-weight model for coding and long-horizon agents.

Mistral Large 3

Mistral AI · Mistral Large

Mistral's flagship open-weight European multimodal model with long context and competitive enterprise API economics.

multimodal256K ctxOpen

Jun 20, 2026

Mistral Medium 3.5

Mistral AI · Mistral Medium

Mistral's dense 128B Medium 3.5, a frontier-class multimodal model unifying chat, reasoning, and coding behavior.

multimodal256K ctxOpen

Jun 20, 2026

Mistral OCR 4

Mistral AI · Mistral OCR

Mistral's document model for structured extraction, bounding boxes, and confidence-aware pipelines.

multimodalN/A

Muse Spark

Meta · Muse

Meta's multimodal reasoning model powering Meta AI, with tool use, parallel-agent thinking, voice, and Muse Image integration.

multimodalN/APreview

Qwen3.6-27B

Alibaba · Qwen3

Alibaba's open-weight 27B Qwen 3.6 model for agentic coding, vision-language work, and self-hosted deployment.

multimodal262K ctxOpen

Image Models

GPT Image 1.5

OpenAI · GPT Image

Previous premium GPT Image tier retained for compatibility after GPT Image 2 became the current high-quality route.

GPT Image 2

OpenAI · GPT Image

OpenAI's current state-of-the-art GPT Image model behind ChatGPT Images 2.0 and API image generation.

gpt-image-1-mini

OpenAI · GPT Image

Lower-cost GPT Image tier for product teams that need image generation at higher volume.

grok-imagine-image

xAI · Grok Imagine

xAI's standard Grok Imagine image model for API image generation and editing workflows.

grok-imagine-image-1212

xAI · Grok Imagine

Older Grok Imagine image-generation model ID retained as a legacy reference for pre-refresh integrations.

imageN/ALegacy

grok-imagine-image-quality

xAI · Grok Imagine

xAI's Grok Imagine Quality Mode image model for higher-realism, stronger text rendering, and brand-controlled visuals.

Muse Image

Meta · Muse

Meta's agentic image model for generation, editing, multi-reference composition, and tool-assisted refinement.

Nano Banana 2

Google · Nano Banana

Google's stable Gemini 3.1 Flash Image workhorse for efficient image generation and editing.

Nano Banana 2 Lite

Google · Nano Banana

Google's fastest and cheapest Gemini 3.1 Flash image route for high-volume 1K generation.

Nano Banana Pro

Google · Nano Banana

Google's stable Gemini 3 Pro Image model for professional visual assets and grounded image generation.

Video Models

Gemini Omni Flash

Google · Gemini Omni

Google's preview Gemini Omni API model for multimodal video generation and conversational video editing.

videoN/APreview

grok-imagine-video

xAI · Grok Imagine

xAI's Grok Imagine video model for text-to-video, image-to-video, video editing, and short creative clips.

videoN/A

grok-imagine-video-1212

xAI · Grok Imagine

Older Grok Imagine video-generation model ID retained as a legacy reference for pre-refresh integrations.

videoN/ALegacy

Runway Gen-4.5

Runway · Runway Gen

Runway's current flagship video model for text-to-video and image-to-video creative generation.

videoN/A

Veo 3.1

Google · Veo

Google's active Veo 3.1 preview family for video generation with native audio and stronger control.

videoN/APreview

Audio Models

Eleven v3

ElevenLabs · Eleven

ElevenLabs' generally available expressive text-to-speech model for premium voice and dialogue output.

audioN/A

Gemini 2.5 Pro TTS Preview

Google · Gemini 2.5

Google's 2.5 Pro TTS preview tier for natural, steerable one-way speech generation.

audioN/APreview

Gemini 3.1 Flash TTS Preview

Google · Gemini 3.1

Google's expressive Gemini 3.1 Flash TTS preview, an API-first text-to-speech model spanning 70+ languages.

audio8K ctxPreview

Gemini 3.5 Live Translate

Google · Gemini 3.5

Google's realtime speech-to-speech translation model for the Gemini Live API.

audio131K ctxPreview

GPT-4o mini Transcribe

OpenAI · GPT-4o Audio

Lower-cost OpenAI speech-to-text tier for high-volume transcription pipelines.

GPT-4o mini TTS

OpenAI · GPT-4o Audio

OpenAI text-to-speech model for responsive, API-first voice output workflows.

audio2K ctx

GPT-4o Transcribe

OpenAI · GPT-4o Audio

OpenAI speech-to-text model tier for production transcription and voice pipeline workflows.

GPT-realtime-1.5

OpenAI · GPT Realtime

OpenAI's earlier realtime voice model for audio-in, audio-out agents, now superseded by GPT-Realtime-2 for new flagship voice work.

audio32K ctxLegacy

GPT-Realtime-2

OpenAI · GPT Realtime

OpenAI's GPT-5-class realtime voice model for reasoning, tool-using speech agents, and live support workflows.

audio128K ctx

GPT-Realtime-Translate

OpenAI · GPT Realtime

OpenAI's realtime speech-to-speech translation model for live multilingual audio experiences.

GPT-Realtime-Whisper

OpenAI · GPT Realtime

OpenAI's streaming speech-to-text model for low-latency realtime transcription.

Lyria 2

Google · Lyria

Earlier Google music generation route retained as reference while newer Lyria 3 surfaces take the spotlight.

audioN/ALegacy

Scribe v2

ElevenLabs · Scribe

ElevenLabs' current speech-to-text model line for accurate batch transcription and low-latency realtime STT.

audioN/A

Embedding Models

Gemini Embedding 2

Google · Gemini Embedding

Google's first multimodal Gemini embedding model, mapping text, images, audio, video, and PDFs into a unified vector space.

embedding8K ctx