← AI Hub

Models

Technical model profiles and strategy explainers — capabilities, deployment tradeoffs, and practical fit guidance.

AI model pages are point-in-time snapshots based on each page's last verified date. Current and preview entries are refreshed on the active maintenance cadence, while legacy and deprecated entries remain browseable as historical context.

Filters available

Filter by type, provider, status, and open-source availability. Deprecated entries stay hidden unless enabled.

Model Strategy Explainers

Constraint-led guidance for open-weight and proprietary choices across local, private, and managed deployment paths.

Model Families

Stable overviews of major model product lines. Use these as durable reference points.

Language Models

DeepSeek V4 Flash

DeepSeek · DeepSeek V4

DeepSeek's low-cost V4 API route for 1M-context production assistants, agents, and compatibility migrations.

language1M ctxOpen
May 16, 2026

DeepSeek V4 Pro

DeepSeek · DeepSeek V4

DeepSeek's stronger V4 route for million-token reasoning, agentic coding, and higher-end open-weight workloads.

language1M ctxOpen
May 16, 2026

Devstral 2

Mistral AI · Devstral

Mistral's open-weight coding model built for agentic software work and terminal-first execution.

language256K ctxOpen
Apr 13, 2026

GLM-5

Zhipu AI · GLM

Zhipu's GLM long-context model with strong coding ability and open-weight plus API access.

language200K ctxOpen
Apr 4, 2026

GLM-5.1

Z.ai · GLM

Z.ai's GLM-5.1, a 744B-parameter MoE open-weight model with strong autonomous coding and tool-use behavior.

language203K ctxOpen
May 1, 2026

GPT-5

OpenAI · GPT-5

Original GPT-5 release entry, now superseded by newer GPT-5.3 and GPT-5.4 generation variants.

language400K ctxLegacy
Mar 27, 2026

GPT-5 mini

OpenAI · GPT-5

Cost-efficient GPT-5 variant for high-volume production workflows needing strong reasoning at lower cost.

language400K ctx
Apr 4, 2026

GPT-5 nano

OpenAI · GPT-5

Ultra-low-cost GPT-5 tier for high-throughput automation and lightweight reasoning tasks.

language400K ctx
Apr 4, 2026

GPT-5-Codex

OpenAI · GPT-5

Earlier GPT-5 Codex release entry kept as a historical baseline in OpenAI's Codex model lineage.

language400K ctxLegacy
Apr 18, 2026

GPT-5.2

OpenAI · GPT-5

Current GPT-5 family flagship in OpenAI's API guide for coding, agentic, and general professional work.

language400K ctx
Mar 27, 2026

GPT-5.2-Codex

OpenAI · GPT-5

Current GPT-5.2 coding model for long-horizon software engineering and agentic repository work.

language400K ctx
Apr 18, 2026

GPT-5.2-Pro

OpenAI · GPT-5

Current premium GPT-5.2 tier for higher-precision API work when standard GPT-5.2 is not enough.

language400K ctx
Mar 27, 2026

GPT-5.3

OpenAI · GPT-5

Current GPT-5.3 Instant / Chat route for everyday ChatGPT work and API chat-style testing.

language128K ctx
Mar 27, 2026

GPT-5.3-Codex

OpenAI · GPT-5

Specialized GPT-5.3 Codex model for long-horizon agentic software engineering.

language400K ctx
Apr 18, 2026

GPT-5.4

OpenAI · GPT-5

API-ready GPT-5 premium model for difficult professional work, tool use, and computer-assisted tasks.

language1M ctx
Apr 26, 2026

GPT-5.4 mini

OpenAI · GPT-5

OpenAI's strongest mini model for coding, computer use, and fast high-volume agent workloads.

language400K ctx
Apr 4, 2026

GPT-5.4 nano

OpenAI · GPT-5

OpenAI's cheapest GPT-5.4 route for fast classification, extraction, and lightweight coding subagents.

language400K ctx
Apr 4, 2026

GPT-5.4-Pro

OpenAI · GPT-5

API-ready premium GPT-5 escalation tier for decision-ready analysis and demanding professional workflows.

language1M ctx
Apr 26, 2026

GPT-5.5

OpenAI · GPT-5

OpenAI's newest GPT-5.5 route for agentic coding, professional work, research, and computer-use tasks.

language1M ctx
Apr 26, 2026

GPT-5.5 Pro

OpenAI · GPT-5

Premium GPT-5.5 route for the hardest ChatGPT reasoning, research, and professional workflows.

language1M ctx
Apr 26, 2026

GPT-Rosalind

OpenAI · GPT

OpenAI's life-sciences research preview model for biology, drug discovery, and tool-heavy scientific workflows.

languageN/APreview
Apr 18, 2026

Grok 4.20

xAI · Grok

xAI's 2M-context preview lane for enterprise research and multi-agent experiments, while Grok 4.3 remains the default API caller.

language2M ctxPreview
May 8, 2026

Kimi K2.5

Moonshot AI · Kimi

Moonshot's Kimi K2.5 is an open-weight long-context model focused on agentic reasoning and tool use.

language256K ctxOpen
Apr 4, 2026

MiniMax M2.5

MiniMax · MiniMax M

MiniMax's M2.5, a fast and inexpensive proprietary model for agentic coding, tool use, and high-volume production.

language200K ctx
May 1, 2026

MiniMax M2.7

MiniMax · MiniMax M

MiniMax's M2.7 agentic productivity model for coding, office workflows, tool use, and low-cost long-context execution.

language205K ctx
May 16, 2026

MiniMax-M2.5

MiniMax · MiniMax M

MiniMax's still-active M-series model for coding, tool use, and office-style agent workflows.

language205K ctx
May 16, 2026

Mistral Small 3.2

Mistral AI · Mistral Small

Mistral's open 24B model balancing strong instruction quality with low API cost for production assistants.

language128K ctxOpen
Apr 4, 2026

Mistral Small 4

Mistral AI · Mistral Small

Mistral's new open-weight small model for efficient long-context assistants and coding support.

language256K ctxOpen
Apr 13, 2026

o3

OpenAI · o-series

Still-available OpenAI reasoning model for difficult analysis, retained as a legacy reference after GPT-5 became the default recommendation.

language200K ctxLegacy
Mar 27, 2026

o3-deep-research

OpenAI · o-series

OpenAI's highest-capability deep research model for long, source-heavy investigations over web and private data.

language200K ctx
Apr 8, 2026

o4-mini

OpenAI · o-series

Cost-efficient OpenAI reasoning model retained as a legacy API reference after GPT-5 mini became the newer default direction.

language200K ctxLegacy
Mar 27, 2026

o4-mini-deep-research

OpenAI · o-series

Lower-cost OpenAI deep research model for source-heavy investigations when throughput and budget matter.

language200K ctx
Apr 8, 2026

OpenAI Privacy Filter

OpenAI · OpenAI Privacy Filter

OpenAI's Apache 2.0 open-weight model for local PII detection and redaction workflows.

language128K ctxOpenPreview
Apr 24, 2026

Qwen 3.6 Max Preview

Alibaba · Qwen3

Alibaba's flagship Qwen 3.6 Max Preview, a sparse MoE model for agentic coding, tool use, and long-context reasoning.

language262K ctxPreview
May 1, 2026

Qwen3-Max

Alibaba · Qwen3

Alibaba's top Qwen API model for high-end multilingual reasoning, coding, and enterprise assistant workloads.

language262K ctx
Apr 4, 2026

Qwen3.5

Alibaba · Qwen

Alibaba's Qwen3.5 generation extends the Qwen line with stronger open-weight reasoning and coding performance.

language262K ctxOpen
Apr 4, 2026

Multimodal Models

Claude Haiku 4.5

Anthropic · Claude 4

Fast and efficient Claude tier for latency-sensitive assistant and automation workloads.

multimodal200K ctx
Apr 18, 2026

Claude Mythos Preview

Anthropic · Claude Mythos

Anthropic's gated research-preview frontier model for defensive cybersecurity, autonomous coding, and long-running agents.

multimodal1M ctxPreview
Apr 8, 2026

Claude Opus 4.6

Anthropic · Claude 4

Superseded Claude 4.6 snapshot for high-difficulty reasoning, coding, and long-running agent workflows.

multimodal200K ctxLegacy
Apr 18, 2026

Claude Opus 4.7

Anthropic · Claude 4

Anthropic's April 2026 premium Claude model, now superseded by Opus 4.8 but still useful for pinned deployments.

multimodal1M ctxLegacy
May 29, 2026

Claude Opus 4.8

Anthropic · Claude 4

Anthropic's May 2026 premium Claude model for long-horizon agentic coding, complex reasoning, and high-autonomy work.

multimodal1M ctx
May 29, 2026

Claude Sonnet 4.5

Anthropic · Claude 4

Balanced Claude tier for production reasoning, coding, and long-context assistant workflows.

multimodal200K ctx
Mar 27, 2026

Claude Sonnet 4.6

Anthropic · Claude 4

Anthropic's balanced Claude model — strong reasoning and coding at moderate pricing, the default recommendation for most tasks.

multimodal200K ctx
Apr 18, 2026

Computer Use Preview

Google · Gemini Computer Use

Google's preview computer-use model surface for browser and interface control workflows.

multimodal128K ctxPreview
Apr 4, 2026

Gemini 2.5 Flash

Google · Gemini 2.5

Stable Gemini 2.5 Flash route balancing multimodal capability, latency, and production cost.

multimodal1M ctx
May 16, 2026

Gemini 2.5 Flash Live Preview

Google · Gemini 2.5

Google's stable 2.5-era native-audio Live API model for realtime multimodal voice agents.

multimodal131K ctxPreview
May 16, 2026

Gemini 2.5 Flash-Lite

Google · Gemini 2.5

Stable budget Gemini 2.5 tier for large-scale assistant and automation workloads.

multimodal1M ctx
May 16, 2026

Gemini 2.5 Pro

Google · Gemini 2.5

Stable high-capability Gemini 2.5 tier for long-context multimodal reasoning and enterprise workflows.

multimodal1M ctx
May 16, 2026

Gemini 3 Flash

Google · Gemini 3

Google's older Gemini 3 preview Flash route, now superseded by stable Gemini 3.5 Flash for most new fast-model work.

multimodal1M ctxPreview
May 24, 2026

Gemini 3.1 Flash Live Preview

Google · Gemini 3.1

Google's low-latency Gemini 3.1 live model for realtime audio-to-audio and multimodal dialogue.

multimodal131K ctxPreview
Apr 13, 2026

Gemini 3.1 Flash-Lite

Google · Gemini 3.1

Google's newer low-cost Gemini preview tier for high-throughput multimodal assistant and automation workloads.

multimodal1M ctxPreview
Apr 4, 2026

Gemini 3.1 Pro Preview

Google · Gemini 3.1

Google's current premium Gemini 3.1 preview model for multimodal reasoning, coding, long-context analysis, and agentic workflows.

multimodal1M ctxPreview
May 16, 2026

Gemini 3.5 Flash

Google · Gemini 3.5

Google's stable Gemini 3.5 Flash model for fast frontier multimodal, coding, and long-horizon agent workflows.

multimodal1M ctx
May 24, 2026

Gemini Robotics-ER 1.6

Google · Gemini Robotics

Google DeepMind's robotics-tuned Gemini for embodied reasoning, spatial planning, and physical agent tasks.

multimodal1M ctxPreview
May 1, 2026

GPT-4.1

OpenAI · GPT-4.1

Long-context multimodal model retained as a legacy reference after retirement from ChatGPT defaults.

multimodal1M ctxLegacy
Mar 27, 2026

GPT-4o

OpenAI · GPT-4o

Widely deployed multimodal model kept as a legacy reference after retirement from ChatGPT defaults.

multimodal128K ctxLegacy
Mar 27, 2026

GPT-4o mini

OpenAI · GPT-4o

Lower-cost GPT-4o API tier for high-volume text-plus-image assistant and automation workloads.

multimodal128K ctx
May 16, 2026

Grok 4.3

xAI · Grok

xAI's recommended primary Grok caller and post-retirement redirect target for reasoning, non-reasoning, coding, and long-context agent work.

multimodal1M ctx
May 16, 2026

Kimi K2.6

Moonshot AI · Kimi

Moonshot AI's Kimi K2.6, a 1T-parameter MoE open-weight model for long-horizon coding and agentic workflows.

multimodal262K ctxOpen
May 1, 2026

Llama 4 Maverick

Meta · Llama 4

Meta's larger open-weight Llama 4 MoE model for multimodal assistants and controlled deployments.

multimodal1M ctxOpen
May 16, 2026

Llama 4 Scout

Meta · Llama 4

Meta's efficiency-focused Llama 4 MoE model with a headline 10M-token context window.

multimodal10M ctxOpen
May 16, 2026

Mistral Large 3

Mistral AI · Mistral Large

Mistral's flagship open-weight European multimodal model with long context and competitive enterprise API economics.

multimodal256K ctxOpen
Apr 4, 2026

Mistral Medium 3.5

Mistral AI · Mistral Medium

Mistral's dense 128B Medium 3.5, a frontier-class multimodal model unifying chat, reasoning, and coding behavior.

multimodal256K ctxOpen
May 1, 2026

Muse Spark

Meta · Muse

Meta's first Muse-family model for Meta AI, combining multimodal reasoning, tool use, and parallel-agent test-time thinking.

multimodalN/APreview
Apr 8, 2026

Qwen3.6-27B

Alibaba · Qwen3

Alibaba's open-weight 27B Qwen 3.6 model for agentic coding, vision-language work, and self-hosted deployment.

multimodal262K ctxOpen
May 16, 2026

Image Models

GPT Image 1.5

OpenAI · GPT Image

OpenAI's previous premium GPT Image tier for higher-fidelity generation and iterative editing workflows.

imageN/A
Apr 24, 2026

GPT Image 2

OpenAI · GPT Image

OpenAI's current state-of-the-art GPT Image model behind ChatGPT Images 2.0 and API image generation.

imageN/A
Apr 24, 2026

GPT-Image-1

OpenAI · GPT Image

OpenAI image generation model for prompt-driven creation and iterative editing workflows.

imageN/A
Apr 4, 2026

gpt-image-1-mini

OpenAI · GPT Image

Lower-cost GPT Image tier for product teams that need image generation at higher volume.

imageN/A
Apr 4, 2026

grok-imagine-image

xAI · Grok Imagine

xAI's standard Grok Imagine image model for API image generation and editing workflows.

imageN/A
May 16, 2026

grok-imagine-image-1212

xAI · Grok Imagine

Older Grok Imagine image-generation model ID retained as a legacy reference for pre-refresh integrations.

imageN/ALegacy
May 8, 2026

grok-imagine-image-quality

xAI · Grok Imagine

xAI's Grok Imagine Quality Mode image model for higher-realism, stronger text rendering, and brand-controlled visuals.

imageN/A
May 8, 2026

Imagen 4

Google · Imagen

Google's current Imagen 4 image generation tier for API-backed visual creation and design workflows.

imageN/APreview
Mar 27, 2026

Imagen 4 Fast

Google · Imagen

Google's lower-latency Imagen 4 tier for faster image generation in Gemini API workflows.

imageN/A
Apr 4, 2026

Nano Banana 2

Google · Nano Banana

Google's latest fast image-generation release combining higher fidelity, stronger reasoning, and Flash-speed iteration.

imageN/APreview
May 16, 2026

Nano Banana Pro

Google · Nano Banana

Google's Gemini 3 Pro Image preview model for professional visual assets, grounded image generation, and high-fidelity text rendering.

imageN/APreview
May 16, 2026

Video Models

Audio Models

Eleven v3

ElevenLabs · Eleven

ElevenLabs' generally available expressive text-to-speech model for premium voice and dialogue output.

audioN/A
May 16, 2026

Gemini 2.5 Pro TTS Preview

Google · Gemini 2.5

Google's 2.5 Pro TTS preview tier for natural, steerable one-way speech generation.

audioN/APreview
May 16, 2026

Gemini 3.1 Flash TTS Preview

Google · Gemini 3.1

Google's expressive Gemini 3.1 Flash TTS preview, an API-first text-to-speech model spanning 70+ languages.

audio32K ctxPreview
May 1, 2026

GPT-4o mini Transcribe

OpenAI · GPT-4o Audio

Lower-cost OpenAI speech-to-text tier for high-volume transcription pipelines.

audio16K ctx
May 16, 2026

GPT-4o mini TTS

OpenAI · GPT-4o Audio

OpenAI text-to-speech model for responsive, API-first voice output workflows.

audio2K ctx
May 16, 2026

GPT-4o Transcribe

OpenAI · GPT-4o Audio

OpenAI speech-to-text model tier for production transcription and voice pipeline workflows.

audio16K ctx
May 16, 2026

GPT-realtime-1.5

OpenAI · GPT Realtime

OpenAI's earlier realtime voice model for audio-in, audio-out agents, now superseded by GPT-Realtime-2 for new flagship voice work.

audio32K ctxLegacy
May 8, 2026

GPT-Realtime-2

OpenAI · GPT Realtime

OpenAI's GPT-5-class realtime voice model for reasoning, tool-using speech agents, and live support workflows.

audio128K ctx
May 8, 2026

GPT-Realtime-Translate

OpenAI · GPT Realtime

OpenAI's realtime speech-to-speech translation model for live multilingual audio experiences.

audio16K ctx
May 8, 2026

GPT-Realtime-Whisper

OpenAI · GPT Realtime

OpenAI's streaming speech-to-text model for low-latency realtime transcription.

audio16K ctx
May 8, 2026

Lyria 2

Google · Lyria

Earlier Google music generation route retained as reference while newer Lyria 3 surfaces take the spotlight.

audioN/ALegacy
Mar 27, 2026

Embedding Models