Claude Haiku
FamilyAnthropic · Claude
Anthropic's fastest Claude line for latency-sensitive, high-volume, and cost-constrained workloads.
fast efficient coding vision tool-use cost-effective model-family
Updated April 18, 2026
Overview
This is a model family overview. For version-specific details, see the individual model entries linked below.
Claude Haiku is the speed-and-cost tier in Anthropic’s lineup. It sacrifices some capability depth compared with Sonnet and Opus but delivers significantly faster responses at a fraction of the price. Haiku is the right choice when throughput, latency, or cost matter more than peak reasoning quality.
Current Latest
Claude Haiku 4.5 remains the efficiency-oriented current reference and the current public Haiku model on Anthropic’s model page.
Strengths
- Very fast response times, well suited for real-time applications
- Lowest per-token cost in the Claude family
- Strong coding and computer-use performance for its size and price
- Multimodal support with tool use and structured-output workflows
- Good fit for subagent or parallelized high-volume work
When to Choose Haiku
Choose Haiku when speed or cost is the primary concern:
- High-volume classification, extraction, and routing tasks
- Real-time chat applications where latency matters most
- Coding subagents and multi-agent orchestration where cost matters
- Bulk content processing pipelines
- Cost-sensitive applications that can tolerate occasional quality drops on hard tasks
Access
- Anthropic API as
claude-haiku-4-5 - AWS Bedrock
- Google Vertex AI
- Microsoft Foundry
- Claude.ai on web, iOS, and Android
- Claude products and Claude Code