Claude Haiku

Family

Anthropic · Claude

Anthropic's fastest Claude line for latency-sensitive, high-volume, and cost-constrained workloads.

fast efficient coding vision tool-use cost-effective model-family
Updated April 18, 2026

Overview

This is a model family overview. For version-specific details, see the individual model entries linked below.

Claude Haiku is the speed-and-cost tier in Anthropic’s lineup. It sacrifices some capability depth compared with Sonnet and Opus but delivers significantly faster responses at a fraction of the price. Haiku is the right choice when throughput, latency, or cost matter more than peak reasoning quality.

Current Latest

Claude Haiku 4.5 remains the efficiency-oriented current reference and the current public Haiku model on Anthropic’s model page.

Strengths

  • Very fast response times, well suited for real-time applications
  • Lowest per-token cost in the Claude family
  • Strong coding and computer-use performance for its size and price
  • Multimodal support with tool use and structured-output workflows
  • Good fit for subagent or parallelized high-volume work

When to Choose Haiku

Choose Haiku when speed or cost is the primary concern:

  • High-volume classification, extraction, and routing tasks
  • Real-time chat applications where latency matters most
  • Coding subagents and multi-agent orchestration where cost matters
  • Bulk content processing pipelines
  • Cost-sensitive applications that can tolerate occasional quality drops on hard tasks

Access

  • Anthropic API as claude-haiku-4-5
  • AWS Bedrock
  • Google Vertex AI
  • Microsoft Foundry
  • Claude.ai on web, iOS, and Android
  • Claude products and Claude Code