Claude Haiku 4.5

Claude Haiku 4.5 matches Claude Sonnet 4's performance on coding, computer use, and agent tasks at substantially lower cost and faster speeds, making it a high-capability model in the Haiku line for high-throughput agentic deployments.

Explicit Caching, File Input, Reasoning, Tool Use, Vision (Image), Web Search
index.ts
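import { streamText } from 'ai'
const result = streamText({
  model: 'anthropic/claude-haiku-4.5',
  prompt: 'Why is the sky blue?'
})
// Print the response as it streams in.
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}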

Playground

Try out Claude Haiku 4.5 by Anthropic. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

Ask Claude Haiku 4.5 anything to try it out.

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

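Anthropic
  • Context: 200K
  • Latency: 0.5s
  • Throughput: 98tps
  • Input: $1.00/M
  • Output: $5.00/M
  • Cache: Read $0.1/M, Write $1.25/M
  • Web Search: $10.00/K + input costs
  • Per Query: none listed
  • Capabilities: +4
  • Release Date: 10/15/2025

Amazon Bedrock
  • Context: 200K
  • Latency: 0.8s
  • Throughput: 111tps
  • Input: $1.00/M
  • Output: $5.00/M
  • Cache: Read $0.1/M, Write $1.25/M
  • Web Search: none listed
  • Per Query: none listed
  • Capabilities: +4
  • Release Date: 10/15/2025

Google Vertex AI
  • Context: 200K
  • Latency: 0.9s
  • Throughput: 100tps
  • Input: $1.00/M
  • Output: $5.00/M
  • Cache: Read $0.1/M, Write $1.25/M
  • Web Search: $10.00/K + input costs
  • Per Query: none listed
  • Capabilities: +4
  • Release Date: 10/15/2025
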
Throughput

P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.

Latency

P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.

Uptime

Direct request success rate on AI Gateway and per-provider. Visit the docs for more info.

More models by Anthropic

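Released 05/28/2026
  • Context: 1M
  • Latency: 0.8s
  • Throughput: 111tps
  • Input: $5.00/M (Fast: $10.00/M)
  • Output: $25.00/M (Fast: $50.00/M)
  • Cache: Read $0.5/M, Write $6.25/M
  • Web Search: $10/K + input costs
  • Per Query: none listed
  • Capabilities: +4
  • Providers: anthropic, bedrock, vertexAnthropic

Released 04/16/2026
  • Context: 1M
  • Latency: 0.9s
  • Throughput: 94tps
  • Input: $5.00/M (Fast: $30.00/M)
  • Output: $25.00/M (Fast: $150.00/M)
  • Cache: Read $0.5/M, Write $6.25/M
  • Web Search: $10/K + input costs
  • Per Query: none listed
  • Capabilities: +4
  • Providers: anthropic, bedrock, vertexAnthropic

Released 02/17/2026
  • Context: 1M
  • Latency: 0.9s
  • Throughput: 75tps
  • Input: $3.00/M
  • Output: $15.00/M
  • Cache: Read $0.3/M, Write $3.75/M
  • Web Search: $10/K + input costs
  • Per Query: none listed
  • Capabilities: +4
  • Providers: anthropic, bedrock, vertexAnthropic

Released 02/05/2026
  • Context: 1M
  • Latency: 1.4s
  • Throughput: 61tps
  • Input: $5.00/M (Fast: $30.00/M)
  • Output: $25.00/M (Fast: $150.00/M)
  • Cache: Read $0.5/M, Write $6.25/M
  • Web Search: $10/K + input costs
  • Per Query: none listed
  • Capabilities: +4
  • Providers: anthropic, bedrock, vertexAnthropic

Released 11/24/2025
  • Context: 200K
  • Latency: 0.6s
  • Throughput: 55tps
  • Input: $5.00/M
  • Output: $25.00/M
  • Cache: Read $0.5/M, Write $6.25/M
  • Web Search: $10.00/K + input costs
  • Per Query: none listed
  • Capabilities: +4
  • Providers: anthropic, bedrock, vertexAnthropic

Released 09/29/2025
  • Context: 1M
  • Latency: 0.7s
  • Throughput: 60tps
  • Input: $3.00/M
  • Output: $15.00/M
  • Cache: Read $0.3/M, Write $3.75/M
  • Web Search: $10.00/K + input costs
  • Per Query: none listed
  • Capabilities: +4
  • Providers: anthropic, bedrock, vertexAnthropic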

About Claude Haiku 4.5

Claude Haiku 4.5 became available on AI Gateway on October 15, 2025. It matches Sonnet 4's performance on coding, computer use, and agent tasks at lower cost and higher speed. Earlier Haiku generations followed the same pattern relative to the top-tier release of their era; Claude Haiku 4.5 continues it in the Claude 4 generation.

Workloads that previously required a Sonnet 4 deployment can now be evaluated against Claude Haiku 4.5. For organizations running large agent fleets or high-frequency pipelines, the cost differential between Haiku and Sonnet tiers compounds at scale.
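To make the scale effect concrete, here is a minimal cost sketch. The Claude Haiku 4.5 rates ($1.00/M input, $5.00/M output) come from the provider table on this page. The Sonnet-tier rates ($3.00/M, $15.00/M) and the workload shape are assumptions chosen for illustration, and the names `Pricing`, `Workload`, and `dailyCost` are hypothetical. Cache and web search charges are ignored.

```typescript
// Prices in USD per million tokens.
interface Pricing {
  inputPerM: number
  outputPerM: number
}

// Claude Haiku 4.5 rates as listed in the provider table on this page.
const HAIKU_4_5: Pricing = { inputPerM: 1.0, outputPerM: 5.0 }

// Assumption: a Sonnet-tier rate used only for comparison.
const SONNET_TIER: Pricing = { inputPerM: 3.0, outputPerM: 15.0 }

// Hypothetical workload shape: tokens per agent step and steps per day.
interface Workload {
  inputTokensPerStep: number
  outputTokensPerStep: number
  stepsPerDay: number
}

// Daily token cost in USD for one workload at one price point.
function dailyCost(p: Pricing, w: Workload): number {
  const inputCost = (w.inputTokensPerStep / 1_000_000) * p.inputPerM
  const outputCost = (w.outputTokensPerStep / 1_000_000) * p.outputPerM
  return (inputCost + outputCost) * w.stepsPerDay
}

const workload: Workload = {
  inputTokensPerStep: 4_000,
  outputTokensPerStep: 500,
  stepsPerDay: 1_000_000,
}

const haiku = dailyCost(HAIKU_4_5, workload)
const sonnet = dailyCost(SONNET_TIER, workload)
console.log(`Haiku 4.5: $${haiku.toFixed(2)}/day, Sonnet-tier: $${sonnet.toFixed(2)}/day`)
```

Under these assumed numbers the fleet costs roughly $6,500 a day on Claude Haiku 4.5 versus roughly $19,500 a day at the Sonnet-tier rate, so the gap grows linearly with step volume.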

Access Claude Haiku 4.5 by setting the model string to anthropic/claude-haiku-4.5 in the AI SDK, Chat Completions API, Responses API, Messages API, or other API formats, from TypeScript or Python. Through AI Gateway, you get observability for tracking usage and cost, bring-your-own-key support, and provider routing across the anthropic, bedrock, and vertexAnthropic providers.
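As a rough sketch of provider routing, the helper below builds the options for a request that states a preferred provider order. The provider slugs are the ones listed on this page. The `providerOptions.gateway.order` field is an assumption about how AI Gateway reads routing preferences, so check it against the AI Gateway documentation before relying on it. `buildRequest` and `GatewayRequest` are hypothetical names.

```typescript
// Provider slugs as listed on this page.
type ProviderSlug = 'anthropic' | 'bedrock' | 'vertexAnthropic'

interface GatewayRequest {
  model: string
  prompt: string
  providerOptions: { gateway: { order: ProviderSlug[] } }
}

// Builds options that could be passed to an AI SDK call such as streamText().
// Assumption: the gateway takes its preferred provider order from
// providerOptions.gateway.order.
function buildRequest(prompt: string, order: ProviderSlug[]): GatewayRequest {
  if (order.length === 0) {
    throw new Error('order must name at least one provider')
  }
  return {
    model: 'anthropic/claude-haiku-4.5',
    prompt,
    providerOptions: { gateway: { order: [...order] } },
  }
}

// Prefer Amazon Bedrock, then fall back to Anthropic.
const request = buildRequest('Why is the sky blue?', ['bedrock', 'anthropic'])
console.log(JSON.stringify(request, null, 2))
```

Keeping the routing preference in one small builder means the model string and provider order can be changed in a single place across a pipeline.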

Claude Haiku 4.5 inherits the Claude 4 generation's improvements in agentic task execution and instruction following. For teams already building with AI SDK v5, switching requires only updating the model string; the common case needs no other code changes.

What To Consider When Choosing a Provider

  • Configuration: For agentic pipelines that deploy Claude Haiku 4.5 as a sub-agent alongside a larger orchestrating model, AI Gateway per-request observability helps attribute token costs across each pipeline stage.
  • Zero Data Retention: AI Gateway supports Zero Data Retention for this model on direct gateway requests; requests made with bring-your-own-key (BYOK) credentials are not covered. See the documentation for configuration steps.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
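The per-stage cost attribution mentioned above can be sketched as a simple aggregation over usage records. This is an illustration only: the `StepUsage` record shape, the stage names, and `costByStage` are hypothetical and are not the format AI Gateway observability reports. The Claude Haiku 4.5 rates are the ones listed on this page.

```typescript
// Prices in USD per million tokens.
interface Pricing {
  inputPerM: number
  outputPerM: number
}

// Hypothetical usage record for one model call in a pipeline.
interface StepUsage {
  stage: string
  model: string
  inputTokens: number
  outputTokens: number
}

// Claude Haiku 4.5 rates from the provider table on this page.
const PRICES: Record<string, Pricing> = {
  'anthropic/claude-haiku-4.5': { inputPerM: 1.0, outputPerM: 5.0 },
}

// Sums token cost in USD per pipeline stage.
function costByStage(
  steps: StepUsage[],
  prices: Record<string, Pricing>,
): Map<string, number> {
  const totals = new Map<string, number>()
  for (const step of steps) {
    const price = prices[step.model]
    if (!price) throw new Error(`no pricing configured for ${step.model}`)
    const cost =
      (step.inputTokens / 1_000_000) * price.inputPerM +
      (step.outputTokens / 1_000_000) * price.outputPerM
    totals.set(step.stage, (totals.get(step.stage) ?? 0) + cost)
  }
  return totals
}

// Illustrative records for two sub-agent stages.
const steps: StepUsage[] = [
  { stage: 'search', model: 'anthropic/claude-haiku-4.5', inputTokens: 2_000, outputTokens: 200 },
  { stage: 'search', model: 'anthropic/claude-haiku-4.5', inputTokens: 3_000, outputTokens: 300 },
  { stage: 'summarize', model: 'anthropic/claude-haiku-4.5', inputTokens: 10_000, outputTokens: 1_000 },
]

const totals = costByStage(steps, PRICES)
for (const [stage, usd] of totals) {
  console.log(`${stage}: $${usd.toFixed(4)}`)
}
```

Adding an orchestrating model to the pipeline would only require another entry in the price table, since unknown models fail loudly instead of being counted as free.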

When to Use Claude Haiku 4.5

Best For

  • High-volume agentic pipelines: Cost per agent step is a primary architecture constraint and Sonnet 4-level capability is required at scale
  • Computer use tasks: Workloads run at a volume where Sonnet 4 pricing becomes impractical
  • Sub-agent roles: Fast, capable, low-latency handling of defined subtasks in multi-model pipelines
  • Real-time user-facing features: Code completion, inline suggestions, and live chat, where latency requirements rule out slower models
  • Coding workflows: Benchmark parity with Sonnet 4 on coding tasks means little capability is given up for the lower price

Consider Alternatives When

  • Deepest reasoning tasks: Claude Sonnet 4.5 or Opus 4-tier models handle the most complex multi-step problem solving
  • Adaptive thinking: Extended thinking is supported (this page lists Reasoning among the model's capabilities), but adaptive thinking is not available on this tier
  • Maximum context window: Claude Haiku 4.5 is listed with a 200K context window on every provider on this page; tasks that process larger inputs need a model with a bigger window
  • Haiku-tier ceiling: Task profiles that exceed Haiku-tier capability require a larger model

Conclusion

Claude Haiku 4.5 shifts the speed-cost tradeoff in agentic deployments. Rather than serving only as the budget option, it matches the Sonnet 4 baseline on coding, computer use, and agent tasks. If you run agent infrastructure at scale, benchmark Claude Haiku 4.5 against the Sonnet-tier deployments you run in production before assuming you need a higher tier.