Claude Sonnet 4.6

Claude Sonnet 4.6 brings Opus-approaching intelligence to the Sonnet tier. It offers adaptive thinking, a 1M-token context window, stronger agentic coding, better frontend UI quality, and more accurate computer use, plus MCP support for scaled tool use and interleaved thinking with tool calls.

Capabilities: File Input, Reasoning, Tool Use, Vision (Image), Explicit Caching, Web Search

index.ts

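import { streamText } from 'ai'

// Start streaming a response from Claude Sonnet 4.6.
const result = streamText({
  model: 'anthropic/claude-sonnet-4.6',
  prompt: 'Why is the sky blue?'
})

// Print each text chunk as it arrives.
for await (const textPart of result.textStream) {
  process.stdout.write(textPart)
}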

Playground

Try out Claude Sonnet 4.6 by Anthropic. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider	Context	Latency	Throughput	Input	Output	Cache	Web Search	Per Query	Capabilities	ZDR	No Training	Release Date
Anthropic		1.2s	51tps	$3.00/M	$15.00/M	Read: $0.3/M, Write: $3.75/M	$10/K + input costs	—				02/17/2026
Google Vertex AI		0.8s	54tps	$3.00/M	$15.00/M	Read: $0.3/M, Write: $3.75/M	$10/K + input costs	—				02/17/2026
Amazon Bedrock		0.9s	57tps	$3.00/M	$15.00/M	Read: $0.3/M, Write: $3.75/M		—				02/17/2026
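The per-token prices in the provider table translate into a per-request cost with simple arithmetic. The following is a minimal sketch, assuming the listed Claude Sonnet 4.6 rates (input $3.00/M, output $15.00/M, cache read $0.30/M, cache write $3.75/M). The `Pricing` and `Usage` types and the `estimateCostUsd` function are hypothetical names for illustration, not part of any SDK, and web search charges are left out.

```typescript
// Hypothetical helper: names and shapes are illustrative, not an SDK API.
// Prices are USD per million tokens, copied from the provider table above.
type Pricing = { input: number; output: number; cacheRead: number; cacheWrite: number }

type Usage = {
  inputTokens: number       // uncached prompt tokens
  outputTokens: number      // generated tokens
  cacheReadTokens?: number  // prompt tokens served from the cache
  cacheWriteTokens?: number // prompt tokens written to the cache
}

const SONNET_4_6_PRICING: Pricing = { input: 3.0, output: 15.0, cacheRead: 0.3, cacheWrite: 3.75 }

// Sums each token count times its per-million rate, then converts to dollars.
function estimateCostUsd(usage: Usage, pricing: Pricing = SONNET_4_6_PRICING): number {
  const total =
    usage.inputTokens * pricing.input +
    usage.outputTokens * pricing.output +
    (usage.cacheReadTokens ?? 0) * pricing.cacheRead +
    (usage.cacheWriteTokens ?? 0) * pricing.cacheWrite
  return total / 1_000_000
}
```

Under these assumptions, a request with 200,000 uncached input tokens and 10,000 output tokens comes to roughly $0.75.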

More models by Anthropic

Model	Context	Latency	Throughput	Input	Output	Cache	Web Search	Per Query	Capabilities	Providers	ZDR	No Training	Release Date
anthropic/claude-opus-4.8		3.0s	112tps	$5.00/M (Fast: $10.00/M)	$25.00/M (Fast: $50.00/M)	Read: $0.5/M, Write: $6.25/M	$10/K + input costs	—					05/28/2026
anthropic/claude-opus-4.7		0.7s	99tps	$5.00/M (Fast: $30.00/M)	$25.00/M (Fast: $150.00/M)	Read: $0.5/M, Write: $6.25/M	$10/K + input costs	—					04/16/2026
anthropic/claude-opus-4.6		1.4s	59tps	$5.00/M (Fast: $30.00/M)	$25.00/M (Fast: $150.00/M)	Read: $0.5/M, Write: $6.25/M	$10/K + input costs	—					02/05/2026
anthropic/claude-opus-4.5	200K	0.6s	50tps	$5.00/M	$25.00/M	Read: $0.5/M, Write: $6.25/M	$10.00/K + input costs	—					11/24/2025
anthropic/claude-haiku-4.5	200K	0.5s	134tps	$1.00/M	$5.00/M	Read: $0.1/M, Write: $1.25/M	$10.00/K + input costs	—					10/15/2025
anthropic/claude-sonnet-4.5		0.8s	60tps	$3.00/M	$15.00/M	Read: $0.3/M, Write: $3.75/M	$10.00/K + input costs	—					09/29/2025

About Claude Sonnet 4.6

Claude Sonnet 4.6 launched on AI Gateway on February 17, 2026. Anthropic described it as approaching Opus-level intelligence. Claude Sonnet 4.6 advances multiple capabilities the Sonnet tier developed across the 4.x generation: stronger agentic coding, improved code review, better frontend UI quality, and higher computer use accuracy.

Two architecture features define Claude Sonnet 4.6. First, adaptive thinking: the model decides when and how much to reason, rather than requiring a fixed extended thinking budget. Simpler requests get a proportionate response. Complex ones receive deeper reasoning automatically. You don't need to categorize each request in advance. This evolves the thinking mode introduced in Claude 3.7 Sonnet into a smarter, model-directed form.

Second, a 1M-token context window at Sonnet pricing. Claude Sonnet 4.5 was the first Sonnet-tier model to reach 1M tokens; Claude Sonnet 4.6 keeps that as standard. For teams working with large codebases, long document collections, or extended agent histories, Opus-approaching quality at 1M tokens and Sonnet pricing changes what's economically feasible.
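Before sending a large codebase or document set, it can help to check roughly whether it fits in the 1M-token window. The sketch below is a planning aid only: the four-characters-per-token ratio and the default output reserve are assumed heuristics, not the model's actual tokenizer or a documented limit, and `estimateTokens` and `fitsInContext` are hypothetical names.

```typescript
// Rough planning aid. ASSUMED_CHARS_PER_TOKEN is a heuristic assumption,
// not the model's real tokenizer; actual token counts will differ.
const CONTEXT_WINDOW_TOKENS = 1_000_000
const ASSUMED_CHARS_PER_TOKEN = 4

function estimateTokens(text: string): number {
  return Math.ceil(text.length / ASSUMED_CHARS_PER_TOKEN)
}

// Returns true when all documents plus a reserved output budget
// fit inside the context window under the heuristic above.
function fitsInContext(documents: string[], reservedOutputTokens = 16_000): boolean {
  const promptTokens = documents.reduce((sum, doc) => sum + estimateTokens(doc), 0)
  return promptTokens + reservedOutputTokens <= CONTEXT_WINDOW_TOKENS
}
```

For production use, a real token count from the provider is a safer basis than a character ratio.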

MCP (Model Context Protocol) support for scaled tool use means Claude Sonnet 4.6 can work within large tool ecosystems without you having to specify individual tools for each interaction. The model proactively executes tasks, delegates to subagents, and parallelizes tool calls.
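When the model requests several tool calls in a single turn, the calling application can run them concurrently. The sketch below shows one way that might look on the client side. The `ToolCall`, `ToolResult`, and `ToolHandler` shapes and the `runToolCalls` function are hypothetical and are not the MCP SDK's or AI SDK's types.

```typescript
// Hypothetical shapes for illustration; not the MCP SDK or AI SDK types.
type ToolCall = { id: string; name: string; args: Record<string, unknown> }
type ToolResult = { id: string; ok: boolean; output: unknown }
type ToolHandler = (args: Record<string, unknown>) => Promise<unknown>

// Runs every tool call from one model turn concurrently and
// returns the results in the same order as the calls.
async function runToolCalls(
  calls: ToolCall[],
  registry: Map<string, ToolHandler>
): Promise<ToolResult[]> {
  return Promise.all(
    calls.map(async (call): Promise<ToolResult> => {
      const handler = registry.get(call.name)
      if (!handler) {
        return { id: call.id, ok: false, output: `unknown tool: ${call.name}` }
      }
      try {
        return { id: call.id, ok: true, output: await handler(call.args) }
      } catch (err) {
        // Report the failure as a result instead of aborting the whole turn.
        return { id: call.id, ok: false, output: String(err) }
      }
    })
  )
}
```

Catching errors per call keeps one failing tool from discarding the results of the others, which matters more as the number of parallel calls grows.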

What To Consider When Choosing a Provider

Configuration: Adaptive thinking calibrates token usage automatically. Pair it with the effort parameter to manage cost on mixed workloads where some requests benefit from deep reasoning and others don't.
Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
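For the configuration point above, one approach is to choose an effort level per workload type before building the request. The sketch below only constructs a plain settings object and makes no network call. The `effort` option name, its `'low' | 'medium' | 'high'` values, and its placement under `providerOptions.anthropic` are assumptions to confirm against the AI Gateway and provider documentation; the `Workload` categories and `buildRequest` are hypothetical.

```typescript
// Sketch only: the `effort` option name, values, and placement are assumptions;
// confirm the exact request shape in the AI Gateway and provider docs.
type Effort = 'low' | 'medium' | 'high'
type Workload = 'lookup' | 'code-review' | 'multi-step-agent'

// Hypothetical mapping from workload type to effort level.
const EFFORT_BY_WORKLOAD: Record<Workload, Effort> = {
  lookup: 'low',
  'code-review': 'medium',
  'multi-step-agent': 'high',
}

// Builds request settings with an effort level chosen for the workload.
function buildRequest(prompt: string, workload: Workload) {
  return {
    model: 'anthropic/claude-sonnet-4.6',
    prompt,
    providerOptions: {
      anthropic: { effort: EFFORT_BY_WORKLOAD[workload] },
    },
  }
}
```

Keeping the mapping in one table makes it easy to adjust after measuring cost and quality on real traffic.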

When to Use Claude Sonnet 4.6

Best For

Agentic coding and code review: Work that requires strong intelligence with efficient reasoning; these are the primary capability improvements in this release
Frontend UI development: Projects where the quality of visual output matters, highlighted as an area of improvement over previous Sonnet versions
Computer use workflows: GUI automation and screen-driven agents benefit from the higher accuracy in 4.6, at Sonnet pricing rather than the Opus tier
Large-context agentic tasks: The 1M-token context window enables processing entire codebases or document sets at Sonnet pricing
MCP-based tool environments: Cases where the model needs to interact with large, diverse tool ecosystems

Consider Alternatives When

You need the maximum intelligence ceiling: Claude Opus 4.6 provides full Opus depth with the same 1M context window
You have tight latency budgets: Claude Haiku 4.5 is faster and cheaper for well-bounded, high-throughput requests
You need explicit thinking budgets: Earlier models like Claude 3.7 Sonnet accept a fixed thinking token budget instead of adaptive mode
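The guidance above can be expressed as a simple routing rule. This is a minimal sketch, assuming two boolean requirements; the `Requirements` shape and `chooseModel` function are hypothetical. The model slugs come from the model table on this page. The explicit-thinking-budget case is omitted because no slug for Claude 3.7 Sonnet is listed here.

```typescript
// Illustrative routing rule; the Requirements shape is hypothetical.
type Requirements = {
  needsMaxIntelligence?: boolean // quality ceiling matters more than cost
  latencySensitive?: boolean     // well-bounded, high-throughput requests
}

// Picks a model slug, checking the intelligence requirement first.
function chooseModel(req: Requirements): string {
  if (req.needsMaxIntelligence) return 'anthropic/claude-opus-4.6'
  if (req.latencySensitive) return 'anthropic/claude-haiku-4.5'
  return 'anthropic/claude-sonnet-4.6'
}
```

The order of the checks is a design choice: here a request that is both latency-sensitive and needs maximum intelligence goes to Opus, which may not suit every workload.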

Conclusion

Claude Sonnet 4.6 combines a 1M-token context window, adaptive thinking, MCP (Model Context Protocol) support, and Opus-approaching intelligence at Sonnet pricing. It's a strong default for agentic coding, large-context analysis, and frontend development.
