Skip to content
Dashboard

DeepSeek V3.2

DeepSeek V3.2 is DeepSeek's December 1, 2025 model on AI Gateway. It combines tool use with both reasoning and non-reasoning inference modes for agent-style operations.

Tool UseImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'deepseek/deepseek-v3.2',
prompt: 'Why is the sky blue?'
})

Playground

Try out DeepSeek V3.2 by DeepSeek. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

deepseek logo
deepseek logo

Ask DeepSeek V3.2 anything to try it out.

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
DeepSeek
128K
0.7s
85tps
$0.28/M$0.42/M
Read:$0.03/M
Write:—
——
12/01/2025
DeepInfra
164K
0.9s
17tps
$0.26/M$0.38/M
Read:$0.13/M
Write:—
——
12/01/2025
Novita AI
164K
1.5s
28tps
$0.28/M$0.42/M
Read:$0.13/M
Write:—
——
12/01/2025
Amazon Bedrock
128K
1.0s
43tps
$0.62/M$1.85/M——
12/01/2025
Throughput

P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.

Latency

P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.

Uptime

Direct request success rate on AI Gateway and per-provider. Visit the docs for more info.

More models by DeepSeek

Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Release Date
1M
1.8s
111tps
$0.14/M$0.28/M
Read:$0.0/M
Write:—
——
+1
azure logo
deepinfra logo
deepseek logo
+2
04/23/2026
1M
0.5s
152tps
$1.74/M$0.43/M
$3.48/M$0.87/M
Read:$0.0/M
Write:—
——
+1
azure logo
baseten logo
deepinfra logo
+4
04/23/2026
164K
1.1s
48tps
$0.26/M$0.38/M
Read:$0.13/M
Write:—
——
+1
bedrock logo
deepinfra logo
fireworks logo
+1
12/01/2025
131K
1.5s
29tps
$0.27/M$1.00/M
Read:$0.14/M
Write:—
——
+1
novita logo
09/22/2025
164K
1.0s
38tps
$0.21/M$0.79/M
Read:$0.13/M
Write:—
——
+1
deepinfra logo
novita logo
sambanova logo
+1
08/21/2025
164K
1.2s
38tps
$0.27/M$1.12/M
Read:$0.14/M
Write:—
——
novita logo
12/26/2024

About DeepSeek V3.2

DeepSeek V3.2 became available on AI Gateway on December 1, 2025 as the next major iteration of DeepSeek's V3 family. The key capability: the model supports combined thinking and tool use, handling tool calls in both reasoning and non-reasoning modes. This distinguishes it from models where tool use and thinking mode are mutually exclusive, which previously forced developers to choose between the two.

The context window of 163.8K tokens carries over from earlier V3 generation models. Max output is 65.5K tokens in standard chat mode. DeepSeek V3.2 is the general-purpose variant in the V3.2 release, suitable for use cases from chat interfaces to multi-step agent pipelines. For workloads that need maximum reasoning depth and can tolerate higher token consumption, the DeepSeek V3.2 Thinking variant extends reasoning output up to 64,000 tokens but drops tool-use support.

Access through AI Gateway removes the need for a separate provider account. Authentication uses AI Gateway API keys or OIDC tokens, and the AI SDK provides a direct integration path. You can adopt DeepSeek V3.2 without managing DeepSeek platform credentials separately.

What To Consider When Choosing a Provider

  • Configuration: DeepSeek V3.2 supports tool calls in both reasoning and non-reasoning modes. Test both paths in your integration to confirm your tool schema and response parsing logic handle the output structure from each mode correctly.
  • Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use DeepSeek V3.2

Best For

  • Combined tool and reasoning: Agentic applications that need both tool calling and reasoning in the same pipeline through one endpoint
  • General-purpose V3.2 workflows: Chat and instruction-following tasks using the V3.2 generation
  • Drop-in V3.1 upgrade: Production API integrations using the AI SDK or OpenAI-compatible interfaces
  • Mixed task pipelines: Tool-augmented completions and reasoning chains served from a single endpoint

Consider Alternatives When

  • Maximum reasoning depth: Use DeepSeek V3.2 Thinking (deepseek-v3.2-thinking) for up to 64K tokens of output when tool use is not needed
  • Benchmark-level math or code: DeepSeek-R1 remains the dedicated reasoning specialist for math and code reasoning workloads

Conclusion

DeepSeek V3.2 resolves a practical constraint in agent design by supporting tool calls across both reasoning and non-reasoning modes. Available through AI Gateway as of December 1, 2025, it provides a straightforward upgrade path from earlier DeepSeek V3 models.