DeepSeek V3.2

DeepSeek V3.2 is a DeepSeek model that became available on AI Gateway on December 1, 2025. It supports tool use in both reasoning and non-reasoning inference modes, which suits agent-style workloads.

Tool Use, Implicit Caching

index.ts

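import { streamText } from 'ai'

// Start a streaming completion; the model ID selects DeepSeek V3.2.
const result = streamText({
  model: 'deepseek/deepseek-v3.2',
  prompt: 'Why is the sky blue?'
})

// Consume the stream so the response is actually printed.
for await (const textPart of result.textStream) {
  process.stdout.write(textPart)
}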


Playground

Try out DeepSeek V3.2 by DeepSeek. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.


Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider	Context	Latency	Throughput	Input	Output	Cache	Release Date
DeepSeek	128K	0.7s	85tps	$0.28/M	$0.42/M	Read: $0.03/M, Write: —	12/01/2025
DeepInfra	164K	0.9s	17tps	$0.26/M	$0.38/M	Read: $0.13/M, Write: —	12/01/2025
Novita AI	164K	1.5s	28tps	$0.28/M	$0.42/M	Read: $0.13/M, Write: —	12/01/2025
Amazon Bedrock	128K	1.0s	43tps	$0.62/M	$1.85/M	—	12/01/2025
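The per-token prices above can be turned into a rough per-request estimate. The sketch below is illustrative only: the prices are copied from the Providers table (USD per million tokens), while the `ProviderPrice` type, the `estimateCostUsd` and `cheapestFirst` helpers, and the lowercase provider slugs are hypothetical names. Copy the real slugs from the page before relying on them.

```typescript
// Prices in USD per 1M tokens, copied from the Providers table.
// The slugs are assumed for illustration; copy the real ones from the page.
interface ProviderPrice {
  slug: string
  inputPerM: number
  outputPerM: number
}

const providers: ProviderPrice[] = [
  { slug: 'deepseek', inputPerM: 0.28, outputPerM: 0.42 },
  { slug: 'deepinfra', inputPerM: 0.26, outputPerM: 0.38 },
  { slug: 'novita', inputPerM: 0.28, outputPerM: 0.42 },
  { slug: 'bedrock', inputPerM: 0.62, outputPerM: 1.85 },
]

// Cost of one request in USD, ignoring any cache-read discount.
function estimateCostUsd(p: ProviderPrice, inputTokens: number, outputTokens: number): number {
  return (inputTokens / 1_000_000) * p.inputPerM + (outputTokens / 1_000_000) * p.outputPerM
}

// Provider slugs sorted from cheapest to most expensive for a given workload.
function cheapestFirst(inputTokens: number, outputTokens: number): string[] {
  return [...providers]
    .sort(
      (a, b) =>
        estimateCostUsd(a, inputTokens, outputTokens) -
        estimateCostUsd(b, inputTokens, outputTokens),
    )
    .map((p) => p.slug)
}

// Example workload: 20,000 input tokens and 2,000 output tokens per request.
const preferredOrder = cheapestFirst(20_000, 2_000)
console.log(preferredOrder)
```

Price is only one axis. The same table shows large differences in latency and throughput, so a real preference would likely weigh those as well.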

More models by DeepSeek

Model	Context	Latency	Throughput	Input	Output	Cache	Release Date
deepseek/deepseek-v4-flash		1.8s	111tps	$0.14/M	$0.28/M	Read: $0.0/M, Write: —	04/23/2026
deepseek/deepseek-v4-pro		0.5s	152tps	$1.74/M, $0.43/M	$3.48/M, $0.87/M	Read: $0.0/M, Write: —	04/23/2026
deepseek/deepseek-v3.2-thinking	164K	1.1s	48tps	$0.26/M	$0.38/M	Read: $0.13/M, Write: —	12/01/2025
deepseek/deepseek-v3.1-terminus	131K	1.5s	29tps	$0.27/M	$1.00/M	Read: $0.14/M, Write: —	09/22/2025
deepseek/deepseek-v3.1	164K	1.0s	38tps	$0.21/M	$0.79/M	Read: $0.13/M, Write: —	08/21/2025
deepseek/deepseek-v3	164K	1.2s	38tps	$0.27/M	$1.12/M	Read: $0.14/M, Write: —	12/26/2024

About DeepSeek V3.2

DeepSeek V3.2 became available on AI Gateway on December 1, 2025 as the next major iteration of DeepSeek's V3 family. Its key capability is combined thinking and tool use: the model handles tool calls in both reasoning and non-reasoning modes. In models where tool use and thinking mode are mutually exclusive, developers have to choose between the two.

The context window is up to 163.8K tokens, carried over from earlier V3-generation models; some providers list 128K (see the Providers table). Max output is 65.5K tokens in standard chat mode. DeepSeek V3.2 is the general-purpose variant in the V3.2 release, suitable for use cases from chat interfaces to multi-step agent pipelines. For workloads that need maximum reasoning depth and can tolerate higher token consumption, the DeepSeek V3.2 Thinking variant extends reasoning output up to 64,000 tokens but drops tool-use support.
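As a rough illustration of how these limits interact, the sketch below computes how much output budget remains for a prompt of a given size. It rests on several assumptions: that "163.8K" means 163,840 tokens and "65.5K" means 65,536 tokens, that prompt and output share one context window, and that the hypothetical helper name `maxOutputBudget` is acceptable. Providers that list a smaller context window would need a lower constant.

```typescript
// Limits quoted in the text, expanded to exact token counts.
// Assumption: "163.8K" = 163,840 and "65.5K" = 65,536.
const CONTEXT_WINDOW_TOKENS = 163_840
const MAX_OUTPUT_TOKENS = 65_536

// Largest output budget that still fits next to a prompt of the given size,
// assuming prompt and output tokens share the same context window.
function maxOutputBudget(promptTokens: number): number {
  const remaining = CONTEXT_WINDOW_TOKENS - promptTokens
  return Math.max(0, Math.min(MAX_OUTPUT_TOKENS, remaining))
}

console.log(maxOutputBudget(10_000)) // short prompt: capped by the output limit
console.log(maxOutputBudget(150_000)) // long prompt: capped by the remaining context
```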

Access through AI Gateway removes the need for a separate provider account. Authentication uses AI Gateway API keys or OIDC tokens, and the AI SDK provides a direct integration path. You can adopt DeepSeek V3.2 without managing DeepSeek platform credentials separately.
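For integrations that use an OpenAI-compatible interface instead of the AI SDK, an authenticated request might be assembled as sketched below. This is an assumption-laden illustration, not the gateway's documented API: the base URL is a parameter you must take from the documentation, the `/chat/completions` path and `Bearer` scheme follow the common OpenAI-compatible convention, and `buildChatRequest` and `ChatRequest` are hypothetical names. The function only builds the request and does not send it.

```typescript
// Hypothetical request description; pass it to fetch() or any HTTP client.
interface ChatRequest {
  url: string
  method: 'POST'
  headers: Record<string, string>
  body: string
}

// Builds (but does not send) a chat completion request.
// Assumptions: OpenAI-compatible path and Bearer authentication.
function buildChatRequest(baseUrl: string, apiKey: string, prompt: string): ChatRequest {
  return {
    url: `${baseUrl.replace(/\/+$/, '')}/chat/completions`,
    method: 'POST',
    headers: {
      'Content-Type': 'application/json',
      Authorization: `Bearer ${apiKey}`,
    },
    body: JSON.stringify({
      model: 'deepseek/deepseek-v3.2',
      messages: [{ role: 'user', content: prompt }],
    }),
  }
}

// Placeholder values; use the real base URL and key from your gateway settings.
const exampleRequest = buildChatRequest('https://gateway.example/v1', 'YOUR_API_KEY', 'Why is the sky blue?')
console.log(exampleRequest.url)
```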

What To Consider When Choosing a Provider

Configuration: DeepSeek V3.2 supports tool calls in both reasoning and non-reasoning modes. Test both paths in your integration to confirm your tool schema and response parsing logic handle the output structure from each mode correctly.
Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
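The Configuration point above suggests exercising both output paths. A minimal sketch of a parser that accepts either shape is shown below. The message layout is an assumed OpenAI-compatible format, and the `reasoning_content` field name, the `parseTurn` helper, and the `getWeather` tool are illustrative assumptions; check the actual response structure returned for each mode.

```typescript
// Assumed OpenAI-compatible assistant message; field names are illustrative.
interface ToolCall {
  id: string
  function: { name: string; arguments: string } // arguments is a JSON string
}

interface AssistantMessage {
  content: string | null
  reasoning_content?: string | null // assumed to appear only in reasoning mode
  tool_calls?: ToolCall[]
}

interface ParsedTurn {
  text: string
  reasoning: string | null
  calls: { id: string; name: string; args: unknown }[]
}

// Normalizes a message from either mode into one structure.
// JSON.parse throws if the model emits malformed tool arguments.
function parseTurn(msg: AssistantMessage): ParsedTurn {
  const calls = (msg.tool_calls ?? []).map((c) => ({
    id: c.id,
    name: c.function.name,
    args: JSON.parse(c.function.arguments) as unknown,
  }))
  return { text: msg.content ?? '', reasoning: msg.reasoning_content ?? null, calls }
}

// Hand-written fixtures for the two paths, using a hypothetical getWeather tool.
const nonReasoningFixture: AssistantMessage = {
  content: null,
  tool_calls: [{ id: 'call_1', function: { name: 'getWeather', arguments: '{"city":"Oslo"}' } }],
}
const reasoningFixture: AssistantMessage = {
  ...nonReasoningFixture,
  reasoning_content: 'The user wants the weather, so call the tool.',
}

console.log(parseTurn(nonReasoningFixture).calls.length, parseTurn(reasoningFixture).reasoning)
```

Replacing the fixtures with responses captured from each mode would turn this into a small regression test for the integration.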

When to Use DeepSeek V3.2

Best For

Combined tool and reasoning: Agentic applications that need both tool calling and reasoning in the same pipeline through one endpoint
General-purpose V3.2 workflows: Chat and instruction-following tasks using the V3.2 generation
Drop-in V3.1 upgrade: Production API integrations built on V3.1 that use the AI SDK or OpenAI-compatible interfaces
Mixed task pipelines: Tool-augmented completions and reasoning chains served from a single endpoint

Consider Alternatives When

Maximum reasoning depth: Use DeepSeek V3.2 Thinking (deepseek-v3.2-thinking) for up to 64K tokens of output when tool use is not needed
Benchmark-level math or code: DeepSeek-R1 remains the dedicated reasoning specialist for math and code reasoning workloads
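The choice between the two V3.2 variants can be reduced to a small decision rule. The sketch below uses the two model IDs that appear on this page; the `Workload` type and the `pickModel` helper are hypothetical, and the rule encodes only the trade-off stated above (the Thinking variant has no tool use).

```typescript
type ModelId = 'deepseek/deepseek-v3.2' | 'deepseek/deepseek-v3.2-thinking'

// Hypothetical description of what a workload needs.
interface Workload {
  needsTools: boolean
  needsMaxReasoningDepth: boolean
}

function pickModel(w: Workload): ModelId {
  // The Thinking variant drops tool-use support, so a need for tools decides first.
  if (w.needsTools) return 'deepseek/deepseek-v3.2'
  return w.needsMaxReasoningDepth ? 'deepseek/deepseek-v3.2-thinking' : 'deepseek/deepseek-v3.2'
}

console.log(pickModel({ needsTools: true, needsMaxReasoningDepth: true }))
console.log(pickModel({ needsTools: false, needsMaxReasoningDepth: true }))
```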

Conclusion

DeepSeek V3.2 resolves a practical constraint in agent design by supporting tool calls across both reasoning and non-reasoning modes. Available through AI Gateway as of December 1, 2025, it provides a straightforward upgrade path from earlier DeepSeek V3 models.
