GPT 5.4 Nano
GPT 5.4 Nano is the smallest and most affordable model in the GPT-5.4 family. It performs close to GPT-5.4 Mini in evaluations at a lower price point and is built for high-volume sub-agent workflows.
import { streamText } from 'ai'
const result = streamText({ model: 'openai/gpt-5.4-nano', prompt: 'Why is the sky blue?' })
Playground
Try out GPT 5.4 Nano by OpenAI. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.
Providers
Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
Per-provider metrics:
- Throughput: P50 on live AI Gateway traffic, in tokens per second (TPS).
- Time to first token (TTFT): P50 on live AI Gateway traffic, in milliseconds.
- Success rate: direct request success rate on AI Gateway and per provider.
Visit the docs for more info on each metric.
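The provider preference described above can be sketched as a small request builder. This assumes the gateway reads an ordered list of provider slugs from `providerOptions.gateway.order`; confirm the option name in the docs before relying on it. `withProviderOrder` is a hypothetical helper, and the slugs in the usage note are placeholders rather than values taken from this page.

```typescript
// Hypothetical helper: attaches a provider preference to a gateway request.
// Assumption: the gateway honors `providerOptions.gateway.order`.
interface RoutedRequest {
  model: string
  prompt: string
  providerOptions: { gateway: { order: string[] } }
}

function withProviderOrder(model: string, prompt: string, slugs: string[]): RoutedRequest {
  // Trim each slug, drop empties, and de-duplicate while keeping first-seen order.
  const order: string[] = []
  for (const raw of slugs) {
    const slug = raw.trim()
    if (slug.length > 0 && order.indexOf(slug) === -1) {
      order.push(slug)
    }
  }
  if (order.length === 0) {
    throw new Error('At least one provider slug is required')
  }
  return { model, prompt, providerOptions: { gateway: { order } } }
}
```

For example, `streamText(withProviderOrder('openai/gpt-5.4-nano', 'Why is the sky blue?', ['openai', 'azure']))` would ask the gateway to try the first slug before the second, assuming both providers serve this model.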
About GPT 5.4 Nano
GPT 5.4 Nano became available on March 17, 2026 on AI Gateway as the smallest and most affordable model in the GPT-5.4 family. It performs close to GPT-5.4 Mini in evaluations while costing less per token, making it well-suited for high-volume use cases where cost scales with the number of parallel calls.
The model supports verbosity and reasoning level parameters, giving you per-request control over response detail and over how much the model reasons before answering. It's built for sub-agent workflows where multiple smaller models coordinate on parts of a larger task, and its price point keeps per-request inference viable at high traffic levels.
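A minimal sketch of setting both parameters per request follows. It assumes the AI SDK forwards OpenAI provider options named `reasoningEffort` and `textVerbosity`, and that this model accepts `'low' | 'medium' | 'high'` for each; neither detail is confirmed on this page. `buildNanoRequest` is a hypothetical helper.

```typescript
// Assumed value sets; check the provider docs for what this model accepts.
type ReasoningEffort = 'low' | 'medium' | 'high'
type Verbosity = 'low' | 'medium' | 'high'

interface NanoRequest {
  model: string
  prompt: string
  providerOptions: {
    openai: { reasoningEffort: ReasoningEffort; textVerbosity: Verbosity }
  }
}

// Hypothetical helper: builds the options object to hand to streamText().
// Defaults favor short, cheap answers, which suits high-volume sub-agent calls.
function buildNanoRequest(
  prompt: string,
  effort: ReasoningEffort = 'low',
  verbosity: Verbosity = 'low',
): NanoRequest {
  return {
    model: 'openai/gpt-5.4-nano',
    prompt,
    providerOptions: {
      openai: { reasoningEffort: effort, textVerbosity: verbosity },
    },
  }
}
```

With the `ai` package installed, `streamText(buildNanoRequest('Why is the sky blue?'))` would send the request with both settings at `'low'`; pass `'high'` for either argument on the calls that need more depth or detail.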
With a context window of 400K tokens, GPT 5.4 Nano can process substantial inputs even when outputs remain short. For classification, routing, lightweight code checks, and batch processing at scale, it provides GPT-5.4-generation quality at the lowest cost in the family.
What To Consider When Choosing a Provider
- Cost: GPT 5.4 Nano performs close to GPT-5.4 Mini in evaluations at a lower price point. Choose it when cost scales with the number of parallel calls.
- Configuration: Like GPT-5.4 Mini, it supports verbosity and reasoning level parameters, giving you control over response detail and reasoning depth per request.
- Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
- Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
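The authentication point above can be illustrated with a startup check that reports which gateway credential is present. This is a sketch under assumptions: the environment variable names `AI_GATEWAY_API_KEY` and `VERCEL_OIDC_TOKEN`, and the choice to prefer the API key when both are set, are not stated on this page. `resolveGatewayCredential` is a hypothetical helper.

```typescript
type GatewayCredential =
  | { kind: 'api-key'; token: string }
  | { kind: 'oidc'; token: string }

// Hypothetical helper: picks a gateway credential from an environment map.
// Assumption: the variable names below match your deployment.
function resolveGatewayCredential(
  env: Record<string, string | undefined>,
): GatewayCredential {
  const apiKey = env['AI_GATEWAY_API_KEY']
  if (apiKey) {
    return { kind: 'api-key', token: apiKey }
  }
  const oidcToken = env['VERCEL_OIDC_TOKEN']
  if (oidcToken) {
    return { kind: 'oidc', token: oidcToken }
  }
  // Failing here gives a clearer message than a rejected request later.
  throw new Error('No AI Gateway credential found: set an API key or an OIDC token')
}
```

In a Node.js process you might call `resolveGatewayCredential(process.env)` once at startup. No provider-specific key appears anywhere, which matches the point that provider credentials are not managed directly.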
When to Use GPT 5.4 Nano
Best For
- High-volume sub-agent workflows: Parallel calls where cost scales with the number of agents
- Classification and routing: Sentiment analysis, intent detection, and request triage at high volume
- Lightweight code tasks: Simple code checks, unused import detection, and quick validations
- Cost-sensitive batch processing: Large-scale inference where per-call cost is the primary constraint
- Pipeline preprocessing: Fast filtering and extraction steps that feed into larger model calls
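The high-volume patterns listed above usually come down to fanning many small calls out in parallel with a cap on how many are in flight. The sketch below shows one way to do that. `classifyAll` and the `Classifier` type are hypothetical, and the model call is injected as a function so the concurrency logic stays independent of any SDK.

```typescript
// A classifier takes one input and resolves to a label.
// In practice it would wrap a call to the model.
type Classifier = (text: string) => Promise<string>

// Runs `classify` over every input with at most `concurrency` calls in flight.
// Results come back in the same order as the inputs.
async function classifyAll(
  inputs: string[],
  classify: Classifier,
  concurrency: number = 4,
): Promise<string[]> {
  const results: string[] = new Array<string>(inputs.length)
  const queue = inputs.map((text, index) => ({ text, index }))

  // Each worker pulls the next job until the shared queue is empty.
  async function worker(): Promise<void> {
    for (let job = queue.shift(); job !== undefined; job = queue.shift()) {
      results[job.index] = await classify(job.text)
    }
  }

  const workerCount = Math.max(1, Math.min(concurrency, inputs.length))
  const workers: Promise<void>[] = []
  for (let i = 0; i < workerCount; i++) {
    workers.push(worker())
  }
  await Promise.all(workers)
  return results
}
```

A shared queue keeps every worker busy until the inputs run out, so one slow call does not stall a whole batch. This sketch does no error handling: a rejected call rejects the whole run, so a production version would likely add retries or per-item error capture.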
Consider Alternatives When
- Higher capability needed: GPT-5.4 Mini for agentic tasks that require more reliable multi-step completion
- Maximum quality: GPT-5.4 or GPT-5.4 Pro for complex reasoning and analysis
- Specialized coding: GPT-5.3 Codex for autonomous software engineering
- Deep deliberation: o3 for chain-of-thought reasoning on hard problems
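The guidance in the two lists above can be encoded as a lookup from task type to model, which is a rough sketch of how a router might pick a tier. Only `openai/gpt-5.4-nano` appears on this page; every other slug below is an assumption that follows the same naming pattern and should be checked against the model list. The `Task` names and `pickModel` are hypothetical.

```typescript
type Task =
  | 'classification'
  | 'batch'
  | 'agentic'
  | 'complex-reasoning'
  | 'software-engineering'
  | 'hard-deliberation'

// Only the nano slug is taken from this page; the rest are assumed.
const MODEL_BY_TASK: Record<Task, string> = {
  'classification': 'openai/gpt-5.4-nano',
  'batch': 'openai/gpt-5.4-nano',
  'agentic': 'openai/gpt-5.4-mini', // assumed slug
  'complex-reasoning': 'openai/gpt-5.4', // assumed slug
  'software-engineering': 'openai/gpt-5.3-codex', // assumed slug
  'hard-deliberation': 'openai/o3', // assumed slug
}

// Returns the model slug for a task, defaulting to the cheapest tier's use cases.
function pickModel(task: Task): string {
  return MODEL_BY_TASK[task]
}
```

Keeping the mapping in one table makes it easy to move a task up a tier when evaluation results show that the smaller model is not reliable enough for it.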
Conclusion
GPT 5.4 Nano brings GPT-5.4-generation quality to the most affordable tier. For high-volume sub-agent workflows, classification, and batch processing through AI Gateway, it provides near-Mini performance at a lower per-token cost.