GLM 4.7 Flash

GLM 4.7 Flash is the speed-optimized variant in Z.ai's GLM-4.7 generation, released January 19, 2026. It delivers faster inference for high-throughput workloads while retaining the coding, tool usage, and conversational improvements introduced in GLM-4.7.

Implicit Caching, Reasoning, Tool Use
index.ts

    import { streamText } from 'ai'

    const result = streamText({
      model: 'zai/glm-4.7-flash',
      prompt: 'Why is the sky blue?'
    })
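
The snippet above starts a streaming request but does not read the output. As a minimal sketch, assuming the returned `result` exposes its text as an async iterable of string chunks (the AI SDK documents this as `result.textStream`), the chunks can be drained into one string. The helper `collectText` and the stand-in `fakeStream` are hypothetical names used only for illustration.

```typescript
// Hypothetical helper: drain any async iterable of text chunks into one string.
async function collectText(chunks: AsyncIterable<string>): Promise<string> {
  let text = ''
  for await (const chunk of chunks) {
    text += chunk
  }
  return text
}

// Stand-in stream for illustration only, so the sketch runs without network access.
// A real call would pass result.textStream instead.
async function* fakeStream(): AsyncGenerator<string> {
  yield 'Rayleigh '
  yield 'scattering.'
}
```

With the real SDK, the call would look like `const text = await collectText(result.textStream)`. Printing each chunk inside the loop instead of concatenating gives incremental output.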

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
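
One way to express a provider preference is sketched below. It assumes the gateway reads an ordered list of provider slugs from `providerOptions.gateway.order`; the slugs `'bedrock'` and `'zai'` are assumptions and should be replaced with the slugs copied from this page. The helper `withProviderOrder` and the interface names are hypothetical and are not part of any SDK.

```typescript
// Assumed shape of the gateway routing options; verify against the gateway docs.
interface GatewayRouting {
  order?: string[] // provider slugs to try, in priority order
}

interface RequestOptions {
  model: string
  prompt: string
  providerOptions: { gateway: GatewayRouting }
}

// Hypothetical helper: build request options with an explicit provider order.
function withProviderOrder(
  model: string,
  prompt: string,
  order: string[]
): RequestOptions {
  if (order.length === 0) {
    throw new Error('order must list at least one provider slug')
  }
  // Copy the array so later changes by the caller do not affect the request.
  return { model, prompt, providerOptions: { gateway: { order: [...order] } } }
}

// Assumed slugs: prefer Amazon Bedrock, then fall back to Z.ai.
const options = withProviderOrder(
  'zai/glm-4.7-flash',
  'Why is the sky blue?',
  ['bedrock', 'zai']
)
```

The resulting object could then be passed to `streamText(options)`. Keeping the routing in a small builder makes the preference easy to change in one place.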

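| Provider | Context | Latency | Throughput | Input | Output | Cache | Web Search (Per Query) | Capabilities | ZDR | No Training | Release Date |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Z.ai | 200K | 0.9s | 127 tps | $0.07/M | $0.40/M | Read: $0.01/M; Write: not shown | | +1 | | | 01/19/2026 |
| Amazon Bedrock | 200K | 0.4s | 83 tps | $0.07/M | $0.40/M | | | | | | 01/19/2026 |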
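
The listed figures can be turned into rough per-request estimates. The sketch below is a back-of-the-envelope calculation, not a billing or benchmarking tool. It assumes that Latency means time to first token, that Throughput is a steady output rate in tokens per second, and that cache pricing is ignored. All type and function names are hypothetical.

```typescript
interface ProviderStats {
  name: string
  latencySeconds: number // assumed: time to first token
  throughputTps: number // assumed: steady output tokens per second
  inputUsdPerMillion: number
  outputUsdPerMillion: number
}

// Values copied from the provider listing above.
const providers: ProviderStats[] = [
  { name: 'Z.ai', latencySeconds: 0.9, throughputTps: 127, inputUsdPerMillion: 0.07, outputUsdPerMillion: 0.4 },
  { name: 'Amazon Bedrock', latencySeconds: 0.4, throughputTps: 83, inputUsdPerMillion: 0.07, outputUsdPerMillion: 0.4 },
]

// Cost in USD for one request, ignoring cache reads and writes.
function estimateCostUsd(p: ProviderStats, inputTokens: number, outputTokens: number): number {
  return (inputTokens * p.inputUsdPerMillion + outputTokens * p.outputUsdPerMillion) / 1e6
}

// Rough wall-clock time: initial latency plus generation time at the listed rate.
function estimateSeconds(p: ProviderStats, outputTokens: number): number {
  return p.latencySeconds + outputTokens / p.throughputTps
}

// Print a comparison for a request with 10,000 input and 1,000 output tokens.
for (const p of providers) {
  const cost = estimateCostUsd(p, 10_000, 1_000)
  const seconds = estimateSeconds(p, 1_000)
  console.log(`${p.name}: ~$${cost.toFixed(4)}, ~${seconds.toFixed(1)}s`)
}
```

Under these assumptions both providers cost the same per token, so the choice comes down to timing: the lower-latency provider finishes short replies first, while the higher-throughput provider pulls ahead once a reply exceeds roughly 120 output tokens.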