glm-5.2 on the general OpenAI-compatible API path in addition to the GLM Coding Plan materials, but the public pricing table still does not publish a dedicated GLM-5.2 token-pricing row and the release-notes page still stops at GLM-5.1.

Effective pricing across providers over the past hour and 30-day pricing history by meter.
Based on the last hour of usage: we total provider spend, divide by total token volume, and express it as USD per 1M tokens. Providers with more traffic naturally carry more weight.
Weighted Avg Input Price: -- per 1M tokens (past hour)
Weighted Avg Output Price: -- per 1M tokens (past hour)
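
The weighted average described above can be sketched as follows. This is a minimal illustration, not the page's actual implementation: the `ProviderUsage` record and the `weighted_avg_price_per_million` function are hypothetical names, and the token volumes are invented for the example. Only the two per-1M input prices ($1.400 for z.AI, $1.750 for Venice) come from the provider table on this page.

```python
from dataclasses import dataclass


@dataclass
class ProviderUsage:
    """Hypothetical usage record for one provider over the past hour."""
    name: str
    spend_usd: float  # total spend on this meter in the window
    tokens: int       # total tokens billed on this meter in the window


def weighted_avg_price_per_million(usage):
    """Total spend divided by total tokens, expressed as USD per 1M tokens.

    Returns None when no tokens were recorded, which would correspond
    to the "--" placeholder shown when there is no data.
    """
    total_spend = sum(u.spend_usd for u in usage)
    total_tokens = sum(u.tokens for u in usage)
    if total_tokens == 0:
        return None
    return total_spend / total_tokens * 1_000_000


# Invented volumes: 3M input tokens at $1.40 per 1M, 1M input tokens at $1.75 per 1M.
usage = [
    ProviderUsage("z.AI", spend_usd=1.40 * 3, tokens=3_000_000),
    ProviderUsage("Venice", spend_usd=1.75 * 1, tokens=1_000_000),
]

avg = weighted_avg_price_per_million(usage)
print(f"Weighted avg input price: ${avg:.4f} per 1M tokens")
```

With these made-up volumes the result is about $1.4875 per 1M tokens, closer to the $1.40 rate than the simple mean of the two prices ($1.575), because the provider with three times the traffic carries three times the weight.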
| Provider | Input ($ per 1M tokens) | Output ($ per 1M tokens) | Cache Token % |
|---|---|---|---|
| AtlasCloud | $1.400 | $4.400 | -- |
| Baseten | $1.500 | $4.500 | -- |
| Cloudflare | $1.400 | $4.400 | -- |
| CrofAI | -- | -- | -- |
| DeepInfra | $1.400 | $4.400 | -- |
| Fireworks | $1.400 | $4.400 | -- |
| GMICloud | -- | -- | -- |
| Nebius Token Factory | $1.400 | $4.400 | -- |
| NovitaAI | $1.400 | $4.400 | -- |
| Venice | $1.750 | $5.500 | -- |
| Venice (E2EE) | $1.750 | $5.750 | -- |
| z.AI | $1.400 | $4.400 | -- |
Meter