Start calling this model with endpoint-specific examples.
Headline benchmark standings and comparison context.
Key dates, capabilities, and model metadata.
glm-5.2 on the general OpenAI-compatible API path in addition to the GLM Coding Plan materials, but the public pricing table still does not publish a dedicated GLM-5.2 token-pricing row and the release-notes page still stops at GLM-5.1.Start calling this model with endpoint-specific examples.
Headline benchmark standings and comparison context.
Key dates, capabilities, and model metadata.
Start calling this model with endpoint-specific examples.
Headline benchmark standings and comparison context.
Key dates, capabilities, and model metadata.
16 Jun 2026
16 Jun 2026
License
MIT
Input
Output
Core latency and throughput trends from recent traffic.
Throughput
16 Jun 2026Latency
16 Jun 2026E2E Latency
16 Jun 2026Latency
Throughput
Uptime
Latency
Throughput
Uptime
Latency
Throughput
Uptime
Latency
Throughput
Uptime
Latency
Throughput
Uptime
Latency
Throughput
Uptime
Latency
Throughput
Uptime
Latency
Throughput
Uptime
Latency
Throughput
Uptime
Latency
Throughput
Uptime
Latency
Throughput
Uptime
Latency
Throughput
Uptime
This provider is not live for the selected tier yet. Pricing will appear once availability starts.
Start calling this model with endpoint-specific examples.
Choose a supported endpoint, pick a main language, then select the example style you want to copy.
import AIStats from '@ai-stats/sdk';
const client = new AIStats({
apiKey: process.env.AI_STATS_API_KEY,
});
const response = await client.generateResponse({
"model": "z-ai/glm-5.2",
"input": "Give me one fun fact about cURL.",
"service_tier": "standard"
});
const outputText = response.output
?.flatMap((item) => item.content ?? [])
.find((item) => item.type === "output_text")
?.text;
console.log(outputText ?? response);Parameters
Aggregated across active providers for the responses route.
Routing will select a compatible provider when a parameter narrows availability, so this list stays model-facing instead of provider-facing.
| Parameter | Description |
|---|---|
max_tokens | Caps output length on endpoints and providers that use the max_tokens field name. |
tools | Defines callable tools or functions the model can invoke. |
temperature | Controls how random token selection can be. |
top_p | Applies nucleus sampling by limiting candidates to a probability mass threshold. |
stop | Defines one or more sequences that terminate generation early. |
tool_choice | Controls which tool, if any, the model should call. |
response_format | Requests plain text, JSON, or schema-constrained output formats. |
structured_outputs | Capability signal for reliable schema-constrained output workflows. |
reasoning | Provider-specific reasoning configuration for reasoning-capable APIs. |
include_reasoning | Requests reasoning content or reasoning summaries in responses where supported. |
Select a route to update the request snippet and compatibility details.
/v1/responses/v1/chat/completions/v1/messages