Headline benchmark standings and comparison context.
Top benchmark results for google/gemma-3-12b.
Detailed benchmark comparisons now live in the Compare tool.
Key dates, capabilities, and model metadata.
12 Mar 2025
12 Mar 2025
Parameters
12,000,000,000
License
Gemma
Training Tokens
12,000,000,000,000
Input
Output
Based on the last hour of usage: we total provider spend, divide by total token volume, and express it as USD per 1M tokens. Providers with more traffic naturally carry more weight.
Weighted Avg Input Price
--
per 1M tokens (past hour)
Weighted Avg Output Price
--
per 1M tokens (past hour)
| Provider | Input $/Million | Output $/Million | Cache Token % |
|---|---|---|---|
Amazon Bedrock | $0.090 | $0.290 | -- |
DeepInfra | $0.040 | $0.130 | -- |
Google AI Studio | -- | -- | -- |
Meter
Public apps observed in gateway request traffic for this model.
Core latency and throughput trends from recent traffic.
Latency
Throughput
Uptime
Total Context
Max Output
Latency
Throughput
Uptime
Start calling this model with endpoint-specific examples.
# 1) Set your key
export AI_STATS_API_KEY="aistats_***"
# 2) Send a request
curl -s https://api.phaseo.app/v1/responses \
-H "Authorization: Bearer $AI_STATS_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "google/gemma-3-12b",
"input": "Give me one fun fact about cURL."
}'