Headline benchmark standings and comparison context.
Top benchmark results for google/gemma-3-27b.
Key dates, capabilities, and model metadata.
12 Mar 2025
12 Mar 2025
Parameters
27,000,000,000
License
Gemma
Training Tokens
14,000,000,000,000
Input
Output
Detailed benchmark comparisons now live in the Compare tool.
Public apps observed in gateway request traffic for this model.
Based on the last hour of usage: we total provider spend, divide by total token volume, and express it as USD per 1M tokens. Providers with more traffic naturally carry more weight.
Weighted Avg Input Price
--
per 1M tokens (past hour)
Weighted Avg Output Price
--
per 1M tokens (past hour)
| Provider | Input $/Million | Output $/Million | Cache Token % |
|---|---|---|---|
Amazon Bedrock | $0.230 | $0.380 | -- |
DeepInfra | $0.080 | $0.160 | -- |
Google AI Studio | -- | -- | -- |
Nebius Token Factory | $0.200 | $0.600 | -- |
NovitaAI | $0.119 | $0.200 | -- |
Venice | $0.120 | $0.200 | -- |
Venice (E2EE) | -- | -- | -- |
Meter
Start calling this model with endpoint-specific examples.
# 1) Set your key
export AI_STATS_API_KEY="aistats_***"
# 2) Send a request
curl -s https://api.phaseo.app/v1/responses \
-H "Authorization: Bearer $AI_STATS_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "google/gemma-3-27b",
"input": "Give me one fun fact about cURL."
}'Core latency and throughput trends from recent traffic.
Latency
Throughput
Uptime
Total Context
Max Output
Latency
Throughput
Uptime
Total Context
Max Output
Latency
Throughput
Uptime
Total Context
Max Output
Latency
Throughput
Uptime
Latency
Throughput
Uptime
Latency
Throughput
Uptime
Total Context