Gemma 3 1B Pricing, Benchmarks, Latency & Providers


Google AI Studio	--	--	--	--	--

Gemma 3 1B Pricing, Benchmarks, Latency & Providers | Phaseo

Pricing

Weighted provider pricing over the last 30 days, with recent route pricing history below.

Effective pricing

Weighted by routed usage over the last 30 days; external and non-routable providers are excluded.

Weighted input price

Per 1M tokens

Weighted output price

Per 1M tokens


Google AI Studio	--	--	--

No 7-day effective pricing is available for the selected service tier.

Quickstart

Start calling this model with endpoint-specific examples.

Get an API key

Create an API key inSettingsKeysand store it asPHASEO_API_KEY

Keep it server-side, never commit it, and rotate it immediately if exposed.

Send the request

Choose a supported endpoint, pick a main language, then select the example style you want to copy.

Supported endpoints

Supported API reference routes for this model.

POST

/v1/responses

Responses API reference

POST

/v1/chat/completions

Chat Completions API reference

POST

/v1/messages

Messages API reference

Streaming

import Phaseo from '@phaseo/sdk';

const client = new Phaseo({
  apiKey: process.env.PHASEO_API_KEY,
});

const response = await client.generateResponse({
    "model": "google/gemma-3-1b",
    "input": "Give me one fun fact about cURL."
});

const outputText = response.output
  ?.flatMap((item) => item.content ?? [])
  .find((item) => item.type === "output_text")
  ?.text;

console.log(outputText ?? response);

Accepted IDsClick to use and copy

Parameters

Aggregated across active providers for the responses route.

Routing will select a compatible provider when a parameter narrows availability, so this list stays model-facing instead of provider-facing.

View all parameters

Parameter	Description
`temperature`	Controls how random token selection can be.
`top_p`	Applies nucleus sampling by limiting candidates to a probability mass threshold.
`max_tokens`	Caps output length on endpoints and providers that use the max_tokens field name.
`seed`	Requests deterministic sampling when the upstream provider supports seeded generation.
`stop`	Defines one or more sequences that terminate generation early.
`tool_choice`	Controls which tool, if any, the model should call.
`tools`	Defines callable tools or functions the model can invoke.
`response_format`	Requests plain text, JSON, or schema-constrained output formats.
`structured_outputs`	Capability signal for reliable schema-constrained output workflows.

Docs:TypeScript SDK Responses API

Google: Gemma 3 1B

Providers

Providers

Performance

Pricing

Benchmarks

Activity

Apps Using This Model

Model Uptime

Quickstart

About

Subscriptions

Google: Gemma 3 1B

Performance