Google Gemini Plans Pricing

The Gemini API is offered in three service tiers: Free (limited models, no charge), Paid (higher rate limits, context caching, and the Batch API at 50% off), and Enterprise (custom security, provisioned throughput, and volume discounts). Models are priced per million tokens, with separate input and output rates and a higher price band for very long prompts. Audio input is priced higher than text, image, or video input.

Generative AI · LLM · Google · AI Infrastructure

Plans

Free Tier freemium

Free access to a subset of Gemini models (e.g., Gemini 3.1 Flash-Lite Preview and Gemini 2.5 Flash), with input and output tokens at no charge. Subject to lower per-minute rate limits than the paid tier.

Free input/output tokens (tokens · usage) 0.00 USD

Gemini 3.1 Flash-Lite Preview (free models)
Gemini 2.5 Flash (limited)
Lower rate limits
Best-effort availability

Paid - Gemini 3.1 Pro Preview usage-based

Per-token pricing for Gemini 3.1 Pro Preview with banded pricing for prompts above 200K tokens.

Input tokens (<=200K) (tokens · usage) 2.00 USD per 1M tokens

Input tokens (>200K) (tokens · usage) 4.00 USD per 1M tokens

Output tokens (<=200K prompt) (tokens · usage) 12.00 USD per 1M tokens

Output tokens (>200K prompt) (tokens · usage) 18.00 USD per 1M tokens

Gemini 3.1 Pro (Preview)
Long-context banded pricing
Context caching
Batch API (50% off)
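
The banded rates above can be turned into a rough per-request estimate. The following is a minimal sketch, not an official calculator: the function name is hypothetical, and it assumes that the band is selected by prompt size and applies to every token in the request, and that the Batch API discount covers input and output alike. The listing does not confirm either assumption.

```python
from decimal import Decimal

# List prices from the plan above, in USD per 1M tokens.
BAND_THRESHOLD = 200_000  # prompt tokens
RATES = {
    "short": {"input": Decimal("2.00"), "output": Decimal("12.00")},
    "long": {"input": Decimal("4.00"), "output": Decimal("18.00")},
}
BATCH_DISCOUNT = Decimal("0.5")  # Batch API (50% off)
PER_MILLION = Decimal(1_000_000)


def pro_request_cost(prompt_tokens: int, output_tokens: int, batch: bool = False) -> Decimal:
    """Estimate the USD cost of one Gemini 3.1 Pro Preview request.

    Assumption: the band is chosen by prompt size and applies to every
    token in the request, not only to tokens past the threshold.
    """
    band = "long" if prompt_tokens > BAND_THRESHOLD else "short"
    rates = RATES[band]
    cost = (prompt_tokens * rates["input"] + output_tokens * rates["output"]) / PER_MILLION
    if batch:
        # Assumption: the 50% discount applies to input and output alike.
        cost *= BATCH_DISCOUNT
    return cost
```

Under these assumptions, a 300,000-token prompt with 10,000 output tokens comes to 1.38 USD, or 0.69 USD through the Batch API.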

Paid - Gemini 3.1 Flash-Lite Preview usage-based

Per-token pricing for Gemini 3.1 Flash-Lite Preview. Audio input is priced higher than text, image, or video input.

Input tokens (text / image / video) (tokens · usage) 0.25 USD per 1M tokens

Input tokens (audio) (tokens · usage) 0.50 USD per 1M tokens

Output tokens (tokens · usage) 1.50 USD per 1M tokens

Gemini 3.1 Flash-Lite (Preview)
Multimodal input (text/image/video/audio)
Batch API (50% off)

Paid - Gemini 2.5 Flash usage-based

Per-token pricing for Gemini 2.5 Flash. Audio input is priced higher than text, image, or video input.

Input tokens (text / image / video) (tokens · usage) 0.30 USD per 1M tokens

Input tokens (audio) (tokens · usage) 1.00 USD per 1M tokens

Output tokens (tokens · usage) 2.50 USD per 1M tokens

Gemini 2.5 Flash
Multimodal input
Batch API (50% off)

Paid - Gemini 2.5 Flash-Lite usage-based

Per-token pricing for Gemini 2.5 Flash-Lite. It has the lowest list price of the models listed here, which suits cost-sensitive workloads.

Input tokens (text / image / video) (tokens · usage) 0.10 USD per 1M tokens

Input tokens (audio) (tokens · usage) 0.30 USD per 1M tokens

Output tokens (tokens · usage) 0.40 USD per 1M tokens

Gemini 2.5 Flash-Lite
Lowest cost tier
Batch API (50% off)
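
The three Flash-family plans above share the same shape: one input rate for text, image, and video, a higher input rate for audio, and one output rate. A small rate table makes them easy to compare for a given workload. This is an illustrative sketch only: the dictionary keys and function names are hypothetical labels, not confirmed API model IDs, and the rates are the list prices shown above without the Batch API discount.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the three Flash-family plans above.
# The keys are illustrative labels, not confirmed API model IDs.
RATES = {
    "3.1-flash-lite-preview": {
        "text_image_video": Decimal("0.25"),
        "audio": Decimal("0.50"),
        "output": Decimal("1.50"),
    },
    "2.5-flash": {
        "text_image_video": Decimal("0.30"),
        "audio": Decimal("1.00"),
        "output": Decimal("2.50"),
    },
    "2.5-flash-lite": {
        "text_image_video": Decimal("0.10"),
        "audio": Decimal("0.30"),
        "output": Decimal("0.40"),
    },
}
PER_MILLION = Decimal(1_000_000)


def estimate_cost(model: str, non_audio_tokens: int, audio_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD list-price cost of a workload on one model."""
    r = RATES[model]
    total = (
        non_audio_tokens * r["text_image_video"]
        + audio_tokens * r["audio"]
        + output_tokens * r["output"]
    )
    return total / PER_MILLION


def cheapest_model(non_audio_tokens: int, audio_tokens: int, output_tokens: int) -> str:
    """Return the label of the model with the lowest estimated cost."""
    return min(
        RATES,
        key=lambda m: estimate_cost(m, non_audio_tokens, audio_tokens, output_tokens),
    )
```

For example, 50,000 text tokens, 20,000 audio tokens, and 2,000 output tokens on Gemini 2.5 Flash come to 0.04 USD at list price.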

Add-on - Google Search Grounding usage-based

Per-search pricing for Google Search grounding requests. The first 5,000 prompts per month are free; this allowance is shared across Gemini 3 models.

Free monthly prompts (requests · month) 0.00 USD

Search grounding overage (requests · month) 14.00 USD

Google Search grounding for Gemini 3
Free 5K prompts/month shared across Gemini 3
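
The free allowance can be modeled as a simple monthly threshold. The sketch below is hedged in one important respect: the listing gives an overage price of 14.00 USD but does not state how many prompts that price covers, so the billing unit is a required argument (`prompts_per_rate_unit`, a hypothetical name) rather than a built-in default.

```python
from decimal import Decimal

FREE_PROMPTS_PER_MONTH = 5_000        # shared across Gemini 3 models
OVERAGE_RATE_USD = Decimal("14.00")   # billing unit not stated in the listing


def billable_prompts(prompts_this_month: int) -> int:
    """Grounded prompts beyond the free monthly allowance."""
    return max(0, prompts_this_month - FREE_PROMPTS_PER_MONTH)


def overage_cost(prompts_this_month: int, prompts_per_rate_unit: int) -> Decimal:
    """Estimate the monthly overage in USD.

    prompts_per_rate_unit is how many prompts the 14.00 USD rate covers.
    The listing does not state it, so the caller must supply it.
    """
    billable = billable_prompts(prompts_this_month)
    return Decimal(billable) * OVERAGE_RATE_USD / prompts_per_rate_unit
```

Pass the total count of grounded prompts across all Gemini 3 models, since the allowance is shared rather than per model.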

Enterprise (Vertex AI) enterprise

Enterprise SKU sold via Google Cloud / Vertex AI with provisioned throughput, custom security, VPC-SC, customer-managed encryption keys (CMEK), and volume discount commitments. Pricing is custom.

Provisioned throughput (contract · month) contact sales

Provisioned throughput
VPC-SC + CMEK + data residency controls
Volume / commit discounts
Vertex AI integration
