Gemini API Pricing — All Google Model Costs (2026)

This page lists the API cost of every Google Gemini model, updated for 2026, from the ultra-efficient Flash Lite at $0.075/1M input tokens to the Pro series with breakpoint pricing. Rates are sourced from the official Google Gemini pricing page. Compare with Anthropic Claude and OpenAI, or see all models in our LLM pricing comparison.

All Gemini Models

Model | Input $/1M tokens | Output $/1M tokens | Type | Notes
gemini-2.0-flash | $0.100 | $0.400 | Flat |
gemini-2.0-flash-lite | $0.075 | $0.300 | Flat |
gemini-2.5-computer-use | $1.25 / $2.50 | $10.00 / $15.00 | Breakpoint | Threshold: 200K tokens
gemini-2.5-flash | $0.300 | $2.50 | Flat |
gemini-2.5-flash-image | $0.300 | $2.50 | Multimodal | text, image
gemini-2.5-flash-lite | $0.100 | $0.400 | Flat |
gemini-2.5-flash-native-audio | $0.500 | $2.00 | Multimodal | text, audio
gemini-2.5-flash-preview-tts | $0.500 | $10.00 | Flat |
gemini-2.5-pro | $1.25 / $2.50 | $10.00 / $15.00 | Breakpoint | Threshold: 200K tokens
gemini-2.5-pro-preview-tts | $1.00 | $20.00 | Flat |
gemini-3-flash | $0.500 | $3.00 | Flat |
gemini-3-pro-image-preview | $2.00 | $12.00 | Multimodal | text, image
gemini-3-pro-preview | $2.00 / $4.00 | $12.00 / $18.00 | Breakpoint | Threshold: 200K tokens
gemini-3.1-pro-preview | $2.00 / $4.00 | $12.00 / $18.00 | Breakpoint | Threshold: 200K tokens

Gemini API Cost Breakdown

Gemini API costs depend on which model you use and how many tokens you process. The cheapest option is Gemini 2.0 Flash Lite at just $0.075 per million input tokens — one of the lowest-cost LLM APIs on the market. Mid-tier models like Gemini 2.5 Flash offer stronger reasoning at $0.30/1M input, while Gemini 2.5 Pro starts at $1.25/1M input for complex tasks.

Across the models listed above, output tokens cost roughly 4-8x as much as input tokens, and 20x on the two TTS preview models. Gemini API pricing is competitive with both OpenAI and Anthropic, especially at the budget and mid-range tiers. Use our LLM cost calculator to estimate your specific Gemini API costs.
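As a rough sketch of the arithmetic behind these figures, the cost of one request on a flat-priced model is each token count divided by one million and multiplied by the matching rate. The function names and the sample token counts below are illustrative assumptions; the rates are copied from the model table.

```python
# Flat-rate prices in USD per 1M tokens (input, output),
# copied from the model table on this page.
FLAT_RATES = {
    "gemini-2.0-flash-lite": (0.075, 0.30),
    "gemini-2.0-flash": (0.10, 0.40),
    "gemini-2.5-flash": (0.30, 2.50),
    "gemini-3-flash": (0.50, 3.00),
}


def flat_request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request on a flat-priced model."""
    input_rate, output_rate = FLAT_RATES[model]
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


def output_to_input_ratio(model: str) -> float:
    """Return how many times more an output token costs than an input token."""
    input_rate, output_rate = FLAT_RATES[model]
    return output_rate / input_rate


if __name__ == "__main__":
    # Hypothetical request: 2,000 input tokens and 500 output tokens.
    print(f"${flat_request_cost('gemini-2.5-flash', 2_000, 500):.5f}")  # $0.00185
    for name in FLAT_RATES:
        print(name, round(output_to_input_ratio(name), 1))
```

Because output is priced several times higher than input, trimming response length usually saves more per token than trimming the prompt.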

How Gemini Pricing Works

Gemini models use three pricing structures. Flat pricing charges a fixed rate per million tokens regardless of context length. This applies to Flash and Flash Lite models, making costs simple and predictable.

Breakpoint pricing applies to Pro-tier models like Gemini 2.5 Pro and Gemini 3 Pro. These models have a lower rate for requests under 200K input tokens and a higher rate above that threshold, letting you benefit from lower costs for typical workloads.
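A minimal sketch of breakpoint pricing follows. It makes two assumptions that this page does not spell out: the tier is chosen per request from the input token count and then applies to both input and output tokens, and a request of exactly 200K input tokens is billed at the lower rate. The class and function names are hypothetical; the rates come from the model table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BreakpointRates:
    """USD per 1M tokens below and above the input-token threshold."""
    input_low: float
    input_high: float
    output_low: float
    output_high: float
    threshold: int = 200_000


# Rates copied from the model table on this page.
GEMINI_2_5_PRO = BreakpointRates(1.25, 2.50, 10.00, 15.00)
GEMINI_3_PRO_PREVIEW = BreakpointRates(2.00, 4.00, 12.00, 18.00)


def breakpoint_request_cost(
    rates: BreakpointRates, input_tokens: int, output_tokens: int
) -> float:
    """Return the USD cost of one request under breakpoint pricing.

    Assumption: the input size of the request selects the tier, and that
    tier's rates apply to every input and output token in the request.
    """
    above = input_tokens > rates.threshold
    input_rate = rates.input_high if above else rates.input_low
    output_rate = rates.output_high if above else rates.output_low
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


if __name__ == "__main__":
    # Hypothetical requests with 2,000 output tokens each.
    print(f"${breakpoint_request_cost(GEMINI_2_5_PRO, 100_000, 2_000):.3f}")  # $0.145
    print(f"${breakpoint_request_cost(GEMINI_2_5_PRO, 300_000, 2_000):.3f}")  # $0.780
```

Under these assumptions, tripling the input from 100K to 300K tokens raises the cost by more than five times, because the whole request moves to the higher tier.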

Multimodal pricing is used by image and audio models. These have separate rates for each modality. For example, Gemini 3 Pro Image charges $12/1M for text output tokens but $120/1M for image output tokens.
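To illustrate how per-modality rates add up, here is a small sketch for Gemini 3 Pro Image. It uses the $2.00/1M input and $12.00/1M output rates from the model table together with the $120/1M image output rate quoted above. The function name and the token counts are hypothetical; in particular, the number of output tokens billed per generated image is not stated on this page.

```python
# USD per 1M tokens for gemini-3-pro-image-preview.
INPUT_RATE = 2.00          # from the model table
TEXT_OUTPUT_RATE = 12.00   # from the model table
IMAGE_OUTPUT_RATE = 120.00 # image output rate quoted in the text


def multimodal_request_cost(
    input_tokens: int, text_output_tokens: int, image_output_tokens: int
) -> float:
    """Return the USD cost of one request with text and image output."""
    total = (
        input_tokens * INPUT_RATE
        + text_output_tokens * TEXT_OUTPUT_RATE
        + image_output_tokens * IMAGE_OUTPUT_RATE
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 1,000 input tokens, 200 text output tokens,
    # and 1,000 image output tokens (an assumed figure, not a quoted one).
    print(f"${multimodal_request_cost(1_000, 200, 1_000):.4f}")  # $0.1244
```

In this example the image tokens account for almost all of the cost, which is why image output deserves separate tracking.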

Gemini vs Claude vs GPT: Price Comparison

How does Gemini API pricing stack up against Claude and GPT? Here's a side-by-side comparison at each price tier. See our full LLM pricing comparison for all models.

Tier | Model | Input / 1M | Output / 1M
Budget | Google gemini-2.0-flash-lite | $0.075 | $0.30
Budget | Anthropic claude-haiku-3 | $0.25 | $1.25
Budget | OpenAI gpt-4.1-nano | $0.10 | $0.40
Mid-range | Google gemini-2.5-flash | $0.30 | $2.50
Mid-range | Anthropic claude-haiku-4-5 | $1.00 | $5.00
Mid-range | OpenAI gpt-5-mini | $0.25 | $2.00
Flagship | Google gemini-2.5-pro | $1.25 | $10.00
Flagship | Anthropic claude-sonnet-4-5 | $3.00 | $15.00
Flagship | OpenAI gpt-5 | $1.25 | $10.00
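Per-token rates only become comparable once they are applied to a workload. The sketch below ranks the mid-range and flagship models from the comparison table for a given token volume. The workload size is a made-up example and the function name is hypothetical.

```python
# USD per 1M tokens (input, output), copied from the comparison table.
RATES = {
    "Mid-range": {
        "gemini-2.5-flash": (0.30, 2.50),
        "claude-haiku-4-5": (1.00, 5.00),
        "gpt-5-mini": (0.25, 2.00),
    },
    "Flagship": {
        # gemini-2.5-pro rate for requests under 200K input tokens.
        "gemini-2.5-pro": (1.25, 10.00),
        "claude-sonnet-4-5": (3.00, 15.00),
        "gpt-5": (1.25, 10.00),
    },
}


def rank_by_cost(tier: str, input_tokens: int, output_tokens: int):
    """Return (model, cost_usd) pairs for one tier, cheapest first."""
    costs = [
        (model, (input_tokens * in_rate + output_tokens * out_rate) / 1_000_000)
        for model, (in_rate, out_rate) in RATES[tier].items()
    ]
    return sorted(costs, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Hypothetical workload: 10M input tokens and 2M output tokens.
    for tier in RATES:
        print(tier)
        for model, cost in rank_by_cost(tier, 10_000_000, 2_000_000):
            print(f"  {model}: ${cost:.2f}")
```

The ranking can change with the input-to-output mix, so it is worth rerunning with token counts that resemble your own traffic.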

Gemini API Monthly Cost Estimates

How much will the Gemini API cost you per month? Google's pricing is among the most competitive, especially at the budget tier. Use our LLM cost calculator for a precise estimate.

Light Use

$1-10/mo

  • Personal projects
  • <1K requests/day
  • Flash Lite for most tasks

Medium Use

$10-75/mo

  • Small team apps
  • 1-5K requests/day
  • Mix of Flash and Pro

Heavy Use

$75-400/mo

  • Production apps
  • 5-20K requests/day
  • Pro for quality tasks

Enterprise

$400+/mo

  • Large-scale deployments
  • 20K+ requests/day
  • Pro with multimodal
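A monthly figure can be sketched from request volume and average token counts. The per-request token counts below are assumptions chosen for illustration (this page does not state the ones behind its ranges), and the function name is hypothetical; the rates are the Gemini 2.0 Flash Lite rates from the model table.

```python
def monthly_cost(
    requests_per_day: int,
    input_tokens_per_request: int,
    output_tokens_per_request: int,
    input_rate: float,
    output_rate: float,
    days: int = 30,
) -> float:
    """Estimate monthly USD cost; rates are in USD per 1M tokens."""
    monthly_input = requests_per_day * input_tokens_per_request * days
    monthly_output = requests_per_day * output_tokens_per_request * days
    return (monthly_input * input_rate + monthly_output * output_rate) / 1_000_000


if __name__ == "__main__":
    # Assumed light workload: 800 requests/day, 500 input and 200 output
    # tokens per request, on gemini-2.0-flash-lite ($0.075 in, $0.30 out).
    print(f"${monthly_cost(800, 500, 200, 0.075, 0.30):.2f}/mo")  # $2.34/mo
```

Longer prompts, longer responses, or a Pro model in the mix will move the result well outside this example, so treat the usage bands above as rough guides rather than quotes.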

Which Gemini Model Should You Use?

Google's model lineup covers everything from ultra-cheap inference to multimodal generation. Here's how to pick the right Gemini model for your use case.

Use Case | Recommended Model | Est. Monthly Cost | Why This Model
High-volume classification | Gemini 2.0 Flash Lite | $1-10 | One of the cheapest LLM APIs available
General chatbot | Gemini 2.5 Flash | $10-50 | Strong reasoning at very low cost
Code & complex reasoning | Gemini 2.5 Pro | $30-150 | Best Gemini model for hard tasks
Image generation | Gemini 3 Pro Image | $50-300 | Native multimodal output
Real-time applications | Gemini 2.0 Flash | $3-20 | Fastest inference, low latency

5 Ways to Reduce Your Gemini API Costs

1. Use Flash Lite for simple tasks

At $0.075/1M input tokens, Gemini 2.0 Flash Lite is about 17x cheaper on input than Gemini 2.5 Pro ($1.25/1M). Use it for classification, extraction, and any task that doesn't need deep reasoning.

2. Stay under the 200K breakpoint

Gemini Pro models double their input cost above 200K tokens, and their output cost rises by 50%. Split long documents into chunks or use Flash models for long-context tasks.

3. Take advantage of the free tier

Google offers a generous free tier for Gemini API through AI Studio. For low-volume projects, you may not need to pay anything.

4. Be mindful of multimodal costs

Image output tokens on Gemini 3 Pro Image cost $120/1M, significantly more than text. Only use multimodal models when you actually need image or audio output.

5. Monitor usage with the ModelPricing API

Track your Gemini API spending programmatically to identify cost spikes and optimize model selection.
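The breakpoint tip above can be checked with a little arithmetic. The sketch below compares the input cost of sending one long document to Gemini 2.5 Pro in a single request against splitting it into near-equal chunks of at most 200K tokens. It assumes the tier is chosen per request and that a request of exactly 200K tokens gets the lower rate; the helper names are hypothetical, and output tokens are ignored.

```python
import math

# gemini-2.5-pro input rates in USD per 1M tokens, from the model table.
LOW_RATE = 1.25
HIGH_RATE = 2.50
THRESHOLD = 200_000


def input_cost(tokens: int) -> float:
    """Input cost of one request; the tier is picked by request size."""
    rate = HIGH_RATE if tokens > THRESHOLD else LOW_RATE
    return tokens * rate / 1_000_000


def split_token_counts(total_tokens: int, max_chunk: int = THRESHOLD) -> list[int]:
    """Split a token count into the fewest near-equal chunks <= max_chunk."""
    if total_tokens <= 0:
        return []
    n = math.ceil(total_tokens / max_chunk)
    base, extra = divmod(total_tokens, n)
    # The first `extra` chunks carry one additional token each.
    return [base + 1] * extra + [base] * (n - extra)


if __name__ == "__main__":
    document_tokens = 300_000  # hypothetical document size
    chunks = split_token_counts(document_tokens)
    print(chunks)                                        # [150000, 150000]
    print(f"${input_cost(document_tokens):.3f}")         # $0.750
    print(f"${sum(input_cost(c) for c in chunks):.3f}")  # $0.375
```

Chunking halves the input bill in this example, but it also means the model never sees the whole document at once and any shared instructions are repeated in every chunk, so the saving is smaller in practice.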

Gemini Model Tiers

Flash Lite

The most affordable tier. Gemini 2.0 Flash Lite starts at just $0.075/1M input tokens, ideal for high-volume tasks like classification, extraction, and real-time applications.

Flash

Fast and cost-effective. Gemini 2.5 Flash and 3 Flash offer strong reasoning at $0.30-0.50/1M input tokens, great for coding, analysis, and multi-step workflows.

Pro

Most capable tier for complex reasoning and research. Gemini 2.5 Pro and 3 Pro use breakpoint pricing starting at $1.25-2/1M input tokens under 200K context.

Frequently Asked Questions

How much does the Gemini API cost?

Gemini API pricing varies by model. Gemini 2.0 Flash Lite starts at $0.075/1M input tokens, while Gemini 2.5 Pro costs $1.25-2.50/1M input tokens depending on context length.

What is the cheapest Gemini model?

Gemini 2.0 Flash Lite is the most affordable at $0.075/1M input tokens and $0.30/1M output tokens, making it one of the cheapest LLM APIs available.

Does Gemini have breakpoint pricing?

Yes. Gemini 2.5 Pro, Gemini 3 Pro, and Gemini 3.1 Pro use breakpoint pricing, where both input and output rates increase when input exceeds 200K tokens. Gemini 2.5 Computer Use is listed with the same structure.

How does Gemini pricing compare to GPT and Claude?

Gemini Flash models are among the most affordable LLM APIs. Gemini 2.0 Flash at $0.10/1M input tokens undercuts most competitors, while Gemini Pro models are competitively priced with Claude Sonnet and GPT-5.

Does Gemini support multimodal pricing?

Yes. Models like Gemini 3 Pro Image and Gemini 2.5 Flash Image have separate rates for text and image tokens, with image output costing significantly more than text output.

Is the Gemini API free?

Google offers a free tier for Gemini API with rate limits. Paid usage starts at $0.075/1M input tokens for Flash Lite. Check the Google AI Studio dashboard for current free-tier quotas.

How much does Gemini 3 cost?

Gemini 3 Flash costs $0.50/1M input tokens and $3/1M output tokens with flat pricing. Gemini 3 Pro Preview uses breakpoint pricing at $2/1M input (under 200K tokens) or $4/1M input (above 200K tokens).

Which Gemini model is best for coding?

Gemini 2.5 Pro is the strongest Gemini model for code generation and debugging, starting at $1.25/1M input tokens. For lighter coding tasks, Gemini 2.5 Flash at $0.30/1M input offers good performance at a much lower cost.

Estimate Your Gemini API Costs

Get accurate, real-time cost estimates for any Gemini model with our API. Or try the LLM cost calculator to compare across all providers.
