Gemini API Pricing — All Google Model Costs (2026)
Complete Gemini API cost breakdown for every Google model. From the ultra-efficient Flash Lite at $0.075/1M tokens to the powerful Pro series with breakpoint pricing, here is every Gemini model's cost updated for 2026. Rates sourced from the official Google Gemini pricing page. Compare with Anthropic Claude and OpenAI, or see all models in our LLM pricing comparison.
All Gemini Models
| Model | Input $/1M tokens | Output $/1M tokens | Type | Notes |
|---|---|---|---|---|
| gemini-2.0-flash | $0.100 | $0.400 | Flat | |
| gemini-2.0-flash-lite | $0.075 | $0.300 | Flat | |
| gemini-2.5-computer-use | $1.25 / $2.50 | $10.00 / $15.00 | Breakpoint | Threshold: 200K tokens |
| gemini-2.5-flash | $0.300 | $2.50 | Flat | |
| gemini-2.5-flash-image | $0.300 | $2.50 | Multimodal | text, image |
| gemini-2.5-flash-lite | $0.100 | $0.400 | Flat | |
| gemini-2.5-flash-native-audio | $0.500 | $2.00 | Multimodal | text, audio |
| gemini-2.5-flash-preview-tts | $0.500 | $10.00 | Flat | |
| gemini-2.5-pro | $1.25 / $2.50 | $10.00 / $15.00 | Breakpoint | Threshold: 200K tokens |
| gemini-2.5-pro-preview-tts | $1.00 | $20.00 | Flat | |
| gemini-3-flash | $0.500 | $3.00 | Flat | |
| gemini-3-pro-image-preview | $2.00 | $12.00 | Multimodal | text, image |
| gemini-3-pro-preview | $2.00 / $4.00 | $12.00 / $18.00 | Breakpoint | Threshold: 200K tokens |
| gemini-3.1-pro-preview | $2.00 / $4.00 | $12.00 / $18.00 | Breakpoint | Threshold: 200K tokens |
Gemini API Cost Breakdown
Gemini API costs depend on which model you use and how many tokens you process. The cheapest option is Gemini 2.0 Flash Lite at just $0.075 per million input tokens — one of the lowest-cost LLM APIs on the market. Mid-tier models like Gemini 2.5 Flash offer stronger reasoning at $0.30/1M input, while Gemini 2.5 Pro starts at $1.25/1M input for complex tasks.
For most workloads, output tokens cost 2-8x more than input tokens. Gemini API pricing is competitive with both OpenAI and Anthropic, especially at the budget and mid-range tiers. Use our LLM cost calculator to estimate your specific Gemini API costs.
How Gemini Pricing Works
Gemini models use three pricing structures. Flat pricing charges a fixed rate per million tokens regardless of context length. This applies to Flash and Flash Lite models, making costs simple and predictable.
Breakpoint pricing applies to Pro-tier models like Gemini 2.5 Pro and Gemini 3 Pro. These models have a lower rate for requests under 200K input tokens and a higher rate above that threshold, letting you benefit from lower costs for typical workloads.
Multimodal pricing is used by image and audio models. These have separate rates for each modality — for example, Gemini 3 Pro Image charges $2/1M for text output but $120/1M for image output tokens.
Gemini vs Claude vs GPT: Price Comparison
How does Gemini API pricing stack up against Claude and GPT? Here's a side-by-side comparison at each price tier. See our full LLM pricing comparison for all models.
| Tier | Model | Input / 1M | Output / 1M |
|---|---|---|---|
| Budget | Google gemini-2.0-flash-lite | $0.07 | $0.30 |
| Anthropic claude-haiku-3 | $0.25 | $1.25 | |
| OpenAI gpt-4.1-nano | $0.10 | $0.40 | |
| Mid-range | Google gemini-2.5-flash | $0.30 | $2.50 |
| Anthropic claude-haiku-4-5 | $1.00 | $5.00 | |
| OpenAI gpt-5-mini | $0.25 | $2.00 | |
| Flagship | Google gemini-2.5-pro | $1.25 | $10.00 |
| Anthropic claude-sonnet-4-5 | $3.00 | $15.00 | |
| OpenAI gpt-5 | $1.25 | $10.00 |
Gemini API Monthly Cost Estimates
How much will the Gemini API cost you per month? Google's pricing is among the most competitive, especially at the budget tier. Use our LLM cost calculator for a precise estimate.
Light Use
$1-10/mo
- Personal projects
- <1K requests/day
- Flash Lite for most tasks
Medium Use
$10-75/mo
- Small team apps
- 1-5K requests/day
- Mix of Flash and Pro
Heavy Use
$75-400/mo
- Production apps
- 5-20K requests/day
- Pro for quality tasks
Enterprise
$400+/mo
- Large-scale deployments
- 20K+ requests/day
- Pro with multimodal
Which Gemini Model Should You Use?
Google's model lineup covers everything from ultra-cheap inference to multimodal generation. Here's how to pick the right Gemini model for your use case.
| Use Case | Recommended Model | Est. Monthly Cost | Why This Model |
|---|---|---|---|
| High-volume classification | Flash Lite 2.0 | $1-10 | One of the cheapest LLM APIs available |
| General chatbot | Flash 2.5 | $10-50 | Strong reasoning at very low cost |
| Code & complex reasoning | Gemini 2.5 Pro | $30-150 | Best Gemini model for hard tasks |
| Image generation | Gemini 3 Pro Image | $50-300 | Native multimodal output |
| Real-time applications | Flash 2.0 | $3-20 | Fastest inference, low latency |
5 Ways to Reduce Your Gemini API Costs
Use Flash Lite for simple tasks
At $0.075/1M input tokens, Flash Lite 2.0 is 17x cheaper than Gemini 2.5 Pro. Use it for classification, extraction, and any task that doesn't need deep reasoning.
Stay under the 200K breakpoint
Gemini Pro models double their input cost above 200K tokens. Split long documents into chunks or use Flash models for long-context tasks.
Take advantage of the free tier
Google offers a generous free tier for Gemini API through AI Studio. For low-volume projects, you may not need to pay anything.
Be mindful of multimodal costs
Image output tokens on Gemini 3 Pro Image cost significantly more than text. Only use multimodal models when you actually need image or audio output.
Monitor usage with the ModelPricing API
Track your Gemini API spending programmatically to identify cost spikes and optimize model selection. Get started free.
Gemini Model Tiers
Flash Lite
The most affordable tier. Gemini 2.0 Flash Lite starts at just $0.075/1M input tokens, ideal for high-volume tasks like classification, extraction, and real-time applications.
Flash
Fast and cost-effective. Gemini 2.5 Flash and 3 Flash offer strong reasoning at $0.10-0.50/1M input tokens, great for coding, analysis, and multi-step workflows.
Pro
Most capable tier for complex reasoning and research. Gemini 2.5 Pro and 3 Pro use breakpoint pricing starting at $1.25-2/1M input tokens under 200K context.
Frequently Asked Questions
How much does the Gemini API cost?
Gemini API pricing varies by model. Gemini 2.0 Flash Lite starts at $0.075/1M input tokens, while Gemini 2.5 Pro costs $1.25-2.50/1M input tokens depending on context length.
What is the cheapest Gemini model?
Gemini 2.0 Flash Lite is the most affordable at $0.075/1M input tokens and $0.30/1M output tokens, making it one of the cheapest LLM APIs available.
Does Gemini have breakpoint pricing?
Yes. Gemini 2.5 Pro, Gemini 3 Pro, and Gemini 3.1 Pro use breakpoint pricing where costs increase when input exceeds 200K tokens.
How does Gemini pricing compare to GPT and Claude?
Gemini Flash models are among the most affordable LLM APIs. Gemini 2.0 Flash at $0.10/1M input tokens undercuts most competitors, while Gemini Pro models are competitively priced with Claude Sonnet and GPT-5.
Does Gemini support multimodal pricing?
Yes. Models like Gemini 3 Pro Image and Gemini 2.5 Flash Image have separate rates for text and image tokens, with image output costing significantly more than text output.
Is the Gemini API free?
Google offers a free tier for Gemini API with rate limits. Paid usage starts at $0.075/1M input tokens for Flash Lite. Check the Google AI Studio dashboard for current free-tier quotas.
How much does Gemini 3 cost?
Gemini 3 Flash costs $0.50/1M input tokens and $3/1M output tokens with flat pricing. Gemini 3 Pro Preview uses breakpoint pricing at $2/1M input (under 200K tokens) or $4/1M input (above 200K tokens).
Which Gemini model is best for coding?
Gemini 2.5 Pro is the strongest Gemini model for code generation and debugging, starting at $1.25/1M input tokens. For lighter coding tasks, Gemini 2.5 Flash at $0.30/1M input offers good performance at a much lower cost.
Estimate Your Gemini API Costs
Get accurate, real-time cost estimates for any Gemini model with our API. Or try the LLM cost calculator to compare across all providers.
Get Started Free