Anthropic API Pricing
Anthropic's Claude model family spans from the lightweight Haiku to the flagship Opus. Below is the complete pricing breakdown for every Claude model available through the API, updated for 2026. Rates sourced from the official Anthropic pricing page. Compare with OpenAI and Google Gemini pricing, or see all models in our LLM pricing comparison.
All Claude Models
| Model | Input $/1M tokens | Output $/1M tokens | Type | Notes |
|---|---|---|---|---|
| claude-3-7-sonnet | $3.00 | $15.00 | Flat | |
| claude-haiku-3 | $0.250 | $1.25 | Flat | |
| claude-haiku-3-5 | $0.800 | $4.00 | Flat | |
| claude-haiku-4-5 | $1.00 | $5.00 | Flat | |
| claude-opus-3 | $15.00 | $75.00 | Flat | |
| claude-opus-4-0 | $15.00 | $75.00 | Flat | |
| claude-opus-4-1 | $15.00 | $75.00 | Flat | |
| claude-opus-4-5 | $5.00 | $25.00 | Flat | |
| claude-opus-4-6 | $5.00 / $10.00 | $25.00 / $37.50 | Breakpoint | Threshold: 200K tokens |
| claude-sonnet-4-0 | $3.00 / $6.00 | $15.00 / $22.50 | Breakpoint | Threshold: 200K tokens |
| claude-sonnet-4-5 | $3.00 / $6.00 | $15.00 / $22.50 | Breakpoint | Threshold: 200K tokens |
How Claude Pricing Works
Claude models use two pricing structures. Flat pricing charges a fixed rate per million tokens for both input and output, regardless of context length. This is used by Haiku and older Opus models.
Breakpoint pricing applies to newer models like Opus 4.6 and Sonnet 4.0/4.5. These models have a lower rate for requests under 200K input tokens and a higher rate above that threshold. This lets you benefit from lower costs for typical workloads while still accessing the model's full context window.
Claude Model Comparison
Haiku
Fastest and most affordable. Ideal for classification, extraction, and high-volume tasks where speed matters more than reasoning depth.
Sonnet
Best balance of intelligence and cost. Great for coding assistance, analysis, and multi-step workflows that need strong reasoning.
Opus
Most capable model for complex reasoning, research, and tasks requiring deep understanding. Higher cost reflects premium capabilities.
Claude API Monthly Cost Estimates
How much will Claude API cost you per month? Here are realistic estimates based on typical usage patterns. Use our LLM cost calculator for a precise estimate based on your specific workload.
Light Use
$5-20/mo
- Personal projects
- <1K requests/day
- Haiku for most tasks
Medium Use
$20-100/mo
- Small team apps
- 1-5K requests/day
- Mix of Haiku and Sonnet
Heavy Use
$100-500/mo
- Production apps
- 5-20K requests/day
- Sonnet for quality tasks
Enterprise
$500+/mo
- Large-scale deployments
- 20K+ requests/day
- Opus for complex tasks
Which Claude Model Should You Use?
Choosing the right Claude model depends on your task complexity and budget. Here's a quick guide based on common use cases.
| Use Case | Recommended Model | Est. Monthly Cost | Why This Model |
|---|---|---|---|
| Customer support chatbot | Haiku 4.5 | $10-50 | Fast responses, affordable for high volume |
| Code generation | Sonnet 4.5 | $30-150 | Best balance of code quality and cost |
| Research & analysis | Opus 4.6 | $100-500 | Deepest reasoning, handles complex tasks |
| Data extraction | Haiku 3 | $5-20 | Cheapest option, sufficient for structured tasks |
| Agentic workflows | Sonnet 4.0 | $50-200 | Strong reasoning at mid-tier pricing |
5 Ways to Reduce Your Claude API Costs
Use model routing
Start with Haiku for triage and only escalate to Sonnet or Opus when the task requires deeper reasoning. This can cut costs 50-80% for mixed workloads.
Stay under the 200K breakpoint
Opus 4.6 and Sonnet 4.0/4.5 charge higher rates above 200K input tokens. Keep prompts concise or split long documents into smaller chunks to stay under the threshold.
Enable prompt caching
Anthropic offers prompt caching that reduces costs for repeated system prompts. If your requests share the same instructions, caching can significantly lower your per-request cost.
Optimize output length
Output tokens cost 5x more than input tokens across all Claude models. Set appropriate max_tokens limits and use structured output formats to keep responses concise.
Monitor usage with the ModelPricing API
Track your token consumption programmatically to identify cost spikes and optimize your model selection over time. Get started free.
How Claude Pricing Compares to GPT and Gemini
Claude sits in the mid-to-premium tier compared to OpenAI and Google Gemini. Haiku is competitive with budget models, Sonnet matches mid-range pricing, and Opus commands a premium for its reasoning capabilities. See our full LLM pricing comparison for all models.
| Tier | Claude | OpenAI | |
|---|---|---|---|
| Budget | Haiku 3 — $0.25/$1.25 | GPT-5 Nano — $0.05/$0.40 | Flash Lite — $0.075/$0.30 |
| Mid-range | Sonnet 4.5 — $3/$15 | GPT-5 — $1.25/$10 | Gemini 2.5 Pro — $1.25/$10 |
| Flagship | Opus 4.6 — $5/$25 | GPT-5.4 — $2.50/$15 | Gemini 3.1 Pro — $2/$12 |
Frequently Asked Questions
How much does Claude API cost?
Claude API pricing varies by model. Claude Haiku 3 starts at $0.25/1M input tokens, while Claude Opus 4.6 costs $5-10/1M input tokens depending on context length. Most teams spend $20-200/month depending on usage volume and model selection.
What is the cheapest Claude model?
Claude Haiku 3 is the most affordable at $0.25/1M input tokens and $1.25/1M output tokens, making it ideal for high-volume, low-complexity tasks like classification and data extraction.
Does Claude have breakpoint pricing?
Yes. Claude Opus 4.6, Sonnet 4.0, and Sonnet 4.5 use breakpoint pricing where costs increase when input exceeds 200K tokens. Below the threshold, you get lower rates — for example, Opus 4.6 charges $5/1M input under 200K but $10/1M above it.
How much does Claude Code cost?
Claude Code uses the Claude API under the hood. Costs depend on which model you configure — Sonnet 4.5 at $3/$15 per million tokens is the default. Heavy Claude Code users typically spend $50-200/month on API usage. Anthropic also offers Max subscription plans as an alternative to per-token billing.
Is the Claude API free?
Anthropic does not offer a permanent free tier for the Claude API, but new accounts receive a small amount of free credits to get started. After that, usage is billed per token. You can estimate your costs with our LLM cost calculator.
Claude API vs subscription — which is cheaper?
For light usage, the Claude Pro subscription ($20/month) is often cheaper than API billing. For heavy or programmatic usage, the API gives you more control. Claude Max plans ($100-200/month) can be significantly cheaper than equivalent API usage for power users.
How does Claude prompt caching affect pricing?
Anthropic offers prompt caching that stores frequently used system prompts. Cached prompt tokens cost significantly less than standard input tokens, making it highly cost-effective for applications that reuse the same instructions across many requests.
Claude Sonnet vs Opus — which should I use?
Sonnet 4.5 ($3/$15 per million tokens) is the best choice for most tasks including coding, analysis, and multi-step workflows. Opus 4.6 ($5/$25) is worth the premium only for the most complex reasoning, research, and tasks requiring deep understanding. Start with Sonnet and upgrade to Opus only when you need it.
Estimate Your Claude API Costs
Get accurate, real-time cost estimates for any Claude model with our API. Or try the LLM cost calculator to compare across all providers.
Get Started Free