Updated March 2026
AI Model Pricing
Comparison 2026
Compare pricing across 56+ models from 20 providers. Sort by cost, context window, or capabilities. Use the cost calculator to estimate your monthly spend.
56
Models
20
Providers
$0.05/M
Cheapest input
2
Free models
Showing 56 of 56 models
Prices in USD per 1M tokens
| Model | Provider | Category | Input / 1M | Output / 1M | Context | Vision | Tools | |
|---|---|---|---|---|---|---|---|---|
| glm-4.7-flash | Z.AI | free | Free | Free | 128K | -- | Use | |
| glm-4.5-flash | Z.AI | free | Free | Free | 128K | -- | Use | |
| gpt-5-nano | OpenAI | budget | $0.05 | $0.40 | 128K | -- | Use | |
| llama-3.1-8b | Groq | budget | $0.05 | $0.08 | 128K | -- | Use | |
| gemini-2.0-flash | budget | $0.10 | $0.40 | 1M | Use | |||
| mistral-small-4 | Mistral | budget | $0.10 | $0.30 | 128K | -- | Use | |
| step-3.5-flash | StepFun | budget | $0.10 | $0.30 | 128K | -- | Use | |
| gpt-4o-mini | OpenAI | budget | $0.15 | $0.60 | 128K | Use | ||
| gemini-2.5-flash | budget | $0.15 | $0.60 | 1M | Use | |||
| command-r | Cohere | budget | $0.15 | $0.60 | 128K | -- | Use | |
| gpt-5.4-nano | OpenAI | budget | $0.20 | $1.25 | 128K | Use | ||
| grok-4.1-fast | xAI | budget | $0.20 | $0.50 | 131K | -- | Use | |
| llama-3.1-8b (Fireworks) | Fireworks | budget | $0.20 | $0.20 | 128K | -- | Use | |
| jamba-1.5-mini | AI21 | budget | $0.20 | $0.40 | 256K | -- | -- | Use |
| mixtral-8x7b | Groq | budget | $0.24 | $0.24 | 33K | -- | Use | |
| gpt-5-mini | OpenAI | budget | $0.25 | $2.00 | 128K | Use | ||
| gemini-3.1-flash-lite | budget | $0.25 | $1.50 | 1M | Use | |||
| MiniMax-M2 | MiniMax | budget | $0.26 | $1.00 | 245K | -- | Use | |
| deepseek-chat | DeepSeek | budget | $0.27 | $1.10 | 128K | -- | Use | |
| codestral | Mistral | code | $0.30 | $0.90 | 256K | -- | -- | Use |
| MiniMax-M2.5 | MiniMax | budget | $0.30 | $1.20 | 1M | Use | ||
| qwen-turbo | Qwen | budget | $0.30 | $0.60 | 128K | -- | Use | |
| llama-3.1-70b (DeepInfra) | DeepInfra | standard | $0.52 | $0.75 | 128K | -- | Use | |
| deepseek-reasoner | DeepSeek | reasoning | $0.55 | $2.19 | 128K | -- | -- | Use |
| llama-3.3-70b | Groq | standard | $0.59 | $0.79 | 128K | -- | Use | |
| kimi-k2.5 | Moonshot | standard | $0.60 | $2.50 | 256K | Use | ||
| gpt-5.4-mini | OpenAI | standard | $0.75 | $4.50 | 256K | Use | ||
| claude-haiku-3.5 | Anthropic | budget | $0.80 | $4.00 | 200K | Use | ||
| llama-3.3-70b (Cerebras) | Cerebras | standard | $0.85 | $1.20 | 128K | -- | Use | |
| llama-3.1-70b (Together) | Together | standard | $0.88 | $0.88 | 128K | -- | Use | |
| sonar | Perplexity | search | $1.00 | $5.00 | 128K | -- | -- | Use |
| sonar-reasoning | Perplexity | reasoning | $1.00 | $5.00 | 128K | -- | -- | Use |
| glm-5 | Z.AI | standard | $1.00 | $3.20 | 128K | Use | ||
| o4-mini | OpenAI | reasoning | $1.10 | $4.40 | 200K | Use | ||
| o3-mini | OpenAI | reasoning | $1.10 | $4.40 | 200K | -- | Use | |
| gpt-5.1 | OpenAI | frontier | $1.25 | $10.00 | 256K | Use | ||
| gemini-2.5-pro | standard | $1.25 | $10.00 | 1M | Use | |||
| qwen-max | Qwen | standard | $1.60 | $6.40 | 128K | Use | ||
| gemini-3.1-pro | frontier | $2.00 | $12.00 | 2M | Use | |||
| mistral-large | Mistral | standard | $2.00 | $6.00 | 128K | -- | Use | |
| sonar-deep-research | Perplexity | search | $2.00 | $8.00 | 128K | -- | -- | Use |
| sonar-reasoning-pro | Perplexity | reasoning | $2.00 | $8.00 | 128K | -- | -- | Use |
| jamba-1.5-large | AI21 | standard | $2.00 | $8.00 | 256K | -- | -- | Use |
| gpt-5.4 | OpenAI | frontier | $2.50 | $15.00 | 256K | Use | ||
| gpt-4o | OpenAI | standard | $2.50 | $10.00 | 128K | Use | ||
| command-r-plus | Cohere | standard | $2.50 | $10.00 | 128K | -- | Use | |
| claude-sonnet-4 | Anthropic | standard | $3.00 | $15.00 | 200K | Use | ||
| claude-sonnet-4.5 | Anthropic | standard | $3.00 | $15.00 | 200K | Use | ||
| grok-4 | xAI | frontier | $3.00 | $15.00 | 256K | Use | ||
| grok-3 | xAI | standard | $3.00 | $15.00 | 131K | Use | ||
| sonar-pro | Perplexity | search | $3.00 | $15.00 | 200K | -- | -- | Use |
| llama-3.1-405b (Fireworks) | Fireworks | frontier | $3.00 | $3.00 | 128K | -- | Use | |
| llama-3.1-405b (Together) | Together | frontier | $3.50 | $3.50 | 128K | -- | Use | |
| llama-3.1-405b (SambaNova) | SambaNova | frontier | $5.00 | $10.00 | 128K | -- | Use | |
| o3 | OpenAI | reasoning | $10.00 | $40.00 | 200K | Use | ||
| claude-opus-4 | Anthropic | frontier | $15.00 | $75.00 | 200K | Use |
llama-3.1-8b (Fireworks)
Fireworks
Input / 1M
$0.20
Output / 1M
$0.20
Context
128K
Features
gemini-3.1-flash-lite
Google
Input / 1M
$0.25
Output / 1M
$1.50
Context
1M
Features
llama-3.1-70b (DeepInfra)
DeepInfra
Input / 1M
$0.52
Output / 1M
$0.75
Context
128K
Features
deepseek-reasoner
DeepSeek
Input / 1M
$0.55
Output / 1M
$2.19
Context
128K
Features
--
claude-haiku-3.5
Anthropic
Input / 1M
$0.80
Output / 1M
$4.00
Context
200K
Features
llama-3.3-70b (Cerebras)
Cerebras
Input / 1M
$0.85
Output / 1M
$1.20
Context
128K
Features
llama-3.1-70b (Together)
Together
Input / 1M
$0.88
Output / 1M
$0.88
Context
128K
Features
sonar-reasoning
Perplexity
Input / 1M
$1.00
Output / 1M
$5.00
Context
128K
Features
--
sonar-deep-research
Perplexity
Input / 1M
$2.00
Output / 1M
$8.00
Context
128K
Features
--
sonar-reasoning-pro
Perplexity
Input / 1M
$2.00
Output / 1M
$8.00
Context
128K
Features
--
jamba-1.5-large
AI21
Input / 1M
$2.00
Output / 1M
$8.00
Context
256K
Features
--
command-r-plus
Cohere
Input / 1M
$2.50
Output / 1M
$10.00
Context
128K
Features
claude-sonnet-4
Anthropic
Input / 1M
$3.00
Output / 1M
$15.00
Context
200K
Features
claude-sonnet-4.5
Anthropic
Input / 1M
$3.00
Output / 1M
$15.00
Context
200K
Features
llama-3.1-405b (Fireworks)
Fireworks
Input / 1M
$3.00
Output / 1M
$3.00
Context
128K
Features
llama-3.1-405b (Together)
Together
Input / 1M
$3.50
Output / 1M
$3.50
Context
128K
Features
llama-3.1-405b (SambaNova)
SambaNova
Input / 1M
$5.00
Output / 1M
$10.00
Context
128K
Features
claude-opus-4
Anthropic
Input / 1M
$15.00
Output / 1M
$75.00
Context
200K
Features
Cost Calculator
Calculate Your Monthly Cost
Select your expected monthly token volume and workload type to see the cheapest models for your use case.
10K500K tokens/mo10M
Top 5 cheapest models for your workload
1
llama-3.1-8b
Groq
$0.03
/month
2
mistral-small-4
Mistral
$0.10
/month
3
step-3.5-flash
StepFun
$0.10
/month
4
llama-3.1-8b (Fireworks)
Fireworks
$0.10
/month
5
gpt-5-nano
OpenAI
$0.11
/month
Start using these models through Curate-Me
Free tier includes 1,000 gateway requests/day. No credit card required.
Access all 56+ models through one gateway
Point your AI SDK at Curate-Me and get cost tracking, personal data scanning, rate limiting, and HITL approvals across every provider. Zero code changes.
# Before (direct to OpenAI):
OPENAI_BASE_URL=https://api.openai.com/v1
# After (through Curate-Me):
OPENAI_BASE_URL=https://api.curate-me.ai/v1/openai
X-CM-API-Key: cm_sk_xxx