Skip to content
Updated March 2026

AI Model Pricing
Comparison 2026

Compare pricing across 56+ models from 20 providers. Sort by cost, context window, or capabilities. Use the cost calculator to estimate your monthly spend.

56
Models
20
Providers
$0.05/M
Cheapest input
2
Free models

Showing 56 of 56 models

Prices in USD per 1M tokens

glm-4.7-flash
Z.AI
free
Input / 1M
Free
Output / 1M
Free
Context
128K
Features
Use through Curate-Me
glm-4.5-flash
Z.AI
free
Input / 1M
Free
Output / 1M
Free
Context
128K
Features
Use through Curate-Me
gpt-5-nano
OpenAI
budget
Input / 1M
$0.05
Output / 1M
$0.40
Context
128K
Features
Use through Curate-Me
llama-3.1-8b
Groq
budget
Input / 1M
$0.05
Output / 1M
$0.08
Context
128K
Features
Use through Curate-Me
gemini-2.0-flash
Google
budget
Input / 1M
$0.10
Output / 1M
$0.40
Context
1M
Features
Use through Curate-Me
mistral-small-4
Mistral
budget
Input / 1M
$0.10
Output / 1M
$0.30
Context
128K
Features
Use through Curate-Me
step-3.5-flash
StepFun
budget
Input / 1M
$0.10
Output / 1M
$0.30
Context
128K
Features
Use through Curate-Me
gpt-4o-mini
OpenAI
budget
Input / 1M
$0.15
Output / 1M
$0.60
Context
128K
Features
Use through Curate-Me
gemini-2.5-flash
Google
budget
Input / 1M
$0.15
Output / 1M
$0.60
Context
1M
Features
Use through Curate-Me
command-r
Cohere
budget
Input / 1M
$0.15
Output / 1M
$0.60
Context
128K
Features
Use through Curate-Me
gpt-5.4-nano
OpenAI
budget
Input / 1M
$0.20
Output / 1M
$1.25
Context
128K
Features
Use through Curate-Me
grok-4.1-fast
xAI
budget
Input / 1M
$0.20
Output / 1M
$0.50
Context
131K
Features
Use through Curate-Me
llama-3.1-8b (Fireworks)
Fireworks
budget
Input / 1M
$0.20
Output / 1M
$0.20
Context
128K
Features
Use through Curate-Me
jamba-1.5-mini
AI21
budget
Input / 1M
$0.20
Output / 1M
$0.40
Context
256K
Features
--
Use through Curate-Me
mixtral-8x7b
Groq
budget
Input / 1M
$0.24
Output / 1M
$0.24
Context
33K
Features
Use through Curate-Me
gpt-5-mini
OpenAI
budget
Input / 1M
$0.25
Output / 1M
$2.00
Context
128K
Features
Use through Curate-Me
gemini-3.1-flash-lite
Google
budget
Input / 1M
$0.25
Output / 1M
$1.50
Context
1M
Features
Use through Curate-Me
MiniMax-M2
MiniMax
budget
Input / 1M
$0.26
Output / 1M
$1.00
Context
245K
Features
Use through Curate-Me
deepseek-chat
DeepSeek
budget
Input / 1M
$0.27
Output / 1M
$1.10
Context
128K
Features
Use through Curate-Me
codestral
Mistral
code
Input / 1M
$0.30
Output / 1M
$0.90
Context
256K
Features
--
Use through Curate-Me
MiniMax-M2.5
MiniMax
budget
Input / 1M
$0.30
Output / 1M
$1.20
Context
1M
Features
Use through Curate-Me
qwen-turbo
Qwen
budget
Input / 1M
$0.30
Output / 1M
$0.60
Context
128K
Features
Use through Curate-Me
llama-3.1-70b (DeepInfra)
DeepInfra
standard
Input / 1M
$0.52
Output / 1M
$0.75
Context
128K
Features
Use through Curate-Me
deepseek-reasoner
DeepSeek
reasoning
Input / 1M
$0.55
Output / 1M
$2.19
Context
128K
Features
--
Use through Curate-Me
llama-3.3-70b
Groq
standard
Input / 1M
$0.59
Output / 1M
$0.79
Context
128K
Features
Use through Curate-Me
kimi-k2.5
Moonshot
standard
Input / 1M
$0.60
Output / 1M
$2.50
Context
256K
Features
Use through Curate-Me
gpt-5.4-mini
OpenAI
standard
Input / 1M
$0.75
Output / 1M
$4.50
Context
256K
Features
Use through Curate-Me
claude-haiku-3.5
Anthropic
budget
Input / 1M
$0.80
Output / 1M
$4.00
Context
200K
Features
Use through Curate-Me
llama-3.3-70b (Cerebras)
Cerebras
standard
Input / 1M
$0.85
Output / 1M
$1.20
Context
128K
Features
Use through Curate-Me
llama-3.1-70b (Together)
Together
standard
Input / 1M
$0.88
Output / 1M
$0.88
Context
128K
Features
Use through Curate-Me
sonar
Perplexity
search
Input / 1M
$1.00
Output / 1M
$5.00
Context
128K
Features
--
Use through Curate-Me
sonar-reasoning
Perplexity
reasoning
Input / 1M
$1.00
Output / 1M
$5.00
Context
128K
Features
--
Use through Curate-Me
glm-5
Z.AI
standard
Input / 1M
$1.00
Output / 1M
$3.20
Context
128K
Features
Use through Curate-Me
o4-mini
OpenAI
reasoning
Input / 1M
$1.10
Output / 1M
$4.40
Context
200K
Features
Use through Curate-Me
o3-mini
OpenAI
reasoning
Input / 1M
$1.10
Output / 1M
$4.40
Context
200K
Features
Use through Curate-Me
gpt-5.1
OpenAI
frontier
Input / 1M
$1.25
Output / 1M
$10.00
Context
256K
Features
Use through Curate-Me
gemini-2.5-pro
Google
standard
Input / 1M
$1.25
Output / 1M
$10.00
Context
1M
Features
Use through Curate-Me
qwen-max
Qwen
standard
Input / 1M
$1.60
Output / 1M
$6.40
Context
128K
Features
Use through Curate-Me
gemini-3.1-pro
Google
frontier
Input / 1M
$2.00
Output / 1M
$12.00
Context
2M
Features
Use through Curate-Me
mistral-large
Mistral
standard
Input / 1M
$2.00
Output / 1M
$6.00
Context
128K
Features
Use through Curate-Me
sonar-deep-research
Perplexity
search
Input / 1M
$2.00
Output / 1M
$8.00
Context
128K
Features
--
Use through Curate-Me
sonar-reasoning-pro
Perplexity
reasoning
Input / 1M
$2.00
Output / 1M
$8.00
Context
128K
Features
--
Use through Curate-Me
jamba-1.5-large
AI21
standard
Input / 1M
$2.00
Output / 1M
$8.00
Context
256K
Features
--
Use through Curate-Me
gpt-5.4
OpenAI
frontier
Input / 1M
$2.50
Output / 1M
$15.00
Context
256K
Features
Use through Curate-Me
gpt-4o
OpenAI
standard
Input / 1M
$2.50
Output / 1M
$10.00
Context
128K
Features
Use through Curate-Me
command-r-plus
Cohere
standard
Input / 1M
$2.50
Output / 1M
$10.00
Context
128K
Features
Use through Curate-Me
claude-sonnet-4
Anthropic
standard
Input / 1M
$3.00
Output / 1M
$15.00
Context
200K
Features
Use through Curate-Me
claude-sonnet-4.5
Anthropic
standard
Input / 1M
$3.00
Output / 1M
$15.00
Context
200K
Features
Use through Curate-Me
grok-4
xAI
frontier
Input / 1M
$3.00
Output / 1M
$15.00
Context
256K
Features
Use through Curate-Me
grok-3
xAI
standard
Input / 1M
$3.00
Output / 1M
$15.00
Context
131K
Features
Use through Curate-Me
sonar-pro
Perplexity
search
Input / 1M
$3.00
Output / 1M
$15.00
Context
200K
Features
--
Use through Curate-Me
llama-3.1-405b (Fireworks)
Fireworks
frontier
Input / 1M
$3.00
Output / 1M
$3.00
Context
128K
Features
Use through Curate-Me
llama-3.1-405b (Together)
Together
frontier
Input / 1M
$3.50
Output / 1M
$3.50
Context
128K
Features
Use through Curate-Me
llama-3.1-405b (SambaNova)
SambaNova
frontier
Input / 1M
$5.00
Output / 1M
$10.00
Context
128K
Features
Use through Curate-Me
o3
OpenAI
reasoning
Input / 1M
$10.00
Output / 1M
$40.00
Context
200K
Features
Use through Curate-Me
claude-opus-4
Anthropic
frontier
Input / 1M
$15.00
Output / 1M
$75.00
Context
200K
Features
Use through Curate-Me
Cost Calculator

Calculate Your Monthly Cost

Select your expected monthly token volume and workload type to see the cheapest models for your use case.

10K500K tokens/mo10M

Top 5 cheapest models for your workload

1
llama-3.1-8b
Groq
$0.03
/month
2
mistral-small-4
Mistral
$0.10
/month
3
step-3.5-flash
StepFun
$0.10
/month
4
llama-3.1-8b (Fireworks)
Fireworks
$0.10
/month
5
gpt-5-nano
OpenAI
$0.11
/month
Start using these models through Curate-Me

Free tier includes 1,000 gateway requests/day. No credit card required.

Access all 56+ models through one gateway

Point your AI SDK at Curate-Me and get cost tracking, personal data scanning, rate limiting, and HITL approvals across every provider. Zero code changes.

# Before (direct to OpenAI):
OPENAI_BASE_URL=https://api.openai.com/v1
# After (through Curate-Me):
OPENAI_BASE_URL=https://api.curate-me.ai/v1/openai
X-CM-API-Key: cm_sk_xxx