# token.app — Full AI Pricing Data
# https://token.app/
# Source: OpenRouter Models API + provider pricing pages
# Last updated: Sat, 06 Jun 2026 02:00:33 GMT
# Total models: 344
# Total providers: 57
#
# All prices in USD per 1,000,000 tokens (1M tokens).
# "free" means the model is available at no cost.
# Context window shown in K (thousands) or M (millions) of tokens.
# Data refreshed hourly. Always verify with official provider docs.
#
# For a machine-readable JSON version: https://token.app/api/models
# For the full site: https://token.app/
# For the LLMs index: https://token.app/llms.txt

---

## API Token Pricing — All Models by Provider

### ~anthropic
  - Anthropic Claude Haiku Latest: input $1.00/1M tokens, output $5.00/1M tokens, 200K ctx
  - Anthropic Claude Sonnet Latest: input $3.00/1M tokens, output $15.00/1M tokens, 1M ctx
  - Anthropic: Claude Opus Latest: input $5.00/1M tokens, output $25.00/1M tokens, 1M ctx

### ~google
  - Google Gemini Flash Latest: input $1.50/1M tokens, output $9.00/1M tokens, 1M ctx
  - Google Gemini Pro Latest: input $2.00/1M tokens, output $12.00/1M tokens, 1M ctx

### ~moonshotai
  - MoonshotAI Kimi Latest: input $0.68/1M tokens, output $3.42/1M tokens, 262K ctx

### ~openai
  - OpenAI GPT Latest: input $5.00/1M tokens, output $30.00/1M tokens, 1M ctx
  - OpenAI GPT Mini Latest: input $0.75/1M tokens, output $4.50/1M tokens, 400K ctx

### ai21
  - AI21: Jamba Large 1.7: input $2.00/1M tokens, output $8.00/1M tokens, 256K ctx

### aion-labs
  - AionLabs: Aion-1.0: input $4.00/1M tokens, output $8.00/1M tokens, 131K ctx
  - AionLabs: Aion-1.0-Mini: input $0.70/1M tokens, output $1.40/1M tokens, 131K ctx
  - AionLabs: Aion-2.0: input $0.80/1M tokens, output $1.60/1M tokens, 131K ctx
  - AionLabs: Aion-RP 1.0 (8B): input $0.80/1M tokens, output $1.60/1M tokens, 33K ctx

### allenai
  - AllenAI: Olmo 3 32B Think: input $0.15/1M tokens, output $0.50/1M tokens, 66K ctx

### amazon
  - Amazon: Nova 2 Lite: input $0.30/1M tokens, output $2.50/1M tokens, 1M ctx
  - Amazon: Nova Lite 1.0: input $0.06/1M tokens, output $0.24/1M tokens, 300K ctx
  - Amazon: Nova Micro 1.0: input $0.04/1M tokens, output $0.14/1M tokens, 128K ctx
  - Amazon: Nova Premier 1.0: input $2.50/1M tokens, output $12.50/1M tokens, 1M ctx
  - Amazon: Nova Pro 1.0: input $0.80/1M tokens, output $3.20/1M tokens, 300K ctx

### anthracite-org
  - Magnum v4 72B: input $3.00/1M tokens, output $5.00/1M tokens, 33K ctx

### anthropic
  - Anthropic: Claude 3 Haiku: input $0.25/1M tokens, output $1.25/1M tokens, 200K ctx
  - Anthropic: Claude 3.5 Haiku: input $0.80/1M tokens, output $4.00/1M tokens, 200K ctx
  - Anthropic: Claude Haiku 4.5: input $1.00/1M tokens, output $5.00/1M tokens, 200K ctx
  - Anthropic: Claude Opus 4: input $15.00/1M tokens, output $75.00/1M tokens, 200K ctx
  - Anthropic: Claude Opus 4.1: input $15.00/1M tokens, output $75.00/1M tokens, 200K ctx
  - Anthropic: Claude Opus 4.5: input $5.00/1M tokens, output $25.00/1M tokens, 200K ctx
  - Anthropic: Claude Opus 4.6: input $5.00/1M tokens, output $25.00/1M tokens, 1M ctx
  - Anthropic: Claude Opus 4.6 (Fast): input $30.00/1M tokens, output $150.00/1M tokens, 1M ctx
  - Anthropic: Claude Opus 4.7: input $5.00/1M tokens, output $25.00/1M tokens, 1M ctx
  - Anthropic: Claude Opus 4.7 (Fast): input $30.00/1M tokens, output $150.00/1M tokens, 1M ctx
  - Anthropic: Claude Opus 4.8: input $5.00/1M tokens, output $25.00/1M tokens, 1M ctx
  - Anthropic: Claude Opus 4.8 (Fast): input $10.00/1M tokens, output $50.00/1M tokens, 1M ctx
  - Anthropic: Claude Sonnet 4: input $3.00/1M tokens, output $15.00/1M tokens, 1M ctx
  - Anthropic: Claude Sonnet 4.5: input $3.00/1M tokens, output $15.00/1M tokens, 1M ctx
  - Anthropic: Claude Sonnet 4.6: input $3.00/1M tokens, output $15.00/1M tokens, 1M ctx

### arcee-ai
  - Arcee AI: Coder Large: input $0.50/1M tokens, output $0.80/1M tokens, 33K ctx
  - Arcee AI: Maestro Reasoning: input $0.90/1M tokens, output $3.30/1M tokens, 131K ctx
  - Arcee AI: Spotlight: input $0.18/1M tokens, output $0.18/1M tokens, 131K ctx
  - Arcee AI: Trinity Large Thinking: input $0.22/1M tokens, output $0.85/1M tokens, 262K ctx
  - Arcee AI: Trinity Mini: input $0.04/1M tokens, output $0.15/1M tokens, 131K ctx
  - Arcee AI: Virtuoso Large: input $0.75/1M tokens, output $1.20/1M tokens, 131K ctx

### baidu
  - Baidu: ERNIE 4.5 VL 28B A3B: input $0.14/1M tokens, output $0.56/1M tokens, 131K ctx
  - Baidu: ERNIE 4.5 VL 424B A47B : input $0.42/1M tokens, output $1.25/1M tokens, 131K ctx

### bytedance
  - ByteDance: UI-TARS 7B : input $0.10/1M tokens, output $0.20/1M tokens, 128K ctx

### bytedance-seed
  - ByteDance Seed: Seed 1.6: input $0.25/1M tokens, output $2.00/1M tokens, 262K ctx
  - ByteDance Seed: Seed 1.6 Flash: input $0.07/1M tokens, output $0.30/1M tokens, 262K ctx
  - ByteDance Seed: Seed-2.0-Lite: input $0.25/1M tokens, output $2.00/1M tokens, 262K ctx
  - ByteDance Seed: Seed-2.0-Mini: input $0.10/1M tokens, output $0.40/1M tokens, 262K ctx

### cognitivecomputations
  - Venice: Uncensored (free): input $0.00/1M tokens, output $0.00/1M tokens, 33K ctx

### cohere
  - Cohere: Command A: input $2.50/1M tokens, output $10.00/1M tokens, 256K ctx
  - Cohere: Command R (08-2024): input $0.15/1M tokens, output $0.60/1M tokens, 128K ctx
  - Cohere: Command R+ (08-2024): input $2.50/1M tokens, output $10.00/1M tokens, 128K ctx
  - Cohere: Command R7B (12-2024): input $0.04/1M tokens, output $0.15/1M tokens, 128K ctx

### deepcogito
  - Deep Cogito: Cogito v2.1 671B: input $1.25/1M tokens, output $1.25/1M tokens, 128K ctx

### deepseek
  - DeepSeek: DeepSeek V3: input $0.20/1M tokens, output $0.80/1M tokens, 131K ctx
  - DeepSeek: DeepSeek V3 0324: input $0.20/1M tokens, output $0.77/1M tokens, 164K ctx
  - DeepSeek: DeepSeek V3.1: input $0.21/1M tokens, output $0.79/1M tokens, 164K ctx
  - DeepSeek: DeepSeek V3.1 Terminus: input $0.27/1M tokens, output $0.95/1M tokens, 164K ctx
  - DeepSeek: DeepSeek V3.2: input $0.23/1M tokens, output $0.34/1M tokens, 131K ctx
  - DeepSeek: DeepSeek V3.2 Exp: input $0.27/1M tokens, output $0.41/1M tokens, 164K ctx
  - DeepSeek: DeepSeek V4 Flash: input $0.10/1M tokens, output $0.20/1M tokens, 1M ctx
  - DeepSeek: DeepSeek V4 Pro: input $0.43/1M tokens, output $0.87/1M tokens, 1M ctx
  - DeepSeek: R1: input $0.70/1M tokens, output $2.50/1M tokens, 164K ctx
  - DeepSeek: R1 0528: input $0.50/1M tokens, output $2.15/1M tokens, 164K ctx
  - DeepSeek: R1 Distill Llama 70B: input $0.70/1M tokens, output $0.80/1M tokens, 131K ctx
  - DeepSeek: R1 Distill Qwen 32B: input $0.29/1M tokens, output $0.29/1M tokens, 128K ctx

### essentialai
  - EssentialAI: Rnj 1 Instruct: input $0.15/1M tokens, output $0.15/1M tokens, 33K ctx

### google
  - Google: Gemini 2.5 Flash: input $0.30/1M tokens, output $2.50/1M tokens, 1M ctx
  - Google: Gemini 2.5 Flash Lite: input $0.10/1M tokens, output $0.40/1M tokens, 1M ctx
  - Google: Gemini 2.5 Flash Lite Preview 09-2025: input $0.10/1M tokens, output $0.40/1M tokens, 1M ctx
  - Google: Gemini 2.5 Pro: input $1.25/1M tokens, output $10.00/1M tokens, 1M ctx
  - Google: Gemini 2.5 Pro Preview 05-06: input $1.25/1M tokens, output $10.00/1M tokens, 1M ctx
  - Google: Gemini 2.5 Pro Preview 06-05: input $1.25/1M tokens, output $10.00/1M tokens, 1M ctx
  - Google: Gemini 3 Flash Preview: input $0.50/1M tokens, output $3.00/1M tokens, 1M ctx
  - Google: Gemini 3.1 Flash Lite: input $0.25/1M tokens, output $1.50/1M tokens, 1M ctx
  - Google: Gemini 3.1 Flash Lite Preview: input $0.25/1M tokens, output $1.50/1M tokens, 1M ctx
  - Google: Gemini 3.1 Pro Preview: input $2.00/1M tokens, output $12.00/1M tokens, 1M ctx
  - Google: Gemini 3.1 Pro Preview Custom Tools: input $2.00/1M tokens, output $12.00/1M tokens, 1M ctx
  - Google: Gemini 3.5 Flash: input $1.50/1M tokens, output $9.00/1M tokens, 1M ctx
  - Google: Gemma 2 27B: input $0.65/1M tokens, output $0.65/1M tokens, 8K ctx
  - Google: Gemma 3 12B: input $0.04/1M tokens, output $0.13/1M tokens, 131K ctx
  - Google: Gemma 3 27B: input $0.08/1M tokens, output $0.16/1M tokens, 131K ctx
  - Google: Gemma 3 4B: input $0.04/1M tokens, output $0.08/1M tokens, 131K ctx
  - Google: Gemma 3n 4B: input $0.06/1M tokens, output $0.12/1M tokens, 33K ctx
  - Google: Gemma 4 26B A4B : input $0.06/1M tokens, output $0.33/1M tokens, 262K ctx
  - Google: Gemma 4 26B A4B  (free): input $0.00/1M tokens, output $0.00/1M tokens, 262K ctx
  - Google: Gemma 4 31B: input $0.12/1M tokens, output $0.36/1M tokens, 262K ctx
  - Google: Gemma 4 31B (free): input $0.00/1M tokens, output $0.00/1M tokens, 262K ctx
  - Google: Lyria 3 Clip Preview: input $0.00/1M tokens, output $0.00/1M tokens, 1M ctx
  - Google: Lyria 3 Pro Preview: input $0.00/1M tokens, output $0.00/1M tokens, 1M ctx
  - Google: Nano Banana (Gemini 2.5 Flash Image): input $0.30/1M tokens, output $2.50/1M tokens, 33K ctx
  - Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview): input $0.50/1M tokens, output $3.00/1M tokens, 131K ctx
  - Google: Nano Banana Pro (Gemini 3 Pro Image Preview): input $2.00/1M tokens, output $12.00/1M tokens, 66K ctx

### gryphe
  - MythoMax 13B: input $0.06/1M tokens, output $0.06/1M tokens, 4K ctx

### ibm-granite
  - IBM: Granite 4.0 Micro: input $0.02/1M tokens, output $0.11/1M tokens, 131K ctx
  - IBM: Granite 4.1 8B: input $0.05/1M tokens, output $0.10/1M tokens, 131K ctx

### inception
  - Inception: Mercury 2: input $0.25/1M tokens, output $0.75/1M tokens, 128K ctx

### inclusionai
  - inclusionAI: Ling-2.6-1T: input $0.07/1M tokens, output $0.63/1M tokens, 262K ctx
  - inclusionAI: Ling-2.6-flash: input $0.01/1M tokens, output $0.03/1M tokens, 262K ctx
  - inclusionAI: Ring-2.6-1T: input $0.07/1M tokens, output $0.63/1M tokens, 262K ctx

### inflection
  - Inflection: Inflection 3 Pi: input $2.50/1M tokens, output $10.00/1M tokens, 8K ctx
  - Inflection: Inflection 3 Productivity: input $2.50/1M tokens, output $10.00/1M tokens, 8K ctx

### kwaipilot
  - Kwaipilot: KAT-Coder-Pro V2: input $0.30/1M tokens, output $1.20/1M tokens, 256K ctx

### liquid
  - LiquidAI: LFM2-24B-A2B: input $0.03/1M tokens, output $0.12/1M tokens, 128K ctx
  - LiquidAI: LFM2.5-1.2B-Instruct (free): input $0.00/1M tokens, output $0.00/1M tokens, 33K ctx
  - LiquidAI: LFM2.5-1.2B-Thinking (free): input $0.00/1M tokens, output $0.00/1M tokens, 33K ctx

### mancer
  - Mancer: Weaver (alpha): input $0.75/1M tokens, output $1.00/1M tokens, 8K ctx

### meta-llama
  - Llama Guard 3 8B: input $0.48/1M tokens, output $0.03/1M tokens, 131K ctx
  - Meta: Llama 3 70B Instruct: input $0.51/1M tokens, output $0.74/1M tokens, 8K ctx
  - Meta: Llama 3 8B Instruct: input $0.04/1M tokens, output $0.04/1M tokens, 8K ctx
  - Meta: Llama 3.1 70B Instruct: input $0.40/1M tokens, output $0.40/1M tokens, 131K ctx
  - Meta: Llama 3.1 8B Instruct: input $0.02/1M tokens, output $0.03/1M tokens, 131K ctx
  - Meta: Llama 3.2 11B Vision Instruct: input $0.24/1M tokens, output $0.24/1M tokens, 131K ctx
  - Meta: Llama 3.2 1B Instruct: input $0.03/1M tokens, output $0.20/1M tokens, 131K ctx
  - Meta: Llama 3.2 3B Instruct: input $0.05/1M tokens, output $0.34/1M tokens, 131K ctx
  - Meta: Llama 3.2 3B Instruct (free): input $0.00/1M tokens, output $0.00/1M tokens, 131K ctx
  - Meta: Llama 3.3 70B Instruct: input $0.10/1M tokens, output $0.32/1M tokens, 131K ctx
  - Meta: Llama 3.3 70B Instruct (free): input $0.00/1M tokens, output $0.00/1M tokens, 131K ctx
  - Meta: Llama 4 Maverick: input $0.15/1M tokens, output $0.60/1M tokens, 1M ctx
  - Meta: Llama 4 Scout: input $0.08/1M tokens, output $0.30/1M tokens, 10M ctx
  - Meta: Llama Guard 4 12B: input $0.18/1M tokens, output $0.18/1M tokens, 164K ctx

### microsoft
  - Microsoft: Phi 4: input $0.07/1M tokens, output $0.14/1M tokens, 16K ctx
  - Microsoft: Phi 4 Mini Instruct: input $0.08/1M tokens, output $0.35/1M tokens, 131K ctx
  - WizardLM-2 8x22B: input $0.62/1M tokens, output $0.62/1M tokens, 66K ctx

### minimax
  - MiniMax: MiniMax M1: input $0.40/1M tokens, output $2.20/1M tokens, 1M ctx
  - MiniMax: MiniMax M2: input $0.26/1M tokens, output $1.00/1M tokens, 205K ctx
  - MiniMax: MiniMax M2-her: input $0.30/1M tokens, output $1.20/1M tokens, 66K ctx
  - MiniMax: MiniMax M2.1: input $0.29/1M tokens, output $0.95/1M tokens, 205K ctx
  - MiniMax: MiniMax M2.5: input $0.15/1M tokens, output $1.15/1M tokens, 205K ctx
  - MiniMax: MiniMax M2.7: input $0.28/1M tokens, output $1.20/1M tokens, 205K ctx
  - MiniMax: MiniMax M3: input $0.30/1M tokens, output $1.20/1M tokens, 1M ctx
  - MiniMax: MiniMax-01: input $0.20/1M tokens, output $1.10/1M tokens, 1M ctx

### mistralai
  - Mistral Large: input $2.00/1M tokens, output $6.00/1M tokens, 128K ctx
  - Mistral Large 2407: input $2.00/1M tokens, output $6.00/1M tokens, 131K ctx
  - Mistral: Codestral 2508: input $0.30/1M tokens, output $0.90/1M tokens, 256K ctx
  - Mistral: Devstral 2 2512: input $0.40/1M tokens, output $2.00/1M tokens, 262K ctx
  - Mistral: Ministral 3 14B 2512: input $0.20/1M tokens, output $0.20/1M tokens, 262K ctx
  - Mistral: Ministral 3 3B 2512: input $0.10/1M tokens, output $0.10/1M tokens, 131K ctx
  - Mistral: Ministral 3 8B 2512: input $0.15/1M tokens, output $0.15/1M tokens, 262K ctx
  - Mistral: Mistral Large 3 2512: input $0.50/1M tokens, output $1.50/1M tokens, 262K ctx
  - Mistral: Mistral Medium 3: input $0.40/1M tokens, output $2.00/1M tokens, 131K ctx
  - Mistral: Mistral Medium 3.1: input $0.40/1M tokens, output $2.00/1M tokens, 131K ctx
  - Mistral: Mistral Medium 3.5: input $1.50/1M tokens, output $7.50/1M tokens, 262K ctx
  - Mistral: Mistral Nemo: input $0.02/1M tokens, output $0.03/1M tokens, 131K ctx
  - Mistral: Mistral Small 3: input $0.05/1M tokens, output $0.08/1M tokens, 33K ctx
  - Mistral: Mistral Small 3.1 24B: input $0.35/1M tokens, output $0.56/1M tokens, 128K ctx
  - Mistral: Mistral Small 3.2 24B: input $0.07/1M tokens, output $0.20/1M tokens, 128K ctx
  - Mistral: Mistral Small 4: input $0.15/1M tokens, output $0.60/1M tokens, 262K ctx
  - Mistral: Mixtral 8x22B Instruct: input $2.00/1M tokens, output $6.00/1M tokens, 66K ctx
  - Mistral: Saba: input $0.20/1M tokens, output $0.60/1M tokens, 33K ctx
  - Mistral: Voxtral Small 24B 2507: input $0.10/1M tokens, output $0.30/1M tokens, 32K ctx

### moonshotai
  - MoonshotAI: Kimi K2 0711: input $0.57/1M tokens, output $2.30/1M tokens, 131K ctx
  - MoonshotAI: Kimi K2 0905: input $0.60/1M tokens, output $2.50/1M tokens, 262K ctx
  - MoonshotAI: Kimi K2 Thinking: input $0.60/1M tokens, output $2.50/1M tokens, 262K ctx
  - MoonshotAI: Kimi K2.5: input $0.40/1M tokens, output $1.90/1M tokens, 262K ctx
  - MoonshotAI: Kimi K2.6: input $0.68/1M tokens, output $3.42/1M tokens, 262K ctx
  - MoonshotAI: Kimi K2.6 (free): input $0.00/1M tokens, output $0.00/1M tokens, 262K ctx

### morph
  - Morph: Morph V3 Fast: input $0.80/1M tokens, output $1.20/1M tokens, 82K ctx
  - Morph: Morph V3 Large: input $0.90/1M tokens, output $1.90/1M tokens, 262K ctx

### nex-agi
  - Nex AGI: DeepSeek V3.1 Nex N1: input $0.14/1M tokens, output $0.50/1M tokens, 131K ctx

### nousresearch
  - Nous: Hermes 3 405B Instruct: input $1.00/1M tokens, output $1.00/1M tokens, 131K ctx
  - Nous: Hermes 3 405B Instruct (free): input $0.00/1M tokens, output $0.00/1M tokens, 131K ctx
  - Nous: Hermes 3 70B Instruct: input $0.30/1M tokens, output $0.30/1M tokens, 131K ctx
  - Nous: Hermes 4 405B: input $1.00/1M tokens, output $3.00/1M tokens, 131K ctx
  - Nous: Hermes 4 70B: input $0.13/1M tokens, output $0.40/1M tokens, 131K ctx

### nvidia
  - NVIDIA: Llama 3.3 Nemotron Super 49B V1.5: input $0.10/1M tokens, output $0.40/1M tokens, 131K ctx
  - NVIDIA: Nemotron 3 Nano 30B A3B: input $0.05/1M tokens, output $0.20/1M tokens, 262K ctx
  - NVIDIA: Nemotron 3 Nano 30B A3B (free): input $0.00/1M tokens, output $0.00/1M tokens, 256K ctx
  - NVIDIA: Nemotron 3 Nano Omni (free): input $0.00/1M tokens, output $0.00/1M tokens, 256K ctx
  - NVIDIA: Nemotron 3 Super: input $0.09/1M tokens, output $0.45/1M tokens, 1M ctx
  - NVIDIA: Nemotron 3 Super (free): input $0.00/1M tokens, output $0.00/1M tokens, 1M ctx
  - NVIDIA: Nemotron 3 Ultra: input $0.50/1M tokens, output $2.50/1M tokens, 1M ctx
  - NVIDIA: Nemotron 3 Ultra (free): input $0.00/1M tokens, output $0.00/1M tokens, 1M ctx
  - NVIDIA: Nemotron 3.5 Content Safety (free): input $0.00/1M tokens, output $0.00/1M tokens, 128K ctx
  - NVIDIA: Nemotron Nano 12B 2 VL (free): input $0.00/1M tokens, output $0.00/1M tokens, 128K ctx
  - NVIDIA: Nemotron Nano 9B V2: input $0.04/1M tokens, output $0.16/1M tokens, 131K ctx
  - NVIDIA: Nemotron Nano 9B V2 (free): input $0.00/1M tokens, output $0.00/1M tokens, 128K ctx

### openai
  - OpenAI: GPT Audio: input $2.50/1M tokens, output $10.00/1M tokens, 128K ctx
  - OpenAI: GPT Audio Mini: input $0.60/1M tokens, output $2.40/1M tokens, 128K ctx
  - OpenAI: GPT Chat Latest: input $5.00/1M tokens, output $30.00/1M tokens, 400K ctx
  - OpenAI: GPT-3.5 Turbo: input $0.50/1M tokens, output $1.50/1M tokens, 16K ctx
  - OpenAI: GPT-3.5 Turbo (older v0613): input $1.00/1M tokens, output $2.00/1M tokens, 4K ctx
  - OpenAI: GPT-3.5 Turbo 16k: input $3.00/1M tokens, output $4.00/1M tokens, 16K ctx
  - OpenAI: GPT-3.5 Turbo Instruct: input $1.50/1M tokens, output $2.00/1M tokens, 4K ctx
  - OpenAI: GPT-4: input $30.00/1M tokens, output $60.00/1M tokens, 8K ctx
  - OpenAI: GPT-4 Turbo: input $10.00/1M tokens, output $30.00/1M tokens, 128K ctx
  - OpenAI: GPT-4 Turbo (older v1106): input $10.00/1M tokens, output $30.00/1M tokens, 128K ctx
  - OpenAI: GPT-4 Turbo Preview: input $10.00/1M tokens, output $30.00/1M tokens, 128K ctx
  - OpenAI: GPT-4.1: input $2.00/1M tokens, output $8.00/1M tokens, 1M ctx
  - OpenAI: GPT-4.1 Mini: input $0.40/1M tokens, output $1.60/1M tokens, 1M ctx
  - OpenAI: GPT-4.1 Nano: input $0.10/1M tokens, output $0.40/1M tokens, 1M ctx
  - OpenAI: GPT-4o: input $2.50/1M tokens, output $10.00/1M tokens, 128K ctx
  - OpenAI: GPT-4o (2024-05-13): input $5.00/1M tokens, output $15.00/1M tokens, 128K ctx
  - OpenAI: GPT-4o (2024-08-06): input $2.50/1M tokens, output $10.00/1M tokens, 128K ctx
  - OpenAI: GPT-4o (2024-11-20): input $2.50/1M tokens, output $10.00/1M tokens, 128K ctx
  - OpenAI: GPT-4o Search Preview: input $2.50/1M tokens, output $10.00/1M tokens, 128K ctx
  - OpenAI: GPT-4o-mini: input $0.15/1M tokens, output $0.60/1M tokens, 128K ctx
  - OpenAI: GPT-4o-mini (2024-07-18): input $0.15/1M tokens, output $0.60/1M tokens, 128K ctx
  - OpenAI: GPT-4o-mini Search Preview: input $0.15/1M tokens, output $0.60/1M tokens, 128K ctx
  - OpenAI: GPT-5: input $1.25/1M tokens, output $10.00/1M tokens, 400K ctx
  - OpenAI: GPT-5 Chat: input $1.25/1M tokens, output $10.00/1M tokens, 128K ctx
  - OpenAI: GPT-5 Codex: input $1.25/1M tokens, output $10.00/1M tokens, 400K ctx
  - OpenAI: GPT-5 Image: input $10.00/1M tokens, output $10.00/1M tokens, 400K ctx
  - OpenAI: GPT-5 Image Mini: input $2.50/1M tokens, output $2.00/1M tokens, 400K ctx
  - OpenAI: GPT-5 Mini: input $0.25/1M tokens, output $2.00/1M tokens, 400K ctx
  - OpenAI: GPT-5 Nano: input $0.05/1M tokens, output $0.40/1M tokens, 400K ctx
  - OpenAI: GPT-5 Pro: input $15.00/1M tokens, output $120.00/1M tokens, 400K ctx
  - OpenAI: GPT-5.1: input $1.25/1M tokens, output $10.00/1M tokens, 400K ctx
  - OpenAI: GPT-5.1 Chat: input $1.25/1M tokens, output $10.00/1M tokens, 128K ctx
  - OpenAI: GPT-5.1-Codex: input $1.25/1M tokens, output $10.00/1M tokens, 400K ctx
  - OpenAI: GPT-5.1-Codex-Max: input $1.25/1M tokens, output $10.00/1M tokens, 400K ctx
  - OpenAI: GPT-5.1-Codex-Mini: input $0.25/1M tokens, output $2.00/1M tokens, 400K ctx
  - OpenAI: GPT-5.2: input $1.75/1M tokens, output $14.00/1M tokens, 400K ctx
  - OpenAI: GPT-5.2 Chat: input $1.75/1M tokens, output $14.00/1M tokens, 128K ctx
  - OpenAI: GPT-5.2 Pro: input $21.00/1M tokens, output $168.00/1M tokens, 400K ctx
  - OpenAI: GPT-5.2-Codex: input $1.75/1M tokens, output $14.00/1M tokens, 400K ctx
  - OpenAI: GPT-5.3 Chat: input $1.75/1M tokens, output $14.00/1M tokens, 128K ctx
  - OpenAI: GPT-5.3-Codex: input $1.75/1M tokens, output $14.00/1M tokens, 400K ctx
  - OpenAI: GPT-5.4: input $2.50/1M tokens, output $15.00/1M tokens, 1M ctx
  - OpenAI: GPT-5.4 Image 2: input $8.00/1M tokens, output $15.00/1M tokens, 272K ctx
  - OpenAI: GPT-5.4 Mini: input $0.75/1M tokens, output $4.50/1M tokens, 400K ctx
  - OpenAI: GPT-5.4 Nano: input $0.20/1M tokens, output $1.25/1M tokens, 400K ctx
  - OpenAI: GPT-5.4 Pro: input $30.00/1M tokens, output $180.00/1M tokens, 1M ctx
  - OpenAI: GPT-5.5: input $5.00/1M tokens, output $30.00/1M tokens, 1M ctx
  - OpenAI: GPT-5.5 Pro: input $30.00/1M tokens, output $180.00/1M tokens, 1M ctx
  - OpenAI: gpt-oss-120b: input $0.04/1M tokens, output $0.18/1M tokens, 131K ctx
  - OpenAI: gpt-oss-120b (free): input $0.00/1M tokens, output $0.00/1M tokens, 131K ctx
  - OpenAI: gpt-oss-20b: input $0.03/1M tokens, output $0.14/1M tokens, 131K ctx
  - OpenAI: gpt-oss-20b (free): input $0.00/1M tokens, output $0.00/1M tokens, 131K ctx
  - OpenAI: gpt-oss-safeguard-20b: input $0.07/1M tokens, output $0.30/1M tokens, 131K ctx
  - OpenAI: o1: input $15.00/1M tokens, output $60.00/1M tokens, 200K ctx
  - OpenAI: o1-pro: input $150.00/1M tokens, output $600.00/1M tokens, 200K ctx
  - OpenAI: o3: input $2.00/1M tokens, output $8.00/1M tokens, 200K ctx
  - OpenAI: o3 Deep Research: input $10.00/1M tokens, output $40.00/1M tokens, 200K ctx
  - OpenAI: o3 Mini: input $1.10/1M tokens, output $4.40/1M tokens, 200K ctx
  - OpenAI: o3 Mini High: input $1.10/1M tokens, output $4.40/1M tokens, 200K ctx
  - OpenAI: o3 Pro: input $20.00/1M tokens, output $80.00/1M tokens, 200K ctx
  - OpenAI: o4 Mini: input $1.10/1M tokens, output $4.40/1M tokens, 200K ctx
  - OpenAI: o4 Mini Deep Research: input $2.00/1M tokens, output $8.00/1M tokens, 200K ctx
  - OpenAI: o4 Mini High: input $1.10/1M tokens, output $4.40/1M tokens, 200K ctx

### openrouter
  - Auto Router: input $-1000000.00/1M tokens, output $-1000000.00/1M tokens, 2M ctx
  - Body Builder (beta): input $-1000000.00/1M tokens, output $-1000000.00/1M tokens, 128K ctx
  - Free Models Router: input $0.00/1M tokens, output $0.00/1M tokens, 200K ctx
  - OpenRouter: Fusion: input $-1000000.00/1M tokens, output $-1000000.00/1M tokens, 128K ctx
  - Owl Alpha: input $0.00/1M tokens, output $0.00/1M tokens, 1M ctx
  - Pareto Code Router: input $-1000000.00/1M tokens, output $-1000000.00/1M tokens, 2M ctx

### perceptron
  - Perceptron: Perceptron Mk1: input $0.15/1M tokens, output $1.50/1M tokens, 33K ctx

### perplexity
  - Perplexity: Sonar: input $1.00/1M tokens, output $1.00/1M tokens, 127K ctx
  - Perplexity: Sonar Deep Research: input $2.00/1M tokens, output $8.00/1M tokens, 128K ctx
  - Perplexity: Sonar Pro: input $3.00/1M tokens, output $15.00/1M tokens, 200K ctx
  - Perplexity: Sonar Pro Search: input $3.00/1M tokens, output $15.00/1M tokens, 200K ctx
  - Perplexity: Sonar Reasoning Pro: input $2.00/1M tokens, output $8.00/1M tokens, 128K ctx

### poolside
  - Poolside: Laguna M.1 (free): input $0.00/1M tokens, output $0.00/1M tokens, 262K ctx
  - Poolside: Laguna XS.2 (free): input $0.00/1M tokens, output $0.00/1M tokens, 262K ctx

### prime-intellect
  - Prime Intellect: INTELLECT-3: input $0.20/1M tokens, output $1.10/1M tokens, 131K ctx

### qwen
  - Qwen: Qwen Plus 0728: input $0.26/1M tokens, output $0.78/1M tokens, 1M ctx
  - Qwen: Qwen Plus 0728 (thinking): input $0.26/1M tokens, output $0.78/1M tokens, 1M ctx
  - Qwen: Qwen-Plus: input $0.26/1M tokens, output $0.78/1M tokens, 1M ctx
  - Qwen: Qwen2.5 7B Instruct: input $0.04/1M tokens, output $0.10/1M tokens, 131K ctx
  - Qwen: Qwen2.5 VL 72B Instruct: input $0.25/1M tokens, output $0.75/1M tokens, 131K ctx
  - Qwen: Qwen3 14B: input $0.10/1M tokens, output $0.24/1M tokens, 132K ctx
  - Qwen: Qwen3 235B A22B: input $0.46/1M tokens, output $1.82/1M tokens, 131K ctx
  - Qwen: Qwen3 235B A22B Instruct 2507: input $0.07/1M tokens, output $0.10/1M tokens, 262K ctx
  - Qwen: Qwen3 235B A22B Thinking 2507: input $0.10/1M tokens, output $0.10/1M tokens, 262K ctx
  - Qwen: Qwen3 30B A3B: input $0.09/1M tokens, output $0.45/1M tokens, 131K ctx
  - Qwen: Qwen3 30B A3B Instruct 2507: input $0.05/1M tokens, output $0.19/1M tokens, 131K ctx
  - Qwen: Qwen3 30B A3B Thinking 2507: input $0.08/1M tokens, output $0.40/1M tokens, 131K ctx
  - Qwen: Qwen3 32B: input $0.08/1M tokens, output $0.28/1M tokens, 131K ctx
  - Qwen: Qwen3 8B: input $0.05/1M tokens, output $0.40/1M tokens, 131K ctx
  - Qwen: Qwen3 Coder 30B A3B Instruct: input $0.07/1M tokens, output $0.27/1M tokens, 160K ctx
  - Qwen: Qwen3 Coder 480B A35B: input $0.22/1M tokens, output $1.80/1M tokens, 1M ctx
  - Qwen: Qwen3 Coder 480B A35B (free): input $0.00/1M tokens, output $0.00/1M tokens, 1M ctx
  - Qwen: Qwen3 Coder Flash: input $0.20/1M tokens, output $0.97/1M tokens, 1M ctx
  - Qwen: Qwen3 Coder Next: input $0.11/1M tokens, output $0.80/1M tokens, 262K ctx
  - Qwen: Qwen3 Coder Plus: input $0.65/1M tokens, output $3.25/1M tokens, 1M ctx
  - Qwen: Qwen3 Max: input $0.78/1M tokens, output $3.90/1M tokens, 262K ctx
  - Qwen: Qwen3 Max Thinking: input $0.78/1M tokens, output $3.90/1M tokens, 262K ctx
  - Qwen: Qwen3 Next 80B A3B Instruct: input $0.09/1M tokens, output $1.10/1M tokens, 262K ctx
  - Qwen: Qwen3 Next 80B A3B Instruct (free): input $0.00/1M tokens, output $0.00/1M tokens, 262K ctx
  - Qwen: Qwen3 Next 80B A3B Thinking: input $0.10/1M tokens, output $0.78/1M tokens, 262K ctx
  - Qwen: Qwen3 VL 235B A22B Instruct: input $0.20/1M tokens, output $0.88/1M tokens, 262K ctx
  - Qwen: Qwen3 VL 235B A22B Thinking: input $0.26/1M tokens, output $2.60/1M tokens, 131K ctx
  - Qwen: Qwen3 VL 30B A3B Instruct: input $0.13/1M tokens, output $0.52/1M tokens, 262K ctx
  - Qwen: Qwen3 VL 30B A3B Thinking: input $0.13/1M tokens, output $1.56/1M tokens, 131K ctx
  - Qwen: Qwen3 VL 32B Instruct: input $0.10/1M tokens, output $0.42/1M tokens, 262K ctx
  - Qwen: Qwen3 VL 8B Instruct: input $0.08/1M tokens, output $0.50/1M tokens, 256K ctx
  - Qwen: Qwen3 VL 8B Thinking: input $0.12/1M tokens, output $1.36/1M tokens, 256K ctx
  - Qwen: Qwen3.5 397B A17B: input $0.39/1M tokens, output $2.34/1M tokens, 262K ctx
  - Qwen: Qwen3.5 Plus 2026-02-15: input $0.26/1M tokens, output $1.56/1M tokens, 1M ctx
  - Qwen: Qwen3.5 Plus 2026-04-20: input $0.30/1M tokens, output $1.80/1M tokens, 1M ctx
  - Qwen: Qwen3.5-122B-A10B: input $0.26/1M tokens, output $2.08/1M tokens, 262K ctx
  - Qwen: Qwen3.5-27B: input $0.20/1M tokens, output $1.56/1M tokens, 262K ctx
  - Qwen: Qwen3.5-35B-A3B: input $0.14/1M tokens, output $1.00/1M tokens, 262K ctx
  - Qwen: Qwen3.5-9B: input $0.04/1M tokens, output $0.15/1M tokens, 262K ctx
  - Qwen: Qwen3.5-Flash: input $0.07/1M tokens, output $0.26/1M tokens, 1M ctx
  - Qwen: Qwen3.6 27B: input $0.29/1M tokens, output $3.20/1M tokens, 262K ctx
  - Qwen: Qwen3.6 35B A3B: input $0.14/1M tokens, output $1.00/1M tokens, 262K ctx
  - Qwen: Qwen3.6 Flash: input $0.19/1M tokens, output $1.13/1M tokens, 1M ctx
  - Qwen: Qwen3.6 Max Preview: input $1.04/1M tokens, output $6.24/1M tokens, 262K ctx
  - Qwen: Qwen3.6 Plus: input $0.33/1M tokens, output $1.95/1M tokens, 1M ctx
  - Qwen: Qwen3.7 Max: input $1.25/1M tokens, output $3.75/1M tokens, 1M ctx
  - Qwen: Qwen3.7 Plus: input $0.40/1M tokens, output $1.60/1M tokens, 1M ctx
  - Qwen2.5 72B Instruct: input $0.36/1M tokens, output $0.40/1M tokens, 131K ctx
  - Qwen2.5 Coder 32B Instruct: input $0.66/1M tokens, output $1.00/1M tokens, 128K ctx

### rekaai
  - Reka Edge: input $0.10/1M tokens, output $0.10/1M tokens, 16K ctx
  - Reka Flash 3: input $0.10/1M tokens, output $0.20/1M tokens, 66K ctx

### relace
  - Relace: Relace Apply 3: input $0.85/1M tokens, output $1.25/1M tokens, 256K ctx
  - Relace: Relace Search: input $1.00/1M tokens, output $3.00/1M tokens, 256K ctx

### sao10k
  - Sao10K: Llama 3 8B Lunaris: input $0.04/1M tokens, output $0.05/1M tokens, 8K ctx
  - Sao10K: Llama 3.1 70B Hanami x1: input $3.00/1M tokens, output $3.00/1M tokens, 16K ctx
  - Sao10K: Llama 3.1 Euryale 70B v2.2: input $0.85/1M tokens, output $0.85/1M tokens, 131K ctx
  - Sao10K: Llama 3.3 Euryale 70B: input $0.65/1M tokens, output $0.75/1M tokens, 131K ctx

### stepfun
  - StepFun: Step 3.5 Flash: input $0.09/1M tokens, output $0.30/1M tokens, 262K ctx
  - StepFun: Step 3.7 Flash: input $0.20/1M tokens, output $1.15/1M tokens, 256K ctx

### switchpoint
  - Switchpoint Router: input $0.85/1M tokens, output $3.40/1M tokens, 131K ctx

### tencent
  - Tencent: Hunyuan A13B Instruct: input $0.14/1M tokens, output $0.57/1M tokens, 131K ctx
  - Tencent: Hy3 preview: input $0.06/1M tokens, output $0.21/1M tokens, 262K ctx

### thedrummer
  - TheDrummer: Cydonia 24B V4.1: input $0.30/1M tokens, output $0.50/1M tokens, 131K ctx
  - TheDrummer: Rocinante 12B: input $0.17/1M tokens, output $0.43/1M tokens, 33K ctx
  - TheDrummer: Skyfall 36B V2: input $0.55/1M tokens, output $0.80/1M tokens, 33K ctx
  - TheDrummer: UnslopNemo 12B: input $0.40/1M tokens, output $0.40/1M tokens, 33K ctx

### undi95
  - ReMM SLERP 13B: input $0.45/1M tokens, output $0.65/1M tokens, 6K ctx

### upstage
  - Upstage: Solar Pro 3: input $0.15/1M tokens, output $0.60/1M tokens, 128K ctx

### writer
  - Writer: Palmyra X5: input $0.60/1M tokens, output $6.00/1M tokens, 1M ctx

### x-ai
  - xAI: Grok 4.20: input $1.25/1M tokens, output $2.50/1M tokens, 2M ctx
  - xAI: Grok 4.20 Multi-Agent: input $2.00/1M tokens, output $6.00/1M tokens, 2M ctx
  - xAI: Grok 4.3: input $1.25/1M tokens, output $2.50/1M tokens, 1M ctx
  - xAI: Grok Build 0.1: input $1.00/1M tokens, output $2.00/1M tokens, 256K ctx

### xiaomi
  - Xiaomi: MiMo-V2-Flash: input $0.10/1M tokens, output $0.30/1M tokens, 262K ctx
  - Xiaomi: MiMo-V2.5: input $0.14/1M tokens, output $0.28/1M tokens, 1M ctx
  - Xiaomi: MiMo-V2.5-Pro: input $0.43/1M tokens, output $0.87/1M tokens, 1M ctx

### z-ai
  - Z.ai: GLM 4 32B : input $0.10/1M tokens, output $0.10/1M tokens, 128K ctx
  - Z.ai: GLM 4.5: input $0.60/1M tokens, output $2.20/1M tokens, 131K ctx
  - Z.ai: GLM 4.5 Air: input $0.13/1M tokens, output $0.85/1M tokens, 131K ctx
  - Z.ai: GLM 4.5 Air (free): input $0.00/1M tokens, output $0.00/1M tokens, 131K ctx
  - Z.ai: GLM 4.5V: input $0.60/1M tokens, output $1.80/1M tokens, 66K ctx
  - Z.ai: GLM 4.6: input $0.43/1M tokens, output $1.74/1M tokens, 203K ctx
  - Z.ai: GLM 4.6V: input $0.30/1M tokens, output $0.90/1M tokens, 131K ctx
  - Z.ai: GLM 4.7: input $0.40/1M tokens, output $1.75/1M tokens, 203K ctx
  - Z.ai: GLM 4.7 Flash: input $0.06/1M tokens, output $0.40/1M tokens, 203K ctx
  - Z.ai: GLM 5: input $0.60/1M tokens, output $1.92/1M tokens, 203K ctx
  - Z.ai: GLM 5 Turbo: input $1.20/1M tokens, output $4.00/1M tokens, 203K ctx
  - Z.ai: GLM 5.1: input $0.98/1M tokens, output $3.08/1M tokens, 203K ctx
  - Z.ai: GLM 5V Turbo: input $1.20/1M tokens, output $4.00/1M tokens, 203K ctx

---

## AI Subscription Plans

### ChatGPT
Provider: openai
  - Free: $0/mo
  - Go: $8/mo
  - Plus: $20/mo
  - Pro: $200/mo
  - Business: $25/mo (or $20/mo annual)
  - Enterprise: Contact Sales

### Claude.ai
Provider: anthropic
  - Free: $0/mo
  - Pro: $20/mo (or $17/mo annual)
  - Max 5×: $100/mo
  - Max 20×: $200/mo
  - Team: $30/mo (or $25/mo annual)
  - Enterprise: Contact Sales

### Google AI
Provider: google
  - Free: $0/mo
  - AI Plus: $7.99/mo
  - AI Pro: $19.99/mo
  - AI Ultra: $249.99/mo

### Grok
Provider: x-ai
  - Free: $0/mo
  - X Premium: $8/mo
  - X Premium+: $40/mo
  - SuperGrok: $30/mo (or $25/mo annual)
  - SuperGrok Heavy: $300/mo

### Perplexity
Provider: perplexityai
  - Free: $0/mo
  - Pro: $20/mo (or $16.67/mo annual)
  - Max: $200/mo (or $166.67/mo annual)
  - Enterprise Pro: $40/mo
  - Enterprise Max: $325/mo (or $270.83/mo annual)

### Le Chat
Provider: mistralai
  - Free: $0/mo
  - Student: $7.04/mo
  - Pro: $14.99/mo (or $6.99/mo annual)
  - Team: $24.99/mo (or $19.99/mo annual)
  - Enterprise: Contact Sales

### Kimi
Provider: moonshotai
  - Adagio: $0/mo
  - Andante: $19/mo (or $16/mo annual)
  - Moderato: $39/mo (or $33/mo annual)
  - Vivace: $199/mo

### ERNIE Bot
Provider: baidu
  - Free: $0/mo

### Doubao
Provider: bytedance
  - Free: $0/mo
  - Pro: $9.99/mo (or $8.33/mo annual)

### Z.ai (GLM)
Provider: zhipuai
  - Free: $0/mo
  - Consumer: $4.41/mo (or $3.59/mo annual)
  - Coding Plan: $10/mo

### Cursor
Provider: cursor
  - Hobby: $0/mo
  - Pro: $20/mo (or $16/mo annual)
  - Pro+: $60/mo
  - Ultra: $200/mo
  - Teams: $40/mo (or $32/mo annual)
  - Enterprise: Contact Sales

### GitHub Copilot
Provider: microsoft
  - Free: $0/mo
  - Pro: $10/mo (or $8.33/mo annual)
  - Pro+: $39/mo (or $32.5/mo annual)
  - Business: $19/mo
  - Enterprise: $39/mo

### Windsurf
Provider: windsurf
  - Free: $0/mo
  - Pro: $20/mo
  - Max: $200/mo
  - Teams: $40/mo
  - Enterprise: $60/mo

### Claude Code
Provider: anthropic
  - API Usage: Contact Sales

### Hailuo AI
Provider: minimax
  - Free: $0/mo
  - Standard: $9.99/mo
  - Pro: $34.99/mo
  - Ultra: $124.99/mo
  - Max: $199.99/mo

### Microsoft Copilot
Provider: microsoft
  - Free: $0/mo
  - M365 Premium: $19.99/mo
  - M365 Business: $21/mo (or $18/mo annual)

---

## Notes

- Input tokens = text you send to the model
- Output tokens = text the model generates (usually 2–5× more expensive)
- Context window = maximum total tokens (input + output) per request
- Prices may vary by region, tier, or negotiated enterprise agreement
- Free models may have rate limits or restricted access
- Source: https://token.app/ — updated hourly by Measurable AI (https://measurable.ai/)