# token.app — Full AI Pricing Data
# https://token.app/
# Source: OpenRouter Models API + provider pricing pages
# Last updated: Sat, 06 Jun 2026 02:00:33 GMT
# Total models: 344
# Total providers: 57
#
# All prices in USD per 1,000,000 tokens (1M tokens).
# "free" means the model is available at no cost.
# Context window shown in K (thousands) or M (millions) of tokens.
# Data refreshed hourly. Always verify with official provider docs.
#
# For a machine-readable JSON version: https://token.app/api/models
# For the full site: https://token.app/
# For the LLMs index: https://token.app/llms.txt

---

## API Token Pricing — All Models by Provider

### ~anthropic
- Anthropic Claude Haiku Latest: input $1.00/1M tokens, output $5.00/1M tokens, 200K ctx
- Anthropic Claude Sonnet Latest: input $3.00/1M tokens, output $15.00/1M tokens, 1M ctx
- Anthropic: Claude Opus Latest: input $5.00/1M tokens, output $25.00/1M tokens, 1M ctx

### ~google
- Google Gemini Flash Latest: input $1.50/1M tokens, output $9.00/1M tokens, 1M ctx
- Google Gemini Pro Latest: input $2.00/1M tokens, output $12.00/1M tokens, 1M ctx

### ~moonshotai
- MoonshotAI Kimi Latest: input $0.68/1M tokens, output $3.42/1M tokens, 262K ctx

### ~openai
- OpenAI GPT Latest: input $5.00/1M tokens, output $30.00/1M tokens, 1M ctx
- OpenAI GPT Mini Latest: input $0.75/1M tokens, output $4.50/1M tokens, 400K ctx

### ai21
- AI21: Jamba Large 1.7: input $2.00/1M tokens, output $8.00/1M tokens, 256K ctx

### aion-labs
- AionLabs: Aion-1.0: input $4.00/1M tokens, output $8.00/1M tokens, 131K ctx
- AionLabs: Aion-1.0-Mini: input $0.70/1M tokens, output $1.40/1M tokens, 131K ctx
- AionLabs: Aion-2.0: input $0.80/1M tokens, output $1.60/1M tokens, 131K ctx
- AionLabs: Aion-RP 1.0 (8B): input $0.80/1M tokens, output $1.60/1M tokens, 33K ctx

### allenai
- AllenAI: Olmo 3 32B Think: input $0.15/1M tokens, output $0.50/1M tokens, 66K ctx

### amazon
- Amazon: Nova 2 Lite: input $0.30/1M tokens, output $2.50/1M tokens, 1M ctx
- Amazon: Nova Lite 1.0: input $0.06/1M tokens, output $0.24/1M tokens, 300K ctx
- Amazon: Nova Micro 1.0: input $0.04/1M tokens, output $0.14/1M tokens, 128K ctx
- Amazon: Nova Premier 1.0: input $2.50/1M tokens, output $12.50/1M tokens, 1M ctx
- Amazon: Nova Pro 1.0: input $0.80/1M tokens, output $3.20/1M tokens, 300K ctx

### anthracite-org
- Magnum v4 72B: input $3.00/1M tokens, output $5.00/1M tokens, 33K ctx

### anthropic
- Anthropic: Claude 3 Haiku: input $0.25/1M tokens, output $1.25/1M tokens, 200K ctx
- Anthropic: Claude 3.5 Haiku: input $0.80/1M tokens, output $4.00/1M tokens, 200K ctx
- Anthropic: Claude Haiku 4.5: input $1.00/1M tokens, output $5.00/1M tokens, 200K ctx
- Anthropic: Claude Opus 4: input $15.00/1M tokens, output $75.00/1M tokens, 200K ctx
- Anthropic: Claude Opus 4.1: input $15.00/1M tokens, output $75.00/1M tokens, 200K ctx
- Anthropic: Claude Opus 4.5: input $5.00/1M tokens, output $25.00/1M tokens, 200K ctx
- Anthropic: Claude Opus 4.6: input $5.00/1M tokens, output $25.00/1M tokens, 1M ctx
- Anthropic: Claude Opus 4.6 (Fast): input $30.00/1M tokens, output $150.00/1M tokens, 1M ctx
- Anthropic: Claude Opus 4.7: input $5.00/1M tokens, output $25.00/1M tokens, 1M ctx
- Anthropic: Claude Opus 4.7 (Fast): input $30.00/1M tokens, output $150.00/1M tokens, 1M ctx
- Anthropic: Claude Opus 4.8: input $5.00/1M tokens, output $25.00/1M tokens, 1M ctx
- Anthropic: Claude Opus 4.8 (Fast): input $10.00/1M tokens, output $50.00/1M tokens, 1M ctx
- Anthropic: Claude Sonnet 4: input $3.00/1M tokens, output $15.00/1M tokens, 1M ctx
- Anthropic: Claude Sonnet 4.5: input $3.00/1M tokens, output $15.00/1M tokens, 1M ctx
- Anthropic: Claude Sonnet 4.6: input $3.00/1M tokens, output $15.00/1M tokens, 1M ctx

### arcee-ai
- Arcee AI: Coder Large: input $0.50/1M tokens, output $0.80/1M tokens, 33K ctx
- Arcee AI: Maestro Reasoning: input $0.90/1M tokens, output $3.30/1M tokens, 131K ctx
- Arcee AI: Spotlight: input $0.18/1M tokens, output $0.18/1M tokens, 131K ctx
- Arcee AI: Trinity Large Thinking: input $0.22/1M tokens, output $0.85/1M tokens, 262K ctx
- Arcee AI: Trinity Mini: input $0.04/1M tokens, output $0.15/1M tokens, 131K ctx
- Arcee AI: Virtuoso Large: input $0.75/1M tokens, output $1.20/1M tokens, 131K ctx

### baidu
- Baidu: ERNIE 4.5 VL 28B A3B: input $0.14/1M tokens, output $0.56/1M tokens, 131K ctx
- Baidu: ERNIE 4.5 VL 424B A47B: input $0.42/1M tokens, output $1.25/1M tokens, 131K ctx

### bytedance
- ByteDance: UI-TARS 7B: input $0.10/1M tokens, output $0.20/1M tokens, 128K ctx

### bytedance-seed
- ByteDance Seed: Seed 1.6: input $0.25/1M tokens, output $2.00/1M tokens, 262K ctx
- ByteDance Seed: Seed 1.6 Flash: input $0.07/1M tokens, output $0.30/1M tokens, 262K ctx
- ByteDance Seed: Seed-2.0-Lite: input $0.25/1M tokens, output $2.00/1M tokens, 262K ctx
- ByteDance Seed: Seed-2.0-Mini: input $0.10/1M tokens, output $0.40/1M tokens, 262K ctx

### cognitivecomputations
- Venice: Uncensored (free): input $0.00/1M tokens, output $0.00/1M tokens, 33K ctx

### cohere
- Cohere: Command A: input $2.50/1M tokens, output $10.00/1M tokens, 256K ctx
- Cohere: Command R (08-2024): input $0.15/1M tokens, output $0.60/1M tokens, 128K ctx
- Cohere: Command R+ (08-2024): input $2.50/1M tokens, output $10.00/1M tokens, 128K ctx
- Cohere: Command R7B (12-2024): input $0.04/1M tokens, output $0.15/1M tokens, 128K ctx

### deepcogito
- Deep Cogito: Cogito v2.1 671B: input $1.25/1M tokens, output $1.25/1M tokens, 128K ctx

### deepseek
- DeepSeek: DeepSeek V3: input $0.20/1M tokens, output $0.80/1M tokens, 131K ctx
- DeepSeek: DeepSeek V3 0324: input $0.20/1M tokens, output $0.77/1M tokens, 164K ctx
- DeepSeek: DeepSeek V3.1: input $0.21/1M tokens, output $0.79/1M tokens, 164K ctx
- DeepSeek: DeepSeek V3.1 Terminus: input $0.27/1M tokens, output $0.95/1M tokens, 164K ctx
- DeepSeek: DeepSeek V3.2: input $0.23/1M tokens, output $0.34/1M tokens, 131K ctx
- DeepSeek: DeepSeek V3.2 Exp: input $0.27/1M tokens, output $0.41/1M tokens, 164K ctx
- DeepSeek: DeepSeek V4 Flash: input $0.10/1M tokens, output $0.20/1M tokens, 1M ctx
- DeepSeek: DeepSeek V4 Pro: input $0.43/1M tokens, output $0.87/1M tokens, 1M ctx
- DeepSeek: R1: input $0.70/1M tokens, output $2.50/1M tokens, 164K ctx
- DeepSeek: R1 0528: input $0.50/1M tokens, output $2.15/1M tokens, 164K ctx
- DeepSeek: R1 Distill Llama 70B: input $0.70/1M tokens, output $0.80/1M tokens, 131K ctx
- DeepSeek: R1 Distill Qwen 32B: input $0.29/1M tokens, output $0.29/1M tokens, 128K ctx

### essentialai
- EssentialAI: Rnj 1 Instruct: input $0.15/1M tokens, output $0.15/1M tokens, 33K ctx

### google
- Google: Gemini 2.5 Flash: input $0.30/1M tokens, output $2.50/1M tokens, 1M ctx
- Google: Gemini 2.5 Flash Lite: input $0.10/1M tokens, output $0.40/1M tokens, 1M ctx
- Google: Gemini 2.5 Flash Lite Preview 09-2025: input $0.10/1M tokens, output $0.40/1M tokens, 1M ctx
- Google: Gemini 2.5 Pro: input $1.25/1M tokens, output $10.00/1M tokens, 1M ctx
- Google: Gemini 2.5 Pro Preview 05-06: input $1.25/1M tokens, output $10.00/1M tokens, 1M ctx
- Google: Gemini 2.5 Pro Preview 06-05: input $1.25/1M tokens, output $10.00/1M tokens, 1M ctx
- Google: Gemini 3 Flash Preview: input $0.50/1M tokens, output $3.00/1M tokens, 1M ctx
- Google: Gemini 3.1 Flash Lite: input $0.25/1M tokens, output $1.50/1M tokens, 1M ctx
- Google: Gemini 3.1 Flash Lite Preview: input $0.25/1M tokens, output $1.50/1M tokens, 1M ctx
- Google: Gemini 3.1 Pro Preview: input $2.00/1M tokens, output $12.00/1M tokens, 1M ctx
- Google: Gemini 3.1 Pro Preview Custom Tools: input $2.00/1M tokens, output $12.00/1M tokens, 1M ctx
- Google: Gemini 3.5 Flash: input $1.50/1M tokens, output $9.00/1M tokens, 1M ctx
- Google: Gemma 2 27B: input $0.65/1M tokens, output $0.65/1M tokens, 8K ctx
- Google: Gemma 3 12B: input $0.04/1M tokens, output $0.13/1M tokens, 131K ctx
- Google: Gemma 3 27B: input $0.08/1M tokens, output $0.16/1M tokens, 131K ctx
- Google: Gemma 3 4B: input $0.04/1M tokens, output $0.08/1M tokens, 131K ctx
- Google: Gemma 3n 4B: input $0.06/1M tokens, output $0.12/1M tokens, 33K ctx
- Google: Gemma 4 26B A4B: input $0.06/1M tokens, output $0.33/1M tokens, 262K ctx
- Google: Gemma 4 26B A4B (free): input $0.00/1M tokens, output $0.00/1M tokens, 262K ctx
- Google: Gemma 4 31B: input $0.12/1M tokens, output $0.36/1M tokens, 262K ctx
- Google: Gemma 4 31B (free): input $0.00/1M tokens, output $0.00/1M tokens, 262K ctx
- Google: Lyria 3 Clip Preview: input $0.00/1M tokens, output $0.00/1M tokens, 1M ctx
- Google: Lyria 3 Pro Preview: input $0.00/1M tokens, output $0.00/1M tokens, 1M ctx
- Google: Nano Banana (Gemini 2.5 Flash Image): input $0.30/1M tokens, output $2.50/1M tokens, 33K ctx
- Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview): input $0.50/1M tokens, output $3.00/1M tokens, 131K ctx
- Google: Nano Banana Pro (Gemini 3 Pro Image Preview): input $2.00/1M tokens, output $12.00/1M tokens, 66K ctx

### gryphe
- MythoMax 13B: input $0.06/1M tokens, output $0.06/1M tokens, 4K ctx

### ibm-granite
- IBM: Granite 4.0 Micro: input $0.02/1M tokens, output $0.11/1M tokens, 131K ctx
- IBM: Granite 4.1 8B: input $0.05/1M tokens, output $0.10/1M tokens, 131K ctx

### inception
- Inception: Mercury 2: input $0.25/1M tokens, output $0.75/1M tokens, 128K ctx

### inclusionai
- inclusionAI: Ling-2.6-1T: input $0.07/1M tokens, output $0.63/1M tokens, 262K ctx
- inclusionAI: Ling-2.6-flash: input $0.01/1M tokens, output $0.03/1M tokens, 262K ctx
- inclusionAI: Ring-2.6-1T: input $0.07/1M tokens, output $0.63/1M tokens, 262K ctx

### inflection
- Inflection: Inflection 3 Pi: input $2.50/1M tokens, output $10.00/1M tokens, 8K ctx
- Inflection: Inflection 3 Productivity: input $2.50/1M tokens, output $10.00/1M tokens, 8K ctx

### kwaipilot
- Kwaipilot: KAT-Coder-Pro V2: input $0.30/1M tokens, output $1.20/1M tokens, 256K ctx

### liquid
- LiquidAI: LFM2-24B-A2B: input $0.03/1M tokens, output $0.12/1M tokens, 128K ctx
- LiquidAI: LFM2.5-1.2B-Instruct (free): input $0.00/1M tokens, output $0.00/1M tokens, 33K ctx
- LiquidAI: LFM2.5-1.2B-Thinking (free): input $0.00/1M tokens, output $0.00/1M tokens, 33K ctx

### mancer
- Mancer: Weaver (alpha): input $0.75/1M tokens, output $1.00/1M tokens, 8K ctx

### meta-llama
- Llama Guard 3 8B: input $0.48/1M tokens, output $0.03/1M tokens, 131K ctx
- Meta: Llama 3 70B Instruct: input $0.51/1M tokens, output $0.74/1M tokens, 8K ctx
- Meta: Llama 3 8B Instruct: input $0.04/1M tokens, output $0.04/1M tokens, 8K ctx
- Meta: Llama 3.1 70B Instruct: input $0.40/1M tokens, output $0.40/1M tokens, 131K ctx
- Meta: Llama 3.1 8B Instruct: input $0.02/1M tokens, output $0.03/1M tokens, 131K ctx
- Meta: Llama 3.2 11B Vision Instruct: input $0.24/1M tokens, output $0.24/1M tokens, 131K ctx
- Meta: Llama 3.2 1B Instruct: input $0.03/1M tokens, output $0.20/1M tokens, 131K ctx
- Meta: Llama 3.2 3B Instruct: input $0.05/1M tokens, output $0.34/1M tokens, 131K ctx
- Meta: Llama 3.2 3B Instruct (free): input $0.00/1M tokens, output $0.00/1M tokens, 131K ctx
- Meta: Llama 3.3 70B Instruct: input $0.10/1M tokens, output $0.32/1M tokens, 131K ctx
- Meta: Llama 3.3 70B Instruct (free): input $0.00/1M tokens, output $0.00/1M tokens, 131K ctx
- Meta: Llama 4 Maverick: input $0.15/1M tokens, output $0.60/1M tokens, 1M ctx
- Meta: Llama 4 Scout: input $0.08/1M tokens, output $0.30/1M tokens, 10M ctx
- Meta: Llama Guard 4 12B: input $0.18/1M tokens, output $0.18/1M tokens, 164K ctx

### microsoft
- Microsoft: Phi 4: input $0.07/1M tokens, output $0.14/1M tokens, 16K ctx
- Microsoft: Phi 4 Mini Instruct: input $0.08/1M tokens, output $0.35/1M tokens, 131K ctx
- WizardLM-2 8x22B: input $0.62/1M tokens, output $0.62/1M tokens, 66K ctx

### minimax
- MiniMax: MiniMax M1: input $0.40/1M tokens, output $2.20/1M tokens, 1M ctx
- MiniMax: MiniMax M2: input $0.26/1M tokens, output $1.00/1M tokens, 205K ctx
- MiniMax: MiniMax M2-her: input $0.30/1M tokens, output $1.20/1M tokens, 66K ctx
- MiniMax: MiniMax M2.1: input $0.29/1M tokens, output $0.95/1M tokens, 205K ctx
- MiniMax: MiniMax M2.5: input $0.15/1M tokens, output $1.15/1M tokens, 205K ctx
- MiniMax: MiniMax M2.7: input $0.28/1M tokens, output $1.20/1M tokens, 205K ctx
- MiniMax: MiniMax M3: input $0.30/1M tokens, output $1.20/1M tokens, 1M ctx
- MiniMax: MiniMax-01: input $0.20/1M tokens, output $1.10/1M tokens, 1M ctx

### mistralai
- Mistral Large: input $2.00/1M tokens, output $6.00/1M tokens, 128K ctx
- Mistral Large 2407: input $2.00/1M tokens, output $6.00/1M tokens, 131K ctx
- Mistral: Codestral 2508: input $0.30/1M tokens, output $0.90/1M tokens, 256K ctx
- Mistral: Devstral 2 2512: input $0.40/1M tokens, output $2.00/1M tokens, 262K ctx
- Mistral: Ministral 3 14B 2512: input $0.20/1M tokens, output $0.20/1M tokens, 262K ctx
- Mistral: Ministral 3 3B 2512: input $0.10/1M tokens, output $0.10/1M tokens, 131K ctx
- Mistral: Ministral 3 8B 2512: input $0.15/1M tokens, output $0.15/1M tokens, 262K ctx
- Mistral: Mistral Large 3 2512: input $0.50/1M tokens, output $1.50/1M tokens, 262K ctx
- Mistral: Mistral Medium 3: input $0.40/1M tokens, output $2.00/1M tokens, 131K ctx
- Mistral: Mistral Medium 3.1: input $0.40/1M tokens, output $2.00/1M tokens, 131K ctx
- Mistral: Mistral Medium 3.5: input $1.50/1M tokens, output $7.50/1M tokens, 262K ctx
- Mistral: Mistral Nemo: input $0.02/1M tokens, output $0.03/1M tokens, 131K ctx
- Mistral: Mistral Small 3: input $0.05/1M tokens, output $0.08/1M tokens, 33K ctx
- Mistral: Mistral Small 3.1 24B: input $0.35/1M tokens, output $0.56/1M tokens, 128K ctx
- Mistral: Mistral Small 3.2 24B: input $0.07/1M tokens, output $0.20/1M tokens, 128K ctx
- Mistral: Mistral Small 4: input $0.15/1M tokens, output $0.60/1M tokens, 262K ctx
- Mistral: Mixtral 8x22B Instruct: input $2.00/1M tokens, output $6.00/1M tokens, 66K ctx
- Mistral: Saba: input $0.20/1M tokens, output $0.60/1M tokens, 33K ctx
- Mistral: Voxtral Small 24B 2507: input $0.10/1M tokens, output $0.30/1M tokens, 32K ctx

### moonshotai
- MoonshotAI: Kimi K2 0711: input $0.57/1M tokens, output $2.30/1M tokens, 131K ctx
- MoonshotAI: Kimi K2 0905: input $0.60/1M tokens, output $2.50/1M tokens, 262K ctx
- MoonshotAI: Kimi K2 Thinking: input $0.60/1M tokens, output $2.50/1M tokens, 262K ctx
- MoonshotAI: Kimi K2.5: input $0.40/1M tokens, output $1.90/1M tokens, 262K ctx
- MoonshotAI: Kimi K2.6: input $0.68/1M tokens, output $3.42/1M tokens, 262K ctx
- MoonshotAI: Kimi K2.6 (free): input $0.00/1M tokens, output $0.00/1M tokens, 262K ctx

### morph
- Morph: Morph V3 Fast: input $0.80/1M tokens, output $1.20/1M tokens, 82K ctx
- Morph: Morph V3 Large: input $0.90/1M tokens, output $1.90/1M tokens, 262K ctx

### nex-agi
- Nex AGI: DeepSeek V3.1 Nex N1: input $0.14/1M tokens, output $0.50/1M tokens, 131K ctx

### nousresearch
- Nous: Hermes 3 405B Instruct: input $1.00/1M tokens, output $1.00/1M tokens, 131K ctx
- Nous: Hermes 3 405B Instruct (free): input $0.00/1M tokens, output $0.00/1M tokens, 131K ctx
- Nous: Hermes 3 70B Instruct: input $0.30/1M tokens, output $0.30/1M tokens, 131K ctx
- Nous: Hermes 4 405B: input $1.00/1M tokens, output $3.00/1M tokens, 131K ctx
- Nous: Hermes 4 70B: input $0.13/1M tokens, output $0.40/1M tokens, 131K ctx

### nvidia
- NVIDIA: Llama 3.3 Nemotron Super 49B V1.5: input $0.10/1M tokens, output $0.40/1M tokens, 131K ctx
- NVIDIA: Nemotron 3 Nano 30B A3B: input $0.05/1M tokens, output $0.20/1M tokens, 262K ctx
- NVIDIA: Nemotron 3 Nano 30B A3B (free): input $0.00/1M tokens, output $0.00/1M tokens, 256K ctx
- NVIDIA: Nemotron 3 Nano Omni (free): input $0.00/1M tokens, output $0.00/1M tokens, 256K ctx
- NVIDIA: Nemotron 3 Super: input $0.09/1M tokens, output $0.45/1M tokens, 1M ctx
- NVIDIA: Nemotron 3 Super (free): input $0.00/1M tokens, output $0.00/1M tokens, 1M ctx
- NVIDIA: Nemotron 3 Ultra: input $0.50/1M tokens, output $2.50/1M tokens, 1M ctx
- NVIDIA: Nemotron 3 Ultra (free): input $0.00/1M tokens, output $0.00/1M tokens, 1M ctx
- NVIDIA: Nemotron 3.5 Content Safety (free): input $0.00/1M tokens, output $0.00/1M tokens, 128K ctx
- NVIDIA: Nemotron Nano 12B 2 VL (free): input $0.00/1M tokens, output $0.00/1M tokens, 128K ctx
- NVIDIA: Nemotron Nano 9B V2: input $0.04/1M tokens, output $0.16/1M tokens, 131K ctx
- NVIDIA: Nemotron Nano 9B V2 (free): input $0.00/1M tokens, output $0.00/1M tokens, 128K ctx

### openai
- OpenAI: GPT Audio: input $2.50/1M tokens, output $10.00/1M tokens, 128K ctx
- OpenAI: GPT Audio Mini: input $0.60/1M tokens, output $2.40/1M tokens, 128K ctx
- OpenAI: GPT Chat Latest: input $5.00/1M tokens, output $30.00/1M tokens, 400K ctx
- OpenAI: GPT-3.5 Turbo: input $0.50/1M tokens, output $1.50/1M tokens, 16K ctx
- OpenAI: GPT-3.5 Turbo (older v0613): input $1.00/1M tokens, output $2.00/1M tokens, 4K ctx
- OpenAI: GPT-3.5 Turbo 16k: input $3.00/1M tokens, output $4.00/1M tokens, 16K ctx
- OpenAI: GPT-3.5 Turbo Instruct: input $1.50/1M tokens, output $2.00/1M tokens, 4K ctx
- OpenAI: GPT-4: input $30.00/1M tokens, output $60.00/1M tokens, 8K ctx
- OpenAI: GPT-4 Turbo: input $10.00/1M tokens, output $30.00/1M tokens, 128K ctx
- OpenAI: GPT-4 Turbo (older v1106): input $10.00/1M tokens, output $30.00/1M tokens, 128K ctx
- OpenAI: GPT-4 Turbo Preview: input $10.00/1M tokens, output $30.00/1M tokens, 128K ctx
- OpenAI: GPT-4.1: input $2.00/1M tokens, output $8.00/1M tokens, 1M ctx
- OpenAI: GPT-4.1 Mini: input $0.40/1M tokens, output $1.60/1M tokens, 1M ctx
- OpenAI: GPT-4.1 Nano: input $0.10/1M tokens, output $0.40/1M tokens, 1M ctx
- OpenAI: GPT-4o: input $2.50/1M tokens, output $10.00/1M tokens, 128K ctx
- OpenAI: GPT-4o (2024-05-13): input $5.00/1M tokens, output $15.00/1M tokens, 128K ctx
- OpenAI: GPT-4o (2024-08-06): input $2.50/1M tokens, output $10.00/1M tokens, 128K ctx
- OpenAI: GPT-4o (2024-11-20): input $2.50/1M tokens, output $10.00/1M tokens, 128K ctx
- OpenAI: GPT-4o Search Preview: input $2.50/1M tokens, output $10.00/1M tokens, 128K ctx
- OpenAI: GPT-4o-mini: input $0.15/1M tokens, output $0.60/1M tokens, 128K ctx
- OpenAI: GPT-4o-mini (2024-07-18): input $0.15/1M tokens, output $0.60/1M tokens, 128K ctx
- OpenAI: GPT-4o-mini Search Preview: input $0.15/1M tokens, output $0.60/1M tokens, 128K ctx
- OpenAI: GPT-5: input $1.25/1M tokens, output $10.00/1M tokens, 400K ctx
- OpenAI: GPT-5 Chat: input $1.25/1M tokens, output $10.00/1M tokens, 128K ctx
- OpenAI: GPT-5 Codex: input $1.25/1M tokens, output $10.00/1M tokens, 400K ctx
- OpenAI: GPT-5 Image: input $10.00/1M tokens, output $10.00/1M tokens, 400K ctx
- OpenAI: GPT-5 Image Mini: input $2.50/1M tokens, output $2.00/1M tokens, 400K ctx
- OpenAI: GPT-5 Mini: input $0.25/1M tokens, output $2.00/1M tokens, 400K ctx
- OpenAI: GPT-5 Nano: input $0.05/1M tokens, output $0.40/1M tokens, 400K ctx
- OpenAI: GPT-5 Pro: input $15.00/1M tokens, output $120.00/1M tokens, 400K ctx
- OpenAI: GPT-5.1: input $1.25/1M tokens, output $10.00/1M tokens, 400K ctx
- OpenAI: GPT-5.1 Chat: input $1.25/1M tokens, output $10.00/1M tokens, 128K ctx
- OpenAI: GPT-5.1-Codex: input $1.25/1M tokens, output $10.00/1M tokens, 400K ctx
- OpenAI: GPT-5.1-Codex-Max: input $1.25/1M tokens, output $10.00/1M tokens, 400K ctx
- OpenAI: GPT-5.1-Codex-Mini: input $0.25/1M tokens, output $2.00/1M tokens, 400K ctx
- OpenAI: GPT-5.2: input $1.75/1M tokens, output $14.00/1M tokens, 400K ctx
- OpenAI: GPT-5.2 Chat: input $1.75/1M tokens, output $14.00/1M tokens, 128K ctx
- OpenAI: GPT-5.2 Pro: input $21.00/1M tokens, output $168.00/1M tokens, 400K ctx
- OpenAI: GPT-5.2-Codex: input $1.75/1M tokens, output $14.00/1M tokens, 400K ctx
- OpenAI: GPT-5.3 Chat: input $1.75/1M tokens, output $14.00/1M tokens, 128K ctx
- OpenAI: GPT-5.3-Codex: input $1.75/1M tokens, output $14.00/1M tokens, 400K ctx
- OpenAI: GPT-5.4: input $2.50/1M tokens, output $15.00/1M tokens, 1M ctx
- OpenAI: GPT-5.4 Image 2: input $8.00/1M tokens, output $15.00/1M tokens, 272K ctx
- OpenAI: GPT-5.4 Mini: input $0.75/1M tokens, output $4.50/1M tokens, 400K ctx
- OpenAI: GPT-5.4 Nano: input $0.20/1M tokens, output $1.25/1M tokens, 400K ctx
- OpenAI: GPT-5.4 Pro: input $30.00/1M tokens, output $180.00/1M tokens, 1M ctx
- OpenAI: GPT-5.5: input $5.00/1M tokens, output $30.00/1M tokens, 1M ctx
- OpenAI: GPT-5.5 Pro: input $30.00/1M tokens, output $180.00/1M tokens, 1M ctx
- OpenAI: gpt-oss-120b: input $0.04/1M tokens, output $0.18/1M tokens, 131K ctx
- OpenAI: gpt-oss-120b (free): input $0.00/1M tokens, output $0.00/1M tokens, 131K ctx
- OpenAI: gpt-oss-20b: input $0.03/1M tokens, output $0.14/1M tokens, 131K ctx
- OpenAI: gpt-oss-20b (free): input $0.00/1M tokens, output $0.00/1M tokens, 131K ctx
- OpenAI: gpt-oss-safeguard-20b: input $0.07/1M tokens, output $0.30/1M tokens, 131K ctx
- OpenAI: o1: input $15.00/1M tokens, output $60.00/1M tokens, 200K ctx
- OpenAI: o1-pro: input $150.00/1M tokens, output $600.00/1M tokens, 200K ctx
- OpenAI: o3: input $2.00/1M tokens, output $8.00/1M tokens, 200K ctx
- OpenAI: o3 Deep Research: input $10.00/1M tokens, output $40.00/1M tokens, 200K ctx
- OpenAI: o3 Mini: input $1.10/1M tokens, output $4.40/1M tokens, 200K ctx
- OpenAI: o3 Mini High: input $1.10/1M tokens, output $4.40/1M tokens, 200K ctx
- OpenAI: o3 Pro: input $20.00/1M tokens, output $80.00/1M tokens, 200K ctx
- OpenAI: o4 Mini: input $1.10/1M tokens, output $4.40/1M tokens, 200K ctx
- OpenAI: o4 Mini Deep Research: input $2.00/1M tokens, output $8.00/1M tokens, 200K ctx
- OpenAI: o4 Mini High: input $1.10/1M tokens, output $4.40/1M tokens, 200K ctx

### openrouter
- Auto Router: input and output pricing variable (source lists a negative placeholder, not a real price), 2M ctx
- Body Builder (beta): input and output pricing variable (source lists a negative placeholder, not a real price), 128K ctx
- Free Models Router: input $0.00/1M tokens, output $0.00/1M tokens, 200K ctx
- OpenRouter: Fusion: input and output pricing variable (source lists a negative placeholder, not a real price), 128K ctx
- Owl Alpha: input $0.00/1M tokens, output $0.00/1M tokens, 1M ctx
- Pareto Code Router: input and output pricing variable (source lists a negative placeholder, not a real price), 2M ctx

### perceptron
- Perceptron: Perceptron Mk1: input $0.15/1M tokens, output $1.50/1M tokens, 33K ctx

### perplexity
- Perplexity: Sonar: input $1.00/1M tokens, output $1.00/1M tokens, 127K ctx
- Perplexity: Sonar Deep Research: input $2.00/1M tokens, output $8.00/1M tokens, 128K ctx
- Perplexity: Sonar Pro: input $3.00/1M tokens, output $15.00/1M tokens, 200K ctx
- Perplexity: Sonar Pro Search: input $3.00/1M tokens, output $15.00/1M tokens, 200K ctx
- Perplexity: Sonar Reasoning Pro: input $2.00/1M tokens, output $8.00/1M tokens, 128K ctx

### poolside
- Poolside: Laguna M.1 (free): input $0.00/1M tokens, output $0.00/1M tokens, 262K ctx
- Poolside: Laguna XS.2 (free): input $0.00/1M tokens, output $0.00/1M tokens, 262K ctx

### prime-intellect
- Prime Intellect: INTELLECT-3: input $0.20/1M tokens, output $1.10/1M tokens, 131K ctx

### qwen
- Qwen: Qwen Plus 0728: input $0.26/1M tokens, output $0.78/1M tokens, 1M ctx
- Qwen: Qwen Plus 0728 (thinking): input $0.26/1M tokens, output $0.78/1M tokens, 1M ctx
- Qwen: Qwen-Plus: input $0.26/1M tokens, output $0.78/1M tokens, 1M ctx
- Qwen: Qwen2.5 7B Instruct: input $0.04/1M tokens, output $0.10/1M tokens, 131K ctx
- Qwen: Qwen2.5 VL 72B Instruct: input $0.25/1M tokens, output $0.75/1M tokens, 131K ctx
- Qwen: Qwen3 14B: input $0.10/1M tokens, output $0.24/1M tokens, 132K ctx
- Qwen: Qwen3 235B A22B: input $0.46/1M tokens, output $1.82/1M tokens, 131K ctx
- Qwen: Qwen3 235B A22B Instruct 2507: input $0.07/1M tokens, output $0.10/1M tokens, 262K ctx
- Qwen: Qwen3 235B A22B Thinking 2507: input $0.10/1M tokens, output $0.10/1M tokens, 262K ctx
- Qwen: Qwen3 30B A3B: input $0.09/1M tokens, output $0.45/1M tokens, 131K ctx
- Qwen: Qwen3 30B A3B Instruct 2507: input $0.05/1M tokens, output $0.19/1M tokens, 131K ctx
- Qwen: Qwen3 30B A3B Thinking 2507: input $0.08/1M tokens, output $0.40/1M tokens, 131K ctx
- Qwen: Qwen3 32B: input $0.08/1M tokens, output $0.28/1M tokens, 131K ctx
- Qwen: Qwen3 8B: input $0.05/1M tokens, output $0.40/1M tokens, 131K ctx
- Qwen: Qwen3 Coder 30B A3B Instruct: input $0.07/1M tokens, output $0.27/1M tokens, 160K ctx
- Qwen: Qwen3 Coder 480B A35B: input $0.22/1M tokens, output $1.80/1M tokens, 1M ctx
- Qwen: Qwen3 Coder 480B A35B (free): input $0.00/1M tokens, output $0.00/1M tokens, 1M ctx
- Qwen: Qwen3 Coder Flash: input $0.20/1M tokens, output $0.97/1M tokens, 1M ctx
- Qwen: Qwen3 Coder Next: input $0.11/1M tokens, output $0.80/1M tokens, 262K ctx
- Qwen: Qwen3 Coder Plus: input $0.65/1M tokens, output $3.25/1M tokens, 1M ctx
- Qwen: Qwen3 Max: input $0.78/1M tokens, output $3.90/1M tokens, 262K ctx
- Qwen: Qwen3 Max Thinking: input $0.78/1M tokens, output $3.90/1M tokens, 262K ctx
- Qwen: Qwen3 Next 80B A3B Instruct: input $0.09/1M tokens, output $1.10/1M tokens, 262K ctx
- Qwen: Qwen3 Next 80B A3B Instruct (free): input $0.00/1M tokens, output $0.00/1M tokens, 262K ctx
- Qwen: Qwen3 Next 80B A3B Thinking: input $0.10/1M tokens, output $0.78/1M tokens, 262K ctx
- Qwen: Qwen3 VL 235B A22B Instruct: input $0.20/1M tokens, output $0.88/1M tokens, 262K ctx
- Qwen: Qwen3 VL 235B A22B Thinking: input $0.26/1M tokens, output $2.60/1M tokens, 131K ctx
- Qwen: Qwen3 VL 30B A3B Instruct: input $0.13/1M tokens, output $0.52/1M tokens, 262K ctx
- Qwen: Qwen3 VL 30B A3B Thinking: input $0.13/1M tokens, output $1.56/1M tokens, 131K ctx
- Qwen: Qwen3 VL 32B Instruct: input $0.10/1M tokens, output $0.42/1M tokens, 262K ctx
- Qwen: Qwen3 VL 8B Instruct: input $0.08/1M tokens, output $0.50/1M tokens, 256K ctx
- Qwen: Qwen3 VL 8B Thinking: input $0.12/1M tokens, output $1.36/1M tokens, 256K ctx
- Qwen: Qwen3.5 397B A17B: input $0.39/1M tokens, output $2.34/1M tokens, 262K ctx
- Qwen: Qwen3.5 Plus 2026-02-15: input $0.26/1M tokens, output $1.56/1M tokens, 1M ctx
- Qwen: Qwen3.5 Plus 2026-04-20: input $0.30/1M tokens, output $1.80/1M tokens, 1M ctx
- Qwen: Qwen3.5-122B-A10B: input $0.26/1M tokens, output $2.08/1M tokens, 262K ctx
- Qwen: Qwen3.5-27B: input $0.20/1M tokens, output $1.56/1M tokens, 262K ctx
- Qwen: Qwen3.5-35B-A3B: input $0.14/1M tokens, output $1.00/1M tokens, 262K ctx
- Qwen: Qwen3.5-9B: input $0.04/1M tokens, output $0.15/1M tokens, 262K ctx
- Qwen: Qwen3.5-Flash: input $0.07/1M tokens, output $0.26/1M tokens, 1M ctx
- Qwen: Qwen3.6 27B: input $0.29/1M tokens, output $3.20/1M tokens, 262K ctx
- Qwen: Qwen3.6 35B A3B: input $0.14/1M tokens, output $1.00/1M tokens, 262K ctx
- Qwen: Qwen3.6 Flash: input $0.19/1M tokens, output $1.13/1M tokens, 1M ctx
- Qwen: Qwen3.6 Max Preview: input $1.04/1M tokens, output $6.24/1M tokens, 262K ctx
- Qwen: Qwen3.6 Plus: input $0.33/1M tokens, output $1.95/1M tokens, 1M ctx
- Qwen: Qwen3.7 Max: input $1.25/1M tokens, output $3.75/1M tokens, 1M ctx
- Qwen: Qwen3.7 Plus: input $0.40/1M tokens, output $1.60/1M tokens, 1M ctx
- Qwen2.5 72B Instruct: input $0.36/1M tokens, output $0.40/1M tokens, 131K ctx
- Qwen2.5 Coder 32B Instruct: input $0.66/1M tokens, output $1.00/1M tokens, 128K ctx

### rekaai
- Reka Edge: input $0.10/1M tokens, output $0.10/1M tokens, 16K ctx
- Reka Flash 3: input $0.10/1M tokens, output $0.20/1M tokens, 66K ctx

### relace
- Relace: Relace Apply 3: input $0.85/1M tokens, output $1.25/1M tokens, 256K ctx
- Relace: Relace Search: input $1.00/1M tokens, output $3.00/1M tokens, 256K ctx

### sao10k
- Sao10K: Llama 3 8B Lunaris: input $0.04/1M tokens, output $0.05/1M tokens, 8K ctx
- Sao10K: Llama 3.1 70B Hanami x1: input $3.00/1M tokens, output $3.00/1M tokens, 16K ctx
- Sao10K: Llama 3.1 Euryale 70B v2.2: input $0.85/1M tokens, output $0.85/1M tokens, 131K ctx
- Sao10K: Llama 3.3 Euryale 70B: input $0.65/1M tokens, output $0.75/1M tokens, 131K ctx

### stepfun
- StepFun: Step 3.5 Flash: input $0.09/1M tokens, output $0.30/1M tokens, 262K ctx
- StepFun: Step 3.7 Flash: input $0.20/1M tokens, output $1.15/1M tokens, 256K ctx

### switchpoint
- Switchpoint Router: input $0.85/1M tokens, output $3.40/1M tokens, 131K ctx

### tencent
- Tencent: Hunyuan A13B Instruct: input $0.14/1M tokens, output $0.57/1M tokens, 131K ctx
- Tencent: Hy3 preview: input $0.06/1M tokens, output $0.21/1M tokens, 262K ctx

### thedrummer
- TheDrummer: Cydonia 24B V4.1: input $0.30/1M tokens, output $0.50/1M tokens, 131K ctx
- TheDrummer: Rocinante 12B: input $0.17/1M tokens, output $0.43/1M tokens, 33K ctx
- TheDrummer: Skyfall 36B V2: input $0.55/1M tokens, output $0.80/1M tokens, 33K ctx
- TheDrummer: UnslopNemo 12B: input $0.40/1M tokens, output $0.40/1M tokens, 33K ctx

### undi95
- ReMM SLERP 13B: input $0.45/1M tokens, output $0.65/1M tokens, 6K ctx

### upstage
- Upstage: Solar Pro 3: input $0.15/1M tokens, output $0.60/1M tokens, 128K ctx

### writer
- Writer: Palmyra X5: input $0.60/1M tokens, output $6.00/1M tokens, 1M ctx

### x-ai
- xAI: Grok 4.20: input $1.25/1M tokens, output $2.50/1M tokens, 2M ctx
- xAI: Grok 4.20 Multi-Agent: input $2.00/1M tokens, output $6.00/1M tokens, 2M ctx
- xAI: Grok 4.3: input $1.25/1M tokens, output $2.50/1M tokens, 1M ctx
- xAI: Grok Build 0.1: input $1.00/1M tokens, output $2.00/1M tokens, 256K ctx

### xiaomi
- Xiaomi: MiMo-V2-Flash: input $0.10/1M tokens, output $0.30/1M tokens, 262K ctx
- Xiaomi: MiMo-V2.5: input $0.14/1M tokens, output $0.28/1M tokens, 1M ctx
- Xiaomi: MiMo-V2.5-Pro: input $0.43/1M tokens, output $0.87/1M tokens, 1M ctx

### z-ai
- Z.ai: GLM 4 32B: input $0.10/1M tokens, output $0.10/1M tokens, 128K ctx
- Z.ai: GLM 4.5: input $0.60/1M tokens, output $2.20/1M tokens, 131K ctx
- Z.ai: GLM 4.5 Air: input $0.13/1M tokens, output $0.85/1M tokens, 131K ctx
- Z.ai: GLM 4.5 Air (free): input $0.00/1M tokens, output $0.00/1M tokens, 131K ctx
- Z.ai: GLM 4.5V: input $0.60/1M tokens, output $1.80/1M tokens, 66K ctx
- Z.ai: GLM 4.6: input $0.43/1M tokens, output $1.74/1M tokens, 203K ctx
- Z.ai: GLM 4.6V: input $0.30/1M tokens, output $0.90/1M tokens, 131K ctx
- Z.ai: GLM 4.7: input $0.40/1M tokens, output $1.75/1M tokens, 203K ctx
- Z.ai: GLM 4.7 Flash: input $0.06/1M tokens, output $0.40/1M tokens, 203K ctx
- Z.ai: GLM 5: input $0.60/1M tokens, output $1.92/1M tokens, 203K ctx
- Z.ai: GLM 5 Turbo: input $1.20/1M tokens, output $4.00/1M tokens, 203K ctx
- Z.ai: GLM 5.1: input $0.98/1M tokens, output $3.08/1M tokens, 203K ctx
- Z.ai: GLM 5V Turbo: input $1.20/1M tokens, output $4.00/1M tokens, 203K ctx

---

## AI Subscription Plans

### ChatGPT
Provider: openai
- Free: $0/mo
- Go: $8/mo
- Plus: $20/mo
- Pro: $200/mo
- Business: $25/mo (or $20/mo annual)
- Enterprise: Contact Sales

### Claude.ai
Provider: anthropic
- Free: $0/mo
- Pro: $20/mo (or $17/mo annual)
- Max 5×: $100/mo
- Max 20×: $200/mo
- Team: $30/mo (or $25/mo annual)
- Enterprise: Contact Sales

### Google AI
Provider: google
- Free: $0/mo
- AI Plus: $7.99/mo
- AI Pro: $19.99/mo
- AI Ultra: $249.99/mo

### Grok
Provider: x-ai
- Free: $0/mo
- X Premium: $8/mo
- X Premium+: $40/mo
- SuperGrok: $30/mo (or $25/mo annual)
- SuperGrok Heavy: $300/mo

### Perplexity
Provider: perplexityai
- Free: $0/mo
- Pro: $20/mo (or $16.67/mo annual)
- Max: $200/mo (or $166.67/mo annual)
- Enterprise Pro: $40/mo
- Enterprise Max: $325/mo (or $270.83/mo annual)

### Le Chat
Provider: mistralai
- Free: $0/mo
- Student: $7.04/mo
- Pro: $14.99/mo (or $6.99/mo annual)
- Team: $24.99/mo (or $19.99/mo annual)
- Enterprise: Contact Sales

### Kimi
Provider: moonshotai
- Adagio: $0/mo
- Andante: $19/mo (or $16/mo annual)
- Moderato: $39/mo (or $33/mo annual)
- Vivace: $199/mo

### ERNIE Bot
Provider: baidu
- Free: $0/mo

### Doubao
Provider: bytedance
- Free: $0/mo
- Pro: $9.99/mo (or $8.33/mo annual)

### Z.ai (GLM)
Provider: zhipuai
- Free: $0/mo
- Consumer: $4.41/mo (or $3.59/mo annual)
- Coding Plan: $10/mo

### Cursor
Provider: cursor
- Hobby: $0/mo
- Pro: $20/mo (or $16/mo annual)
- Pro+: $60/mo
- Ultra: $200/mo
- Teams: $40/mo (or $32/mo annual)
- Enterprise: Contact Sales

### GitHub Copilot
Provider: microsoft
- Free: $0/mo
- Pro: $10/mo (or $8.33/mo annual)
- Pro+: $39/mo (or $32.50/mo annual)
- Business: $19/mo
- Enterprise: $39/mo

### Windsurf
Provider: windsurf
- Free: $0/mo
- Pro: $20/mo
- Max: $200/mo
- Teams: $40/mo
- Enterprise: $60/mo

### Claude Code
Provider: anthropic
- API Usage: Contact Sales

### Hailuo AI
Provider: minimax
- Free: $0/mo
- Standard: $9.99/mo
- Pro: $34.99/mo
- Ultra: $124.99/mo
- Max: $199.99/mo

### Microsoft Copilot
Provider: microsoft
- Free: $0/mo
- M365 Premium: $19.99/mo
- M365 Business: $21/mo (or $18/mo annual)

---

## Notes
- Input tokens = text you send to the model
- Output tokens = text the model generates (usually 2–5× more expensive than input tokens)
- Context window = maximum total tokens (input + output) per request
- Prices may vary by region, tier, or negotiated enterprise agreement
- Free models may have rate limits or restricted access
- Source: https://token.app/ — updated hourly by Measurable AI (https://measurable.ai/)
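
The unit conventions in this file (USD per 1M tokens, context in K or M tokens) can be turned into a rough per-request estimate. The following is a minimal sketch, assuming an entry has the shape `- Name: input $X/1M tokens, output $Y/1M tokens, N ctx`. The names `parse_price_line`, `request_cost`, and `ModelPrice` are hypothetical helpers for illustration, not part of any token.app API, and the estimate ignores caching discounts, tiers, and regional pricing.

```python
import re
from dataclasses import dataclass
from decimal import Decimal
from typing import Optional

# Matches one pricing bullet. Entries without dollar prices (for example
# routers with variable pricing) and headings do not match.
PRICE_LINE = re.compile(
    r"^- (?P<name>.+): input \$(?P<inp>\d+(?:\.\d+)?)/1M tokens, "
    r"output \$(?P<out>\d+(?:\.\d+)?)/1M tokens, "
    r"(?P<ctx>\d+(?:\.\d+)?)(?P<unit>[KM]) ctx$"
)

TOKENS_PER_UNIT = {"K": 1_000, "M": 1_000_000}
ONE_MILLION = Decimal(1_000_000)


@dataclass(frozen=True)
class ModelPrice:
    """Hypothetical record for one parsed pricing entry."""
    name: str
    input_per_1m: Decimal   # USD per 1M input tokens
    output_per_1m: Decimal  # USD per 1M output tokens
    context_tokens: int     # context window in tokens


def parse_price_line(line: str) -> Optional[ModelPrice]:
    """Parse one bullet line; return None if it is not a priced entry."""
    match = PRICE_LINE.match(line.strip())
    if match is None:
        return None
    context = int(Decimal(match["ctx"]) * TOKENS_PER_UNIT[match["unit"]])
    return ModelPrice(
        name=match["name"],
        input_per_1m=Decimal(match["inp"]),
        output_per_1m=Decimal(match["out"]),
        context_tokens=context,
    )


def request_cost(price: ModelPrice, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from per-1M-token list prices."""
    if input_tokens + output_tokens > price.context_tokens:
        raise ValueError("request exceeds the model's context window")
    return (
        price.input_per_1m * input_tokens / ONE_MILLION
        + price.output_per_1m * output_tokens / ONE_MILLION
    )


if __name__ == "__main__":
    entry = parse_price_line(
        "- Anthropic: Claude Sonnet 4.5: input $3.00/1M tokens, "
        "output $15.00/1M tokens, 1M ctx"
    )
    if entry is not None:
        # 10,000 input tokens and 2,000 output tokens
        print(entry.name, request_cost(entry, 10_000, 2_000))
```

`Decimal` is used instead of `float` so that small per-request amounts do not pick up binary rounding noise; for exact billing, the provider's own documentation remains the reference.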