Developer model dashboard
LiteLLM: live | OpenRouter: live | Official catalog: stale | Benchmark seed: stale

Compare AI inference models for coding tools, agents, and API usage

Read this as a triage board: start with a ranking, check the data confidence badges, then open a row before choosing a model. Scores are source-backed when available and explicitly marked when stale, estimated, or missing.

Models shown: 1675 (0 active filters)
Public prices: 1598/1675 (needed for value ranking)
Quality scores: 199/1675 (benchmark coverage)
Median blend: $1.05 (20% input / 80% output)

Best first sort: Overall value. Default shortlist when you have price and benchmark coverage.

Watch the badges: 1675 incomplete rows. Missing fields stay visible so you can avoid false precision.

Current blend: 20% input, 80% output. Move the slider when your workload is prompt-heavy or output-heavy.
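The blend can be read as a weighted average of the input and output prices. A minimal sketch, assuming a hypothetical `blended_price` helper and an `input_share` parameter standing in for the slider position (neither name comes from the dashboard); the default 20% / 80% split reproduces the blended price shown for DeepSeek V4 Flash in the model table:

```python
def blended_price(input_price: float, output_price: float,
                  input_share: float = 0.20) -> float:
    """Weighted average of input and output prices ($ per 1M tokens).

    input_share is the fraction of the blend given to input tokens;
    the remainder goes to output tokens. Hypothetical helper for illustration.
    """
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    return input_share * input_price + (1.0 - input_share) * output_price


# DeepSeek V4 Flash: $0.098 input, $0.196 output, default 20/80 blend.
print(round(blended_price(0.098, 0.196), 4))  # 0.1764

# The same model under a prompt-heavy 80/20 blend.
print(round(blended_price(0.098, 0.196, input_share=0.80), 4))  # 0.1176
```

Because output tokens carry most of the weight by default, moving the slider toward input lowers the blended price of any model whose output price is higher than its input price.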

Ranking shortlist

Default shortlist when you have price and benchmark coverage.

Primary metric: Value

Price vs quality

Best value usually lives high and left: better public quality with lower blended price.
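One way to make "high and left" concrete is a Pareto filter: keep a model only if no cheaper or equally priced model matches or beats its quality. This is a sketch of that idea, not the dashboard's own ranking logic; `pareto_frontier` and the tuple layout are hypothetical, and the sample values are five of the points shown on this chart.

```python
from typing import List, Tuple

# (model name, blended $/1M, quality); layout is an assumption for this sketch.
Point = Tuple[str, float, float]


def pareto_frontier(points: List[Point]) -> List[Point]:
    """Keep points whose quality is strictly higher than that of every
    cheaper or equally priced point."""
    frontier: List[Point] = []
    best_quality = float("-inf")
    # Walk from cheapest to most expensive; at equal price, best quality first.
    for point in sorted(points, key=lambda p: (p[1], -p[2])):
        if point[2] > best_quality:
            frontier.append(point)
            best_quality = point[2]
    return frontier


sample = [
    ("Claude Fable 5", 42.0, 69.4),
    ("GPT 4o Mini", 0.6, 68.8),
    ("Codestral 2501", 0.5, 65.3),
    ("Nano Banana 2", 2.5, 65.2),
    ("GPT-5.5", 25.0, 62.1),
]
for name, price, quality in pareto_frontier(sample):
    print(f"{name}: ${price}/1M, quality {quality}")
# Codestral 2501: $0.5/1M, quality 65.3
# GPT 4o Mini: $0.6/1M, quality 68.8
# Claude Fable 5: $42.0/1M, quality 69.4
```

Models off this frontier are not necessarily bad choices; they are just matched on both price and quality by something else in the sample.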

Click a dot to inspect that model.

[Scatter plot: 120 points, scaled axes. Axes: Blended $/1M (scaled), 0 to 63; Quality, 0 to 69.4.]

Labeled points:
1. Claude Fable 5 (vertex_ai-anthropic_models): Blended $/1M 42, Quality 69.4
2. GPT 4o Mini (openai): Blended $/1M 0.6, Quality 68.8
3. Codestral 2501 (mistral): Blended $/1M 0.5, Quality 65.3
4. Nano Banana 2 (Gemini 3.1 Flash Image Preview) (google): Blended $/1M 2.5, Quality 65.2
5. Nano Banana Pro (Gemini 3 Pro Image Preview) (google): Blended $/1M 10, Quality 64
6. Claude Opus 4.8 (vertex_ai-anthropic_models): Blended $/1M 21, Quality 63.7
7. GPT-5.5 (openai): Blended $/1M 25, Quality 62.1
8. Claude Opus 4 7 (vertex_ai-anthropic_models): Blended $/1M 21, Quality 61.1

Axes are scaled because a few outliers would otherwise compress the useful cluster. Legend values remain the original public values.
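The dashboard does not say which scaling it applies. As an illustration only, the sketch below uses a square-root scale, one common way to keep a few expensive outliers from flattening the cheap cluster; `sqrt_scale` is a hypothetical helper, not the page's actual transform.

```python
import math


def sqrt_scale(value: float, max_value: float) -> float:
    """Map value from [0, max_value] to an axis position in [0, 1]
    using a square-root scale (an assumed choice for illustration)."""
    if max_value <= 0:
        raise ValueError("max_value must be positive")
    clamped = min(max(value, 0.0), max_value)
    return math.sqrt(clamped / max_value)


# Blended prices on an axis that runs from 0 to 63.
for price in (0.5, 21.0, 63.0):
    linear = price / 63.0
    print(f"${price}: linear {linear:.3f}, sqrt {sqrt_scale(price, 63.0):.3f}")
```

On a linear axis a $0.50 model sits under 1% of the way along; the square-root scale moves it to roughly 9%, which spreads the low-price cluster while leaving the endpoints fixed.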

Context window vs price

Look for models far right and low: more context without a large price jump.

Click a dot to inspect that model.

[Scatter plot: 120 points, scaled axes. Axes: Context tokens (scaled), 0 to 2M; Blended $/1M (scaled), 0 to 29.6K.]

Labeled points:
1. Phi 4 Mini Instruct (microsoft): Context tokens 131.1K, Blended $/1M 29.6K
2. Jais 30b Chat (azure_ai): Context tokens 8.2K, Blended $/1M 8.4K
3. Jais 13b Chat (watsonx): Context tokens 8.2K, Blended $/1M 1.7K
4. Mt0 Xxl 13b (watsonx): Context tokens 8.2K, Blended $/1M 1.7K
5. O1 Pro (openai): Context tokens 200K, Blended $/1M 510
6. O1 Pro 2025 03 19 (openai): Context tokens 200K, Blended $/1M 510
7. GPT 5.4 Pro (azure_ai): Context tokens 1.1M, Blended $/1M 150
8. GPT 5.4 Pro 2026 03 05 (azure_ai): Context tokens 1.1M, Blended $/1M 150

Axes are scaled because a few outliers would otherwise compress the useful cluster. Legend values remain the original public values.

Output price comparison

Selected table rows. Use this before choosing chatty coding or agent models.

Click a bar to open model details.

Value score ranking

Overall value: quality divided by your current blended price.
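A minimal sketch of that ratio, assuming a hypothetical `value_score` helper. The `price_floor` of $0.01 is an assumption added so zero-priced models stay finite; the dashboard does not state how it treats free rows. Rows with a missing price or quality return `None` rather than a guessed score.

```python
from typing import Optional


def value_score(quality: Optional[float], blended_price: Optional[float],
                price_floor: float = 0.01) -> Optional[float]:
    """Quality divided by blended price ($ per 1M tokens).

    Returns None when either input is missing. price_floor is an assumed
    guard against division by zero for free models.
    """
    if quality is None or blended_price is None:
        return None
    return quality / max(blended_price, price_floor)


# Codestral 2501: quality 65.3, blended $0.52.
print(round(value_score(65.3, 0.52), 1))  # 125.6

# GPT 4o Mini: quality 68.8, blended $0.561.
print(round(value_score(68.8, 0.561), 1))  # 122.6

# A model with no public price keeps a visible gap instead of a score.
print(value_score(50.0, None))  # None
```

Both results agree with the value scores listed for those models in the model table. Because the score is a ratio, very cheap models can outrank much stronger ones, so it is worth checking the quality column before acting on the ranking.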

Click a bar to open model details.

Model table

Sorted by value score descending. Click a row for source values and score math; select up to 8 rows for charts.

1675 rows | 1598/1675 priced | 199/1675 scored
PickFeaturesSources
Nemotron 3 Ultra (free)
nvidia/nemotron-3-ultra-550b-a55b:free
3 gapsBenchmark: live
nvidia$0.00$0.00$0.001M65.5K
Reason: YesTools: YesVision: NoCache: Unknown
37.647.7No public latency4,750
OpenRouter
Nemotron 3 Super (free)
nvidia/nemotron-3-super-120b-a12b:free
3 gapsBenchmark: live
nvidia$0.00$0.00$0.001M262.1K
Reason: YesTools: YesVision: NoCache: Unknown
31.236No public latency3,580
OpenRouter
Nemotron 3 Nano 30B A3B (free)
openrouter/openrouter/free
2 gapsBenchmark: live
nousresearch$0.00$0.00$0.001M262.1K
Reason: YesTools: YesVision: YesCache: Unknown
22.431.2No public latency3,040
LiteLLMOpenRouter
Nemotron 3 Nano Omni (free)
nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free
3 gapsBenchmark: live
nvidia$0.00$0.00$0.00256K65.5K
Reason: YesTools: YesVision: YesCache: Unknown
14.821.4No public latency2,000
OpenRouter
Qwen3 Next 80B A3B Instruct (free)
qwen/qwen3-next-80b-a3b-instruct:free
4 gapsBenchmark: live
qwen$0.00$0.00$0.00262.1KNo public limit
Reason: NoTools: YesVision: NoCache: Unknown
15.320.1No public latency1,650
OpenRouter
Ling-2.6-flash
inclusionai/ling-2.6-flash
2 gapsBenchmark: live
inclusionai$0.01$0.03$0.026262.1K32.8K
Reason: NoTools: YesVision: NoCache: Yes
23.226.2No public latency1,123.1
OpenRouter
Olmo 3 32B Think
publicai/allenai/Olmo-3-32B-Think
3 gapsBenchmark: live
publicai$0.00$0.00$0.0065.5K65.5K
Reason: YesTools: YesVision: NoCache: Unknown
10.512.1No public latency750
LiteLLMOpenRouter
Qwen3 235B A22B Instruct 2507
openrouter/qwen/qwen3-235b-a22b-2507
2 gapsBenchmark: live
openrouter$0.071$0.10$0.0942262.1K262.1K
Reason: NoTools: YesVision: NoCache: Unknown
22.125No public latency296.2
LiteLLMOpenRouter
DeepSeek V4 Flash
deepseek/deepseek-v4-flash
2 gapsBenchmark: live
deepseek$0.098$0.196$0.17641MNo public limit
Reason: YesTools: YesVision: NoCache: Yes
38.746.5No public latency278.3
OpenRouter
Hy3 preview
tencent/hy3-preview
3 gapsBenchmark: live
tencent$0.063$0.21$0.1806262.1KNo public limit
Reason: YesTools: YesVision: NoCache: Yes
36.541.9No public latency247.5
OpenRouter
Qwen3.5-9B
qwen/qwen3.5-9b
3 gapsBenchmark: live
qwen$0.10$0.15$0.14262.1K262.1K
Reason: YesTools: YesVision: YesCache: Unknown
25.332.4No public latency226.4
OpenRouter
GPT Oss 20b
deepinfra/openai/gpt-oss-20b
1 gapsBenchmark: live
together_ai$0.04$0.15$0.128131.1K131.1K
Reason: YesTools: YesVision: YesCache: Yes
18.524.5No public latency203.9
LiteLLMOpenRouter
Llama 3.1 8B Instruct
lambda_ai/llama3.1-8b-instruct
3 gapsBenchmark: live
perplexity$0.025$0.04$0.037131.1K131.1K
Reason: NoTools: YesVision: NoCache: Unknown
4.911.8No public latency200
LiteLLMOpenRouter
Mimo V2 Flash
openrouter/xiaomi/mimo-v2-flash
1 gapsBenchmark: live
openrouter$0.10$0.30$0.26262.1K65.5K
Reason: YesTools: YesVision: NoCache: Yes
33.541.5No public latency163.1
LiteLLMOpenRouter
Ministral 3 3B 2512
openrouter/mistralai/ministral-3b-2512
1 gapsBenchmark: live
openrouter$0.10$0.10$0.10131.1K131.1K
Reason: NoTools: YesVision: YesCache: Yes
4.811.2No public latency159
LiteLLMOpenRouter
Step 3.5 Flash
stepfun/step-3.5-flash
2 gapsBenchmark: live
stepfun$0.09$0.30$0.258262.1K16.4K
Reason: YesTools: YesVision: NoCache: Yes
34.638.5No public latency156.6
OpenRouter
Mistral Small 3.2 24b Instruct
openrouter/mistralai/mistral-small-3.2-24b-instruct
5 gapsBenchmark: live
openrouter$0.10$0.30$0.26128K128K
Reason: NoTools: YesVision: YesCache: Unknown
No scoreNo scoreNo public latency155
LiteLLMOpenRouter
Ministral 3 8B 2512
mistral/ministral-8b-2512
1 gapsBenchmark: live
openrouter$0.15$0.15$0.15262.1K262.1K
Reason: NoTools: YesVision: YesCache: Yes
1014.8No public latency140.7
LiteLLMOpenRouter
Gemma 4 31B
google/gemma-4-31b-it
2 gapsBenchmark: live
google$0.12$0.35$0.304262.1K262.1K
Reason: YesTools: YesVision: YesCache: Yes
38.739.2No public latency130.3
OpenRouter
Qwen3 Coder 30b A3b Instruct
novita/qwen/qwen3-coder-30b-a3b-instruct
2 gapsBenchmark: live
novita$0.07$0.27$0.23160K32.8K
Reason: NoTools: YesVision: NoCache: Unknown
19.420No public latency126.1
LiteLLMOpenRouter
Codestral 2501
vertex_ai/codestral-2501
5 gapsBenchmark: stale estimate
mistral$0.20$0.60$0.52128K128K
Reason: UnknownTools: YesVision: UnknownCache: Unknown
7061No public latency125.6
LiteLLMBenchmark Seed
GPT 4o Mini
azure/gpt-4o-mini
1 gapsBenchmark: stale estimate
openai$0.165$0.66$0.561131.1K16.4K
Reason: NoTools: YesVision: YesCache: Yes
6863No public latency122.6
LiteLLMOpenRouterBenchmark Seed
Nemotron 3 Nano 30B A3B
nvidia/nemotron-3-nano-30b-a3b
3 gapsBenchmark: live
nvidia$0.05$0.20$0.17262.1K228K
Reason: YesTools: YesVision: NoCache: Unknown
1924.3No public latency122.4
OpenRouter
Deepseek Chat
deepseek-chat
4 gapsBenchmark: live
openrouter$0.28$0.42$0.392131.1K16K
Reason: NoTools: YesVision: NoCache: Yes
No scoreNo scoreNo public latency121.9
LiteLLMOpenRouter
Zai.Glm 4.7 Flash
openrouter/z-ai/glm-4.7-flash
1 gapsBenchmark: live
bedrock_converse$0.07$0.40$0.334202.8K128K
Reason: YesTools: YesVision: YesCache: Yes
25.930.1No public latency114.4
LiteLLMOpenRouter
Granite 4.1 8B
ibm-granite/granite-4.1-8b
2 gapsBenchmark: live
ibm-granite$0.05$0.10$0.09131.1K131.1K
Reason: NoTools: YesVision: NoCache: Yes
7.312.4No public latency112.2
OpenRouter
Ministral 3 14B 2512
openrouter/mistralai/ministral-14b-2512
1 gapsBenchmark: live
openrouter$0.20$0.20$0.20262.1K262.1K
Reason: NoTools: YesVision: YesCache: Yes
10.916No public latency109
LiteLLMOpenRouter
Gemma 4 26B A4B
google/gemma-4-26b-a4b-it
4 gapsBenchmark: live
google$0.06$0.33$0.276262.1KNo public limit
Reason: YesTools: YesVision: YesCache: Unknown
22.431.2No public latency103.6
OpenRouter
Deepseek V3.2 Exp
openrouter/deepseek/deepseek-v3.2-exp
1 gapsBenchmark: live
openrouter$0.20$0.40$0.36163.8K163.8K
Reason: YesTools: YesVision: NoCache: Yes
33.332.9No public latency101.9
LiteLLMOpenRouter
Qwen3 8b
llamagate/qwen3-8b
2 gapsBenchmark: live
llamagate$0.04$0.14$0.12131.1K8.2K
Reason: YesTools: YesVision: NoCache: Yes
913.2No public latency96.7
LiteLLMOpenRouter
Nemotron 3 Super
nvidia/nemotron-3-super-120b-a12b
4 gapsBenchmark: live
nvidia$0.09$0.45$0.3781MNo public limit
Reason: YesTools: YesVision: NoCache: Unknown
31.236No public latency94.7
OpenRouter
GPT 5 Nano
azure/gpt-5-nano
1 gapsBenchmark: live
openrouter$0.05$0.40$0.33400K128K
Reason: YesTools: YesVision: YesCache: Yes
20.326.8No public latency88.5
LiteLLMOpenRouter
Deepseek Chat V3 0324
openrouter/deepseek/deepseek-chat-v3-0324
2 gapsBenchmark: live
openrouter$0.14$0.28$0.252131.1K16.4K
Reason: NoTools: YesVision: NoCache: Yes
2222.3No public latency80.2
LiteLLMOpenRouter
Ring-2.6-1T
inclusionai/ring-2.6-1t
2 gapsBenchmark: live
inclusionai$0.075$0.625$0.515262.1K65.5K
Reason: YesTools: YesVision: NoCache: Yes
33.338.5No public latency79.8
OpenRouter
Qwen3 30b A3b
dashscope/qwen3-30b-a3b
2 gapsBenchmark: live
dashscope$0.08$0.29$0.248131.1K41K
Reason: YesTools: YesVision: NoCache: Unknown
1115.3No public latency77
LiteLLMOpenRouter
Ling-2.6-1T
inclusionai/ling-2.6-1t
2 gapsBenchmark: live
inclusionai$0.075$0.625$0.515262.1K32.8K
Reason: NoTools: YesVision: NoCache: Yes
33.133.6No public latency74.4
OpenRouter
Qwen3 30B A3B Instruct 2507
qwen/qwen3-30b-a3b-instruct-2507
3 gapsBenchmark: live
qwen$0.0482$0.1931$0.1641131.1K32K
Reason: NoTools: YesVision: NoCache: Unknown
14.215No public latency73.8
OpenRouter
Qwen3 14B
deepinfra/Qwen/Qwen3-14B
3 gapsBenchmark: live
deepinfra$0.06$0.24$0.204131.7K41K
Reason: YesTools: YesVision: NoCache: Unknown
13.116.2No public latency71.6
LiteLLMOpenRouter
Gemini 2.5 Flash Lite Preview 09 2025
gemini-2.5-flash-lite-preview-09-2025
1 gapsBenchmark: live
vertex_ai-language-models$0.10$0.40$0.341M65.5K
Reason: YesTools: YesVision: YesCache: Yes
18.221.6No public latency70.3
LiteLLMOpenRouter
Gemma 3 12b It
deepinfra/google/gemma-3-12b-it
3 gapsBenchmark: live
deepinfra$0.05$0.10$0.09131.1K131.1K
Reason: NoTools: YesVision: YesCache: Unknown
6.38.8No public latency68.9
LiteLLMOpenRouter
DeepSeek V4 Pro
deepseek/deepseek-v4-pro
1 gapsBenchmark: live
deepseek$0.435$0.87$0.7831M384K
Reason: YesTools: YesVision: NoCache: Yes
47.551.5No public latency68.2
OpenRouter
GPT Oss 120b
azure_ai/gpt-oss-120b
1 gapsBenchmark: live
together_ai$0.15$0.60$0.51131.1K131.1K
Reason: YesTools: YesVision: YesCache: Yes
28.633.3No public latency66.7
LiteLLMOpenRouter
Qwen3 30B A3B Thinking 2507
qwen/qwen3-30b-a3b-thinking-2507
1 gapsBenchmark: live
qwen$0.08$0.40$0.336131.1K131.1K
Reason: YesTools: YesVision: NoCache: Yes
14.622.4No public latency66.4
OpenRouter
Granite 4.0 Micro
ibm-granite/granite-4.0-h-micro
3 gapsBenchmark: live
ibm-granite$0.017$0.112$0.093131K131K
Reason: NoTools: NoVision: NoCache: Unknown
57.7No public latency60.2
OpenRouter
Deepseek R1 0528
lambda_ai/deepseek-r1-0528
1 gap | Benchmark: live
openrouter | $0.20 | $0.60 | $0.52 | 163.8K | 131.1K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
24 | 27.1 | No public latency | 60.2
LiteLLM | OpenRouter
LFM2-24B-A2B
liquid/lfm-2-24b-a2b
4 gaps | Benchmark: live
liquid | $0.03 | $0.12 | $0.102 | 128K | No public limit
Reason: No | Tools: No | Vision: No | Cache: Unknown
3.6 | 10.5 | No public latency | 57.8
OpenRouter
Phi 4
deepinfra/microsoft/phi-4
3 gaps | Benchmark: live
deepinfra | $0.07 | $0.14 | $0.126 | 16.4K | 16.4K
Reason: No | Tools: Yes | Vision: No | Cache: Unknown
11.2 | 10.4 | No public latency | 57.1
LiteLLM | OpenRouter
GPT 4.1 Nano
azure/gpt-4.1-nano
1 gap | Benchmark: live
vercel_ai_gateway | $0.10 | $0.40 | $0.34 | 1M | 32.8K
Reason: No | Tools: Yes | Vision: Yes | Cache: Yes
11.2 | 13 | No public latency | 55.9
LiteLLM | OpenRouter
Qwen3.6 35B A3B
qwen/qwen3.6-35b-a3b
2 gaps | Benchmark: live
qwen | $0.15 | $1.00 | $0.83 | 262.1K | 262.1K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
35.2 | 43.5 | No public latency | 55.1
OpenRouter
Nova Micro 1.0
amazon-nova/nova-micro-v1
2 gaps | Benchmark: live
amazon_nova | $0.035 | $0.14 | $0.119 | 128K | 10K
Reason: No | Tools: Yes | Vision: No | Cache: Yes
4.1 | 10.3 | No public latency | 53.8
LiteLLM | OpenRouter
Gemma 3 27b It
deepinfra/google/gemma-3-27b-it
3 gaps | Benchmark: live
deepinfra | $0.09 | $0.16 | $0.146 | 131.1K | 131.1K
Reason: No | Tools: Yes | Vision: Yes | Cache: Unknown
9.6 | 10.3 | No public latency | 53.4
LiteLLM | OpenRouter
Mercury 2
inception/mercury-2
2 gaps | Benchmark: live
inception | $0.25 | $0.75 | $0.65 | 128K | 50K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
30.6 | 32.8 | No public latency | 52.9
LiteLLM | OpenRouter
Codestral 2508
mistral/codestral-2508
4 gaps | Benchmark: live
mistralai | $0.30 | $0.90 | $0.78 | 256K | 256K
Reason: No | Tools: Yes | Vision: No | Cache: Yes
No score | No score | No public latency | 52.7
LiteLLM | OpenRouter
Llama 4 Scout
meta-llama/llama-4-scout
2 gaps | Benchmark: live
meta-llama | $0.10 | $0.30 | $0.26 | 10M | 16.4K
Reason: No | Tools: Yes | Vision: Yes | Cache: Unknown
6.7 | 13.5 | No public latency | 51.5
OpenRouter
Mistral Small 4
mistralai/mistral-small-2603
3 gaps | Benchmark: live
mistralai | $0.15 | $0.60 | $0.51 | 262.1K | No public limit
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
24.3 | 27.8 | No public latency | 51
OpenRouter
Qwen3.7 Plus
qwen/qwen3.7-plus
2 gaps | Benchmark: live
qwen | $0.32 | $1.28 | $1.088 | 1M | 65.5K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
46.5 | 53.3 | No public latency | 50.6
OpenRouter
MiniMax M2.7
sambanova/MiniMax-M2.7
1 gap | Benchmark: live
sambanova | $0.30 | $1.20 | $1.02 | 204.8K | 131.1K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
41.9 | 49.6 | No public latency | 50.2
LiteLLM | OpenRouter
Gemma 3 4b It
deepinfra/google/gemma-3-4b-it
3 gaps | Benchmark: live
deepinfra | $0.04 | $0.08 | $0.072 | 131.1K | 131.1K
Reason: No | Tools: Yes | Vision: Yes | Cache: Unknown
2.9 | 6.3 | No public latency | 50
LiteLLM | OpenRouter
Deepseek Chat V3.1
openrouter/deepseek/deepseek-chat-v3.1
1 gap | Benchmark: live
openrouter | $0.20 | $0.80 | $0.68 | 163.8K | 163.8K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
28.4 | 28.1 | No public latency | 49.6
LiteLLM | OpenRouter
Qwen3 235B A22B
deepinfra/Qwen/Qwen3-235B-A22B
2 gaps | Benchmark: live
hyperbolic | $0.18 | $0.54 | $0.468 | 262.1K | 262.1K
Reason: Yes | Tools: Yes | Vision: No | Cache: Unknown
17.4 | 19.8 | No public latency | 48.9
LiteLLM | OpenRouter
Solar Pro 3
upstage/solar-pro-3
3 gaps | Benchmark: live
upstage | $0.15 | $0.60 | $0.51 | 128K | No public limit
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
13.3 | 25.9 | No public latency | 48.4
OpenRouter
Trinity Large Thinking
arcee-ai/trinity-large-thinking
1 gap | Benchmark: live
arcee-ai | $0.22 | $0.85 | $0.724 | 262.1K | 262.1K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
27.2 | 31.9 | No public latency | 48.3
OpenRouter
Step 3.7 Flash
stepfun/step-3.7-flash
1 gap | Benchmark: live
stepfun | $0.20 | $1.15 | $0.96 | 256K | 256K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
37.1 | 42.6 | No public latency | 48.1
OpenRouter
Qwen3 Coder Next
qwen/qwen3-coder-next
2 gaps | Benchmark: live
qwen | $0.11 | $0.80 | $0.662 | 262.1K | 262.1K
Reason: No | Tools: Yes | Vision: No | Cache: Yes
22.9 | 28.3 | No public latency | 47
OpenRouter
Minimax.Minimax M2.5
minimax.minimax-m2.5
1 gap | Benchmark: live
bedrock_converse | $0.30 | $1.20 | $1.02 | 1M | 196.6K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
37.4 | 41.9 | No public latency | 46.6
LiteLLM | OpenRouter
KAT-Coder-Pro V2
kwaipilot/kat-coder-pro-v2
2 gaps | Benchmark: live
kwaipilot | $0.30 | $1.20 | $1.02 | 256K | 80K
Reason: No | Tools: Yes | Vision: No | Cache: Yes
45.6 | 43.8 | No public latency | 45.8
OpenRouter
Deepseek V3.1 Terminus
novita/deepseek/deepseek-v3.1-terminus
1 gap | Benchmark: live
deepseek | $0.27 | $1.00 | $0.854 | 163.8K | 32.8K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
33.7 | 33.9 | No public latency | 44.6
LiteLLM | OpenRouter
GPT 5.4 Nano
azure_ai/gpt-5.4-nano
2 gaps | Benchmark: live
azure_ai | $0.20 | $1.25 | $1.04 | 1.1M | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
43.9 | 44 | No public latency | 43.5
LiteLLM | OpenRouter
Minimax.Minimax M2.1
minimax.minimax-m2.1
1 gap | Benchmark: live
bedrock_converse | $0.30 | $1.20 | $1.02 | 1M | 196.6K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
32.8 | 39.4 | No public latency | 43
LiteLLM | OpenRouter
Llama 3.3 Nemotron Super 49B V1.5
deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5
3 gaps | Benchmark: live
deepinfra | $0.10 | $0.40 | $0.34 | 131.1K | 131.1K
Reason: Yes | Tools: Yes | Vision: No | Cache: Unknown
15.1 | 18.7 | No public latency | 42.4
LiteLLM | OpenRouter
Hermes 4 70B
nousresearch/hermes-4-70b
4 gaps | Benchmark: live
nousresearch | $0.13 | $0.40 | $0.346 | 131.1K | No public limit
Reason: Yes | Tools: No | Vision: No | Cache: Unknown
14.4 | 16 | No public latency | 40.5
OpenRouter
Minimax.Minimax M2
minimax.minimax-m2
1 gap | Benchmark: live
bedrock_converse | $0.30 | $1.20 | $1.02 | 204.8K | 204.8K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
29.2 | 36.1 | No public latency | 39.8
LiteLLM | OpenRouter
Qwen3 Coder 480B A35B
openrouter/qwen/qwen3-coder
2 gaps | Benchmark: live
qwen | $0.22 | $0.95 | $0.804 | 1M | 262.1K
Reason: No | Tools: Yes | Vision: No | Cache: Unknown
24.6 | 24.8 | No public latency | 39.7
LiteLLM | OpenRouter | Benchmark Seed
Nova Lite 1.0
amazon-nova/nova-lite-v1
2 gaps | Benchmark: live
amazon_nova | $0.06 | $0.24 | $0.204 | 300K | 10K
Reason: No | Tools: Yes | Vision: Yes | Cache: Yes
5.1 | 12.7 | No public latency | 38.7
LiteLLM | OpenRouter
Qwen3 Vl 32b Instruct
dashscope/qwen3-vl-32b-instruct
3 gaps | Benchmark: live
dashscope | $0.16 | $0.64 | $0.544 | 262.1K | 32.8K
Reason: No | Tools: Yes | Vision: Yes | Cache: Unknown
14.5 | 24.7 | No public latency | 38.4
LiteLLM | OpenRouter
Gemma 3n 4B
google/gemma-3n-e4b-it
4 gaps | Benchmark: live
google | $0.06 | $0.12 | $0.108 | 32.8K | No public limit
Reason: No | Tools: No | Vision: No | Cache: Unknown
4.2 | 6.4 | No public latency | 38
OpenRouter
Qwen 2.5 72b Instruct
deepinfra/Qwen/Qwen2.5-72B-Instruct
5 gaps | Benchmark: live
hyperbolic | $0.12 | $0.39 | $0.336 | 131.1K | 131.1K
Reason: No | Tools: Yes | Vision: No | Cache: Unknown
11.9 | No score | No public latency | 35.4
LiteLLM | OpenRouter
Hermes 3 Llama 3.1 70B
deepinfra/NousResearch/Hermes-3-Llama-3.1-70B
3 gaps | Benchmark: live
nousresearch | $0.30 | $0.30 | $0.30 | 131.1K | 131.1K
Reason: No | Tools: Yes | Vision: No | Cache: Unknown
9.2 | 12.6 | No public latency | 35.3
LiteLLM | OpenRouter
Reka Flash 3
rekaai/reka-flash-3
3 gaps | Benchmark: live
rekaai | $0.10 | $0.20 | $0.18 | 65.5K | 65.5K
Reason: Yes | Tools: No | Vision: No | Cache: Unknown
8.9 | 9.5 | No public latency | 33.9
OpenRouter
Glm 4.5 Air
vercel_ai_gateway/zai/glm-4.5-air
1 gap | Benchmark: live
vercel_ai_gateway | $0.20 | $1.10 | $0.92 | 131.1K | 131.1K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
23.8 | 23.2 | No public latency | 32.8
LiteLLM | OpenRouter
Gemini 2.5 Flash Lite
gemini-2.5-flash-lite
2 gaps | Benchmark: live
vertex_ai-language-models | $0.10 | $0.40 | $0.34 | 1M | 65.5K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
9.5 | 17.6 | No public latency | 32.7
LiteLLM | OpenRouter
Qwen3 Vl 8b Instruct
novita/qwen/qwen3-vl-8b-instruct
3 gaps | Benchmark: live
novita | $0.08 | $0.50 | $0.416 | 256K | 32.8K
Reason: No | Tools: Yes | Vision: Yes | Cache: Unknown
7.3 | 14.3 | No public latency | 32
LiteLLM | OpenRouter
Qwen3.6 Plus
openrouter/qwen/qwen3.6-plus
1 gap | Benchmark: live
openrouter | $0.325 | $1.95 | $1.625 | 1M | 65.5K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
42.9 | 50 | No public latency | 31.8
LiteLLM | OpenRouter
Mimo V2.5
openrouter/xiaomi/mimo-v2.5
1 gap | Benchmark: live
openrouter | $0.40 | $2.00 | $1.68 | 1M | 131.1K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
42.1 | 49 | No public latency | 31.3
LiteLLM | OpenRouter
Deepseek V3.2
azure_ai/deepseek-v3.2
1 gap | Benchmark: live
bedrock_converse | $0.58 | $1.68 | $1.46 | 163.8K | 163.8K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
36.7 | 41.7 | No public latency | 30.7
LiteLLM | OpenRouter
MiniMax M3
minimax/MiniMax-M3
1 gap | Benchmark: live
minimax | $0.60 | $2.40 | $2.04 | 1M | 512K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
43.4 | 54.7 | No public latency | 27.7
LiteLLM | OpenRouter
INTELLECT-3
prime-intellect/intellect-3
2 gaps | Benchmark: live
prime-intellect | $0.20 | $1.10 | $0.92 | 131.1K | 131.1K
Reason: Yes | Tools: Yes | Vision: No | Cache: Unknown
19.1 | 22.2 | No public latency | 27.2
OpenRouter
Nano Banana (Gemini 2.5 Flash Image)
google/gemini-2.5-flash-image
4 gaps | Benchmark: live
google | $0.30 | $2.50 | $2.06 | 32.8K | 32.8K
Reason: No | Tools: No | Vision: Yes | Cache: Yes
No score | No score | No public latency | 26.6
OpenRouter
Nano Banana 2 (Gemini 3.1 Flash Image Preview)
google/gemini-3.1-flash-image-preview
5 gaps | Benchmark: live
google | $0.50 | $3.00 | $2.50 | 131.1K | 65.5K
Reason: Yes | Tools: No | Vision: Yes | Cache: Unknown
No score | No score | No public latency | 26.1
OpenRouter
GLM 4.6V
z-ai/glm-4.6v
2 gaps | Benchmark: live
z-ai | $0.30 | $0.90 | $0.78 | 131.1K | 32.8K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
19.7 | 23.4 | No public latency | 25.9
OpenRouter
Qwen3.5 122b A10b
openrouter/qwen/qwen3.5-122b-a10b
3 gaps | Benchmark: live
openrouter | $0.40 | $2.00 | $1.68 | 262.1K | 262.1K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Unknown
34.7 | 41.6 | No public latency | 25.7
LiteLLM | OpenRouter
Gemini 3.1 Flash Lite Preview
gemini-3.1-flash-lite-preview
1 gap | Benchmark: live
vertex_ai-language-models | $0.25 | $1.50 | $1.25 | 1M | 65.5K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
30.1 | 33.5 | No public latency | 25.6
LiteLLM | OpenRouter
GPT 5 Mini
azure/gpt-5-mini
1 gap | Benchmark: live
github_copilot | $0.25 | $2.00 | $1.65 | 400K | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
35.3 | 41.2 | No public latency | 25.2
LiteLLM | OpenRouter
GPT-5 Image Mini
openai/gpt-5-image-mini
4 gaps | Benchmark: live
openai | $2.50 | $2.00 | $2.10 | 400K | 128K
Reason: Yes | Tools: No | Vision: Yes | Cache: Yes
No score | No score | No public latency | 24.7
OpenRouter
Zai.Glm 5
openrouter/z-ai/glm-5
1 gap | Benchmark: live
bedrock_converse | $0.80 | $2.56 | $2.208 | 202.8K | 128K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
44.2 | 49.8 | No public latency | 24
LiteLLM | OpenRouter
Llama 4 Maverick
snowflake/llama4-maverick
2 gaps | Benchmark: live
meta-llama | $0.24 | $0.97 | $0.824 | 1M | 16.4K
Reason: No | Tools: Yes | Vision: Yes | Cache: Unknown
15.6 | 18.4 | No public latency | 23.8
LiteLLM | OpenRouter
Qwen3.5 Plus 2026-02-15
openrouter/qwen/qwen3.5-plus-02-15
5 gaps | Benchmark: live
openrouter | $0.40 | $2.40 | $2.00 | 1M | 65.5K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Unknown
No score | No score | No public latency | 23.7
LiteLLM | OpenRouter
Qwen3 Next 80b A3b Thinking
dashscope/qwen3-next-80b-a3b-thinking
3 gaps | Benchmark: live
together_ai | $0.15 | $1.20 | $0.99 | 262.1K | 262.1K
Reason: Yes | Tools: Yes | Vision: No | Cache: Unknown
19.5 | 26.7 | No public latency | 23.5
LiteLLM | OpenRouter
GPT 5.1 Codex Mini
azure/gpt-5.1-codex-mini
1 gap | Benchmark: live
chatgpt | $0.25 | $2.00 | $1.65 | 400K | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
36.4 | 38.6 | No public latency | 23.3
LiteLLM | OpenRouter
Llama 3 2 1b Instruct
watsonx/meta-llama/llama-3-2-1b-instruct
3 gaps | Benchmark: live
meta-llama | $0.10 | $0.10 | $0.10 | 131.1K | 128K
Reason: No | Tools: Yes | Vision: No | Cache: Unknown
0.6 | 6.3 | No public latency | 23
LiteLLM | OpenRouter
Nemotron 3 Ultra
nvidia/nemotron-3-ultra-550b-a55b
2 gaps | Benchmark: live
nvidia | $0.50 | $2.50 | $2.10 | 1M | 16.4K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
37.6 | 47.7 | No public latency | 22.6
OpenRouter
Qwen3.5 35b A3b
openrouter/qwen/qwen3.5-35b-a3b
2 gaps | Benchmark: live
openrouter | $0.25 | $2.00 | $1.65 | 262.1K | 262.1K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
30.3 | 37.1 | No public latency | 22.6
LiteLLM | OpenRouter
Grok 4.3
xai/grok-4.3
1 gap | Benchmark: live
x-ai | $1.25 | $2.50 | $2.25 | 1M | 1M
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
41 | 53.2 | No public latency | 22.5
LiteLLM | OpenRouter
Qwen3.5 27b
openrouter/qwen/qwen3.5-27b
3 gaps | Benchmark: live
openrouter | $0.30 | $2.40 | $1.98 | 262.1K | 65.5K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Unknown
34.9 | 42.1 | No public latency | 22.2
LiteLLM | OpenRouter
Qwen3 Vl 30b A3b Instruct
novita/qwen/qwen3-vl-30b-a3b-instruct
3 gaps | Benchmark: live
novita | $0.20 | $0.70 | $0.60 | 262.1K | 32.8K
Reason: No | Tools: Yes | Vision: Yes | Cache: Unknown
14.3 | 16 | No public latency | 22.2
LiteLLM | OpenRouter
Mimo V2.5 Pro
openrouter/xiaomi/mimo-v2.5-pro
1 gap | Benchmark: live
openrouter | $1.00 | $3.00 | $2.60 | 1M | 131.1K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
45.5 | 53.8 | No public latency | 22
LiteLLM | OpenRouter
Mistral Large 3 2512
mistral/mistral-large-2512
1 gap | Benchmark: live
openrouter | $0.50 | $1.50 | $1.30 | 262.1K | 262.1K
Reason: No | Tools: Yes | Vision: Yes | Cache: Yes
22.7 | 22.8 | No public latency | 21.4
LiteLLM | OpenRouter
GLM 5.1
z-ai/glm-5.1
2 gaps | Benchmark: live
z-ai | $0.98 | $3.08 | $2.66 | 202.8K | No public limit
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
43.4 | 51.4 | No public latency | 20.8
OpenRouter
GPT 4.1 Mini
azure/gpt-4.1-mini
1 gap | Benchmark: live
vercel_ai_gateway | $0.40 | $1.60 | $1.36 | 1M | 32.8K
Reason: No | Tools: Yes | Vision: Yes | Cache: Yes
18.5 | 22.9 | No public latency | 20.8
LiteLLM | OpenRouter
Qwen 3 32b
cerebras/qwen-3-32b
3 gaps | Benchmark: live
deepinfra | $0.40 | $0.80 | $0.72 | 131.1K | 131K
Reason: Yes | Tools: Yes | Vision: No | Cache: Unknown
13.8 | 16.5 | No public latency | 20.3
LiteLLM | OpenRouter
Cogito v2.1 671B
deepcogito/cogito-v2.1-671b
6 gaps | Benchmark: live
deepcogito | $1.25 | $1.25 | $1.25 | 128K | No public limit
Reason: Yes | Tools: No | Vision: No | Cache: Unknown
24.8 | No score | No public latency | 19.8
OpenRouter
Moonshotai.Kimi K2.5
bedrock/moonshotai.kimi-k2.5
1 gap | Benchmark: live
bedrock_converse | $0.60 | $3.03 | $2.544 | 262.1K | 262.1K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
39.6 | 46.8 | No public latency | 19.6
LiteLLM | OpenRouter
Gemini 3 Flash Preview
vertex_ai/gemini-3-flash-preview
1 gap | Benchmark: live
vertex_ai-language-models | $0.50 | $3.00 | $2.50 | 1M | 65.5K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
42.6 | 46.4 | No public latency | 19.1
LiteLLM | OpenRouter
Qwen3.6 27B
qwen/qwen3.6-27b
3 gaps | Benchmark: live
qwen | $0.287 | $3.10 | $2.5374 | 262.1K | 262.1K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Unknown
36.5 | 45.8 | No public latency | 19.1
OpenRouter
Qwen3 Vl 30b A3b Thinking
novita/qwen/qwen3-vl-30b-a3b-thinking
3 gaps | Benchmark: live
novita | $0.20 | $1.00 | $0.84 | 131.1K | 32.8K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Unknown
13.1 | 19.7 | No public latency | 18.9
LiteLLM | OpenRouter
Qwen3.7 Max
qwen/qwen3.7-max
1 gap | Benchmark: live
qwen | $1.25 | $3.75 | $3.25 | 1M | 65.5K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
50.1 | 56.6 | No public latency | 17.6
OpenRouter
Llama 3 8b Instruct
gradient_ai/llama3-8b-instruct
3 gaps | Benchmark: live
gradient_ai | $0.20 | $0.20 | $0.20 | 8.2K | 8.2K
Reason: No | Tools: No | Vision: No | Cache: Unknown
4 | 6.4 | No public latency | 17.5
LiteLLM | OpenRouter
Zai Glm 4.7
cerebras/zai-glm-4.7
1 gap | Benchmark: live
bedrock_converse | $2.25 | $2.75 | $2.65 | 202.8K | 131.1K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
36.3 | 42.1 | No public latency | 17.5
LiteLLM | OpenRouter
Kimi K2 0905
novita/moonshotai/kimi-k2-0905
2 gaps | Benchmark: live
moonshotai | $0.60 | $2.50 | $2.12 | 262.1K | 262.1K
Reason: No | Tools: Yes | Vision: No | Cache: Unknown
25.9 | 30.9 | No public latency | 16.8
LiteLLM | OpenRouter
Moonshotai.Kimi K2 Thinking
bedrock/moonshotai.kimi-k2-thinking
2 gaps | Benchmark: live
moonshotai | $0.73 | $3.03 | $2.57 | 262.1K | 262.1K
Reason: Yes | Tools: Yes | Vision: No | Cache: Unknown
34.8 | 40.9 | No public latency | 16.8
LiteLLM | OpenRouter
Qwen3 Next 80b A3b Instruct
dashscope/qwen3-next-80b-a3b-instruct
3 gaps | Benchmark: live
together_ai | $0.15 | $1.20 | $0.99 | 262.1K | 262.1K
Reason: No | Tools: Yes | Vision: No | Cache: Unknown
15.3 | 20.1 | No public latency | 16.7
LiteLLM | OpenRouter
Kimi K2 0711
vercel_ai_gateway/moonshotai/kimi-k2
2 gaps | Benchmark: live
vercel_ai_gateway | $0.55 | $2.20 | $1.87 | 131.1K | 32.8K
Reason: No | Tools: Yes | Vision: No | Cache: Unknown
22.1 | 26.3 | No public latency | 16.6
LiteLLM | OpenRouter
Glm 4.5
vercel_ai_gateway/zai/glm-4.5
1 gap | Benchmark: live
vercel_ai_gateway | $0.60 | $2.20 | $1.88 | 131.1K | 131.1K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
26.3 | 26.4 | No public latency | 16.4
LiteLLM | OpenRouter
Kimi K2.6
azure_ai/kimi-k2.6
1 gap | Benchmark: live
moonshotai | $0.95 | $4.00 | $3.39 | 262.1K | 262.1K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
47.1 | 53.9 | No public latency | 16.4
LiteLLM | OpenRouter
Qwen3.5 397b A17b
openrouter/qwen/qwen3.5-397b-a17b
2 gaps | Benchmark: live
together_ai | $0.60 | $3.60 | $3.00 | 262.1K | 65.5K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Unknown
41.3 | 45 | No public latency | 16.3
LiteLLM | OpenRouter
Llama 3.3 70B Instruct
azure_ai/Llama-3.3-70B-Instruct
3 gaps | Benchmark: live
gradient_ai | $0.71 | $0.71 | $0.71 | 131.1K | 131.1K
Reason: No | Tools: Yes | Vision: No | Cache: Unknown
10.7 | 14.5 | No public latency | 16.1
LiteLLM | OpenRouter
Mistral Medium 3.1
mistralai/mistral-medium-3.1
2 gaps | Benchmark: live
mistralai | $0.40 | $2.00 | $1.68 | 131.1K | No public limit
Reason: No | Tools: Yes | Vision: Yes | Cache: Yes
18.3 | 21.3 | No public latency | 16
OpenRouter
Llama 3.2 11B Vision Instruct
azure_ai/Llama-3.2-11B-Vision-Instruct
3 gaps | Benchmark: live
meta-llama | $0.37 | $0.37 | $0.37 | 131.1K | 131.1K
Reason: No | Tools: Yes | Vision: Yes | Cache: Unknown
4.2 | 8.7 | No public latency | 16
LiteLLM | OpenRouter
Hermes 3 Llama 3.1 405B
deepinfra/NousResearch/Hermes-3-Llama-3.1-405B
3 gaps | Benchmark: live
nousresearch | $1.00 | $1.00 | $1.00 | 131.1K | 131.1K
Reason: No | Tools: Yes | Vision: No | Cache: Unknown
18.1 | 17.6 | No public latency | 15.8
LiteLLM | OpenRouter
GLM 5 Turbo
z-ai/glm-5-turbo
1 gap | Benchmark: live
z-ai | $1.20 | $4.00 | $3.44 | 262.1K | 131.1K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
36.8 | 46.8 | No public latency | 15.1
OpenRouter
Zai Glm 4.6
cerebras/zai-glm-4.6
1 gap | Benchmark: live
vercel_ai_gateway | $2.25 | $2.75 | $2.65 | 202.8K | 200K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
29.5 | 32.5 | No public latency | 14.7
LiteLLM | OpenRouter
GPT-5.4 mini
azure_ai/gpt-5.4-mini
2 gaps | Benchmark: live
azure_ai | $0.75 | $4.50 | $3.75 | 1.1M | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
51.5 | 48.9 | No public latency | 14.2
LiteLLM | OpenRouter | Official Catalog
Mistral Medium 3
vertex_ai/mistral-medium-3
1 gap | Benchmark: live
vertex_ai-mistral_models | $0.40 | $2.00 | $1.68 | 131.1K | 8.2K
Reason: No | Tools: Yes | Vision: Yes | Cache: Yes
13.6 | 18.8 | No public latency | 14.1
LiteLLM | OpenRouter
Nova 2 Lite
amazon/nova-2-lite-v1
3 gaps | Benchmark: live
amazon | $0.30 | $2.50 | $2.06 | 1M | 65.5K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Unknown
23.9 | 29.7 | No public latency | 14
OpenRouter
Gemini 2.5 Flash
deepinfra/google/gemini-2.5-flash
1 gap | Benchmark: live
vertex_ai-language-models | $0.30 | $2.50 | $2.06 | 1M | 1M
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
22.2 | 27 | No public latency | 13.9
LiteLLM | OpenRouter
Qwen3 Vl 235b A22b Instruct
dashscope/qwen3-vl-235b-a22b-instruct
2 gaps | Benchmark: live
dashscope | $0.40 | $1.60 | $1.36 | 262.1K | 32.8K
Reason: No | Tools: Yes | Vision: Yes | Cache: Yes
16.5 | 20.8 | No public latency | 13.8
LiteLLM | OpenRouter
Devstral 2 2512
mistral/devstral-2512
2 gaps | Benchmark: live
openrouter | $0.40 | $2.00 | $1.68 | 262.1K | 256K
Reason: No | Tools: Yes | Vision: No | Cache: Yes
23.7 | 22 | No public latency | 13.4
LiteLLM | OpenRouter
Qwen3 235B A22B Thinking 2507
deepinfra/Qwen/Qwen3-235B-A22B-Thinking-2507
1 gap | Benchmark: live
together_ai | $0.30 | $2.90 | $2.38 | 262.1K | 262.1K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
23.2 | 29.5 | No public latency | 12.6
LiteLLM | OpenRouter
Qwen3 VL 8B Thinking
qwen/qwen3-vl-8b-thinking
3 gaps | Benchmark: live
qwen | $0.117 | $1.365 | $1.1154 | 256K | 32.8K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Unknown
9.8 | 16.7 | No public latency | 12.6
OpenRouter
MiniMax M1
minimax/minimax-m1
3 gaps | Benchmark: live
minimax | $0.40 | $2.20 | $1.84 | 1M | 40K
Reason: Yes | Tools: Yes | Vision: No | Cache: Unknown
14.5 | 24.4 | No public latency | 12.3
OpenRouter
Qwen3 Max Thinking
qwen/qwen3-max-thinking
3 gaps | Benchmark: live
qwen | $0.78 | $3.90 | $3.276 | 262.1K | 32.8K
Reason: Yes | Tools: Yes | Vision: No | Cache: Unknown
30.5 | 39.8 | No public latency | 12.2
OpenRouter
Deepseek R1 Distill Llama 70b
gradient_ai/deepseek-r1-distill-llama-70b
5 gaps | Benchmark: live
vercel_ai_gateway | $0.99 | $0.99 | $0.99 | 131.1K | 131.1K
Reason: Yes | Tools: Yes | Vision: No | Cache: Unknown
11.4 | No score | No public latency | 11.5
LiteLLM | OpenRouter
Qwen3.6 Max Preview
qwen/qwen3.6-max-preview
2 gaps | Benchmark: live
qwen | $1.04 | $6.24 | $5.20 | 262.1K | 65.5K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
44.9 | 51.8 | No public latency | 10.4
OpenRouter
Llama 3.1 70b Instruct
perplexity/llama-3.1-70b-instruct
3 gaps | Benchmark: live
perplexity | $1.00 | $1.00 | $1.00 | 131.1K | 131.1K
Reason: No | Tools: Yes | Vision: No | Cache: Unknown
10.9 | 12.5 | No public latency | 9.5
LiteLLM | OpenRouter
O4 Mini
azure/o4-mini
1 gap | Benchmark: live
vercel_ai_gateway | $1.10 | $4.40 | $3.74 | 200K | 100K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
25.6 | 33.1 | No public latency | 9.4
LiteLLM | OpenRouter
Claude Haiku 4.5
azure_ai/claude-haiku-4-5
1 gap | Benchmark: live
vertex_ai-anthropic_models | $1.00 | $5.00 | $4.20 | 200K | 200K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
32.6 | 37.1 | No public latency | 9.2
LiteLLM | OpenRouter | Official Catalog
Claude 3 Haiku
openrouter/anthropic/claude-3-haiku
2 gaps | Benchmark: live
vertex_ai-anthropic_models | $0.25 | $1.25 | $1.05 | 200K | 4.1K
Reason: No | Tools: Yes | Vision: Yes | Cache: Yes
6.7 | 12.3 | No public latency | 8.3
LiteLLM | OpenRouter
Ft:GPT 3.5 Turbo
azure/gpt-3.5-turbo
4 gaps | Benchmark: live
vercel_ai_gateway | $0.50 | $1.50 | $1.30 | 16.4K | 4.1K
Reason: No | Tools: Yes | Vision: No | Cache: Yes
10.7 | No score | No public latency | 8.2
LiteLLM | OpenRouter
Glm 4.5v
zai/glm-4.5v
2 gaps | Benchmark: live
z-ai | $0.60 | $1.80 | $1.56 | 128K | 32K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
10.9 | 15.1 | No public latency | 7.9
LiteLLM | OpenRouter
Qwen3 Vl 235b A22b Thinking
dashscope/qwen3-vl-235b-a22b-thinking
3 gaps | Benchmark: live
dashscope | $0.40 | $4.00 | $3.28 | 131.1K | 32.8K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Unknown
20.9 | 27.6 | No public latency | 7.7
LiteLLM | OpenRouter
Gemini 3.5 Flash
vertex_ai/gemini-3.5-flash
1 gap | Benchmark: live
vertex_ai-language-models | $1.50 | $9.00 | $7.50 | 1M | 65.5K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
45 | 55.3 | No public latency | 7.7
LiteLLM | OpenRouter
Llama 3 70b Instruct
openrouter/meta-llama/llama-3-70b-instruct
3 gaps | Benchmark: live
openrouter | $0.59 | $0.79 | $0.75 | 8.2K | 8K
Reason: No | Tools: No | Vision: No | Cache: Unknown
6.8 | 8.9 | No public latency | 6.9
LiteLLM | OpenRouter
Mistral Medium 3.5
mistralai/mistral-medium-3-5
4 gaps | Benchmark: live
mistralai | $1.50 | $7.50 | $6.30 | 262.1K | No public limit
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Unknown
35.4 | 39.2 | No public latency | 6.8
OpenRouter
Nano Banana Pro (Gemini 3 Pro Image Preview)
google/gemini-3-pro-image-preview
4 gaps | Benchmark: live
google | $2.00 | $12.00 | $10.00 | 65.5K | 32.8K
Reason: Yes | Tools: No | Vision: Yes | Cache: Yes
No score | No score | No public latency | 6.4
OpenRouter
O3
azure/o3
1 gap | Benchmark: live
vercel_ai_gateway | $2.00 | $8.00 | $6.80 | 200K | 100K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
38.4 | 38.4 | No public latency | 6.1
LiteLLM | OpenRouter
Hermes 4 405B
nousresearch/hermes-4-405b
4 gaps | Benchmark: live
nousresearch | $1.00 | $3.00 | $2.60 | 131.1K | No public limit
Reason: Yes | Tools: No | Vision: No | Cache: Unknown
16 | 18.6 | No public latency | 6
OpenRouter
GPT 5.1
azure/gpt-5.1
1 gap | Benchmark: live
github_copilot | $1.25 | $10.00 | $8.25 | 409.6K | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
44.7 | 47.7 | No public latency | 5.9
LiteLLM | OpenRouter
GPT 5
azure/gpt-5
1 gap | Benchmark: live
github_copilot | $1.25 | $10.00 | $8.25 | 409.6K | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
36 | 44.6 | No public latency | 5.8
LiteLLM | OpenRouter
Gemini 3.1 Pro Preview
gemini-3.1-pro-preview
1 gap | Benchmark: live
vertex_ai-language-models | $2.00 | $12.00 | $10.00 | 1M | 65.5K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
55.5 | 57.2 | No public latency | 5.7
LiteLLM | OpenRouter
O3 Mini High
openrouter/openai/o3-mini-high
2 gaps | Benchmark: live
openrouter | $1.10 | $4.40 | $3.74 | 200K | 100K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
17.3 | 25.2 | No public latency | 5.6
LiteLLM | OpenRouter
GPT-5 Image
openai/gpt-5-image
4 gaps | Benchmark: live
openai | $10.00 | $10.00 | $10.00 | 400K | 128K
Reason: Yes | Tools: No | Vision: Yes | Cache: Yes
No score | No score | No public latency | 5.5
OpenRouter
GPT 5.1 Codex
azure/gpt-5.1-codex
1 gap | Benchmark: live
openai | $1.25 | $10.00 | $8.25 | 400K | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
36.6 | 43.1 | No public latency | 5.5
LiteLLM | OpenRouter
GPT 5 Codex
azure/gpt-5-codex
1 gap | Benchmark: live
openrouter | $1.25 | $10.00 | $8.25 | 400K | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
38.9 | 44.6 | No public latency | 5.4
LiteLLM | OpenRouter
Qwen3 Max
dashscope/qwen3-max
1 gap | Benchmark: live
dashscope | $2.11 | $8.45 | $7.182 | 262.1K | 65.5K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
26.4 | 31.4 | No public latency | 5
LiteLLM | OpenRouter
Gemini 2.5 Pro
deepinfra/google/gemini-2.5-pro
1 gap | Benchmark: live
google | $1.25 | $10.00 | $8.25 | 1M | 1M
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
32 | 34.6 | No public latency | 4.8
LiteLLM | OpenRouter | Benchmark Seed
O3 Mini
azure/o3-mini
4 gaps | Benchmark: live
vercel_ai_gateway | $1.10 | $4.40 | $3.74 | 200K | 100K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
17.9 | No score | No public latency | 4.8
LiteLLM | OpenRouter
GPT-5.4
azure_ai/gpt-5.4
1 gap | Benchmark: live
azure_ai | $2.50 | $15.00 | $12.50 | 1.1M | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
57.2 | 56.8 | No public latency | 4.7
LiteLLM | OpenRouter | Official Catalog
GPT 4.1
azure/gpt-4.1
1 gap | Benchmark: live
openai | $2.00 | $8.00 | $6.80 | 1M | 32.8K
Reason: No | Tools: Yes | Vision: Yes | Cache: Yes
21.8 | 26.3 | No public latency | 4.6
LiteLLM | OpenRouter | Benchmark Seed
Nova Pro 1.0
amazon-nova/nova-pro-v1
1 gap | Benchmark: live
amazon_nova | $0.80 | $3.20 | $2.72 | 300K | 10K
Reason: No | Tools: Yes | Vision: Yes | Cache: Yes
11 | 13.5 | No public latency | 4.6
LiteLLM | OpenRouter
GPT 5.3 Codex
azure/gpt-5.3-codex
1 gap | Benchmark: live
github_copilot | $1.75 | $14.00 | $11.55 | 400K | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
53.1 | 53.6 | No public latency | 4.6
LiteLLM | OpenRouter
GPT 5.2
azure/gpt-5.2
1 gap | Benchmark: live
github_copilot | $1.75 | $14.00 | $11.55 | 409.6K | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
48.7 | 51.3 | No public latency | 4.6
LiteLLM | OpenRouter
Claude Sonnet 4.6
azure_ai/claude-sonnet-4-6
1 gap | Benchmark: live
vertex_ai-anthropic_models | $3.00 | $15.00 | $12.60 | 1M | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
50.9 | 51.7 | No public latency | 4.5
LiteLLM | OpenRouter | Official Catalog
GPT 4o
azure/gpt-4o
4 gaps | Benchmark: live
vercel_ai_gateway | $2.50 | $10.00 | $8.50 | 131.1K | 16.4K
Reason: No | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | 4.4
LiteLLM | OpenRouter
GPT 5.2 Codex
azure/gpt-5.2-codex
1 gap | Benchmark: live
openrouter | $1.75 | $14.00 | $11.55 | 400K | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
43 | 49 | No public latency | 4.2
LiteLLM | OpenRouter
Xai.Grok 4.20
oci/xai.grok-4.20
2 gaps | Benchmark: live
x-ai | $3.00 | $15.00 | $12.60 | 2M | 131.1K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
40.5 | 49.3 | No public latency | 3.8
LiteLLM | OpenRouter
Claude 3 5 Haiku
heroku/claude-3-5-haiku
2 gaps | Benchmark: live
vertex_ai-anthropic_models | $1.00 | $5.00 | $4.20 | 200K | 8.2K
Reason: No | Tools: Yes | Vision: Yes | Cache: Yes
10.7 | 18.7 | No public latency | 3.7
LiteLLM | OpenRouter
Claude Sonnet 4 5
azure_ai/claude-sonnet-4-5
1 gap | Benchmark: live
anthropic | $3.00 | $15.00 | $12.60 | 1M | 1M
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
38.6 | 43 | No public latency | 3.7
LiteLLM | OpenRouter | Benchmark Seed
Claude Sonnet 4
github_copilot/claude-sonnet-4
1 gap | Benchmark: live
vertex_ai-anthropic_models | $3.00 | $15.00 | $12.60 | 1M | 64K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
34.1 | 38.7 | No public latency | 3.4
LiteLLM | OpenRouter
Claude Opus 4.8
azure_ai/claude-opus-4-8
1 gap | Benchmark: live
vertex_ai-anthropic_models | $5.00 | $25.00 | $21.00 | 1M | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
56.7 | 61.4 | No public latency | 3
LiteLLM | OpenRouter | Official Catalog
Claude Opus 4 7
azure_ai/claude-opus-4-7
1 gap | Benchmark: live
vertex_ai-anthropic_models | $5.00 | $25.00 | $21.00 | 1M | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
52.5 | 57.3 | No public latency | 2.9
LiteLLM | OpenRouter
Deepseek R1
azure_ai/deepseek-r1
2 gaps | Benchmark: live
deepseek | $1.35 | $5.40 | $4.59 | 163.8K | 32.8K
Reason: Yes | Tools: Yes | Vision: No | Cache: Yes
15.9 | 18.8 | No public latency | 2.8
LiteLLM | OpenRouter | Benchmark Seed
Claude Opus 4 6
azure_ai/claude-opus-4-6
1 gap | Benchmark: live
vertex_ai-anthropic_models | $5.00 | $25.00 | $21.00 | 1M | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
48.1 | 52.9 | No public latency | 2.8
LiteLLM | OpenRouter
Claude Opus 4 5
azure_ai/claude-opus-4-5
1 gap | Benchmark: live
vertex_ai-anthropic_models | $5.00 | $25.00 | $21.00 | 409.6K | 64K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
47.8 | 49.7 | No public latency | 2.6
LiteLLM | OpenRouter
GPT 5 Chat
azure/gpt-5-chat
4 gaps | Benchmark: live
openrouter | $1.25 | $10.00 | $8.25 | 128K | 16.4K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
21.2 | No score | No public latency | 2.6
LiteLLM | OpenRouter
Mistral Large 2407
azure_ai/mistral-large-2407
2 gaps | Benchmark: live
vertex_ai-mistral_models | $2.00 | $6.00 | $5.20 | 131.1K | 128K
Reason: No | Tools: Yes | Vision: No | Cache: Yes
13.8 | 15.1 | No public latency | 2.5
LiteLLM | OpenRouter
GPT-5.5
azure/gpt-5.5
1 gap | Benchmark: live
openai | $5.00 | $30.00 | $25.00 | 1.1M | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
59.1 | 60.2 | No public latency | 2.5
LiteLLM | OpenRouter | Official Catalog
GPT-4o (2024-05-13)
azure/gpt-4o-2024-05-13
4 gaps | Benchmark: live
github_copilot | $5.00 | $15.00 | $13.00 | 128K | 4.1K
Reason: No | Tools: Yes | Vision: Yes | Cache: Yes
24.2 | No score | No public latency | 1.9
LiteLLM | OpenRouter
Nova Premier 1.0
amazon-nova/nova-premier-v1
1 gap | Benchmark: live
amazon_nova | $2.50 | $12.50 | $10.50 | 1M | 32K
Reason: No | Tools: Yes | Vision: Yes | Cache: Yes
13.8 | 19 | No public latency | 1.8
LiteLLM | OpenRouter
Ft:GPT 4o 2024 08 06
azure/gpt-4o-2024-08-06
2 gaps | Benchmark: live
github_copilot | $2.50 | $10.00 | $8.50 | 128K | 16.4K
Reason: No | Tools: Yes | Vision: Yes | Cache: Yes
16.6 | 18.6 | No public latency | 1.8
LiteLLM | OpenRouter
Claude Fable 5
azure_ai/claude-fable-5
1 gap | Benchmark: live
vertex_ai-anthropic_models | $10.00 | $50.00 | $42.00 | 1M | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
62 | 64.9 | No public latency | 1.7
LiteLLM | OpenRouter | Official Catalog
Ft:GPT 4o 2024 11 20
azure/gpt-4o-2024-11-20
2 gaps | Benchmark: live
github_copilot | $2.75 | $11.00 | $9.35 | 128K | 16.4K
Reason: No | Tools: Yes | Vision: Yes | Cache: Yes
16.7 | 17.3 | No public latency | 1.5
LiteLLM | OpenRouter
Jamba Large 1.7
jamba-large-1.7
3 gaps | Benchmark: live
ai21 | $2.00 | $8.00 | $6.80 | 256K | 256K
Reason: No | Tools: Yes | Vision: No | Cache: Unknown
7.8 | 10.9 | No public latency | 1.1
LiteLLM | OpenRouter
GPT 4 Turbo
azure/gpt-4-turbo
4 gaps | Benchmark: live
vercel_ai_gateway | $10.00 | $30.00 | $26.00 | 128K | 4.1K
Reason: No | Tools: Yes | Vision: Yes | Cache: Yes
21.5 | No score | No public latency | 0.8
LiteLLM | OpenRouter
Claude Opus 4 1
azure_ai/claude-opus-4-1
3 gaps | Benchmark: live
vertex_ai-anthropic_models | $15.00 | $75.00 | $63.00 | 200K | 32K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
36.5 | No score | No public latency | 0.7
LiteLLM | OpenRouter
Claude Opus 4
gmi/anthropic/claude-opus-4
3 gaps | Benchmark: live
vertex_ai-anthropic_models | $15.00 | $75.00 | $63.00 | 409.6K | 32K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
34 | No score | No public latency | 0.7
LiteLLM | OpenRouter
O1
azure/o1
2 gaps | Benchmark: live
vercel_ai_gateway | $15.00 | $60.00 | $51.00 | 200K | 100K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
20.5 | 30.7 | No public latency | 0.5
LiteLLM | OpenRouter
GPT 4
azure/gpt-4
4 gaps | Benchmark: live
github_copilot | $30.00 | $60.00 | $54.00 | 32.8K | 4.1K
Reason: No | Tools: Yes | Vision: No | Cache: Yes
13.1 | No score | No public latency | 0.2
LiteLLM | OpenRouter
Phi 4 Mini Instruct
wandb/microsoft/Phi-4-mini-instruct
2 gaps | Benchmark: live
microsoft | $8,000.00 | $35,000.00 | $29,600.00 | 131.1K | 128K
Reason: No | Tools: No | Vision: No | Cache: Yes
3.6 | 8.4 | No public latency | 0
LiteLLM | OpenRouter
Llama 3.1 405b Instruct
meta-llama/llama-3.1-405b-instruct
9 gaps | Benchmark: stale estimate
meta-llama | No public price | No public price | No public price | No public limit | No public limit
Reason: Unknown | Tools: Unknown | Vision: Unknown | Cache: Unknown
66 | 68 | No public latency | No score
Benchmark Seed
Anthropic Claude Haiku Latest
~anthropic/claude-haiku-latest
5 gaps | Benchmark: missing
~anthropic | $1.00 | $5.00 | $4.20 | 200K | 64K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
OpenRouter
Anthropic Claude Sonnet Latest
~anthropic/claude-sonnet-latest
5 gaps | Benchmark: missing
~anthropic | $3.00 | $15.00 | $12.60 | 1M | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
OpenRouter
Claude Fable Latest
~anthropic/claude-fable-latest
5 gaps | Benchmark: missing
~anthropic | $10.00 | $50.00 | $42.00 | 1M | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
OpenRouter
Claude Opus Latest
~anthropic/claude-opus-latest
5 gaps | Benchmark: missing
~anthropic | $5.00 | $25.00 | $21.00 | 1M | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
OpenRouter
Google Gemini Flash Latest
gemini/gemini-flash-latest
5 gaps | Benchmark: missing
~google | $0.30 | $2.50 | $2.06 | 1M | 65.5K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
LiteLLM | OpenRouter
Google Gemini Pro Latest
gemini-pro-latest
5 gaps | Benchmark: missing
~google | $1.25 | $10.00 | $8.25 | 1M | 65.5K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
LiteLLM | OpenRouter
MoonshotAI Kimi Latest
moonshot/kimi-latest
5 gaps | Benchmark: missing
~moonshotai | $2.00 | $5.00 | $4.40 | 262.1K | 262.1K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
LiteLLM | OpenRouter
OpenAI GPT Latest
~openai/gpt-latest
5 gaps | Benchmark: missing
~openai | $5.00 | $30.00 | $25.00 | 1.1M | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
OpenRouter
OpenAI GPT Mini Latest
~openai/gpt-mini-latest
5 gaps | Benchmark: missing
~openai | $0.75 | $4.50 | $3.75 | 400K | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
OpenRouter
J2 Light
j2-light
9 gaps | Benchmark: missing
ai21 | $3.00 | $3.00 | $3.00 | 8.2K | 8.2K
Reason: Unknown | Tools: Unknown | Vision: Unknown | Cache: Unknown
No score | No score | No public latency | No score
LiteLLM
J2 Mid
j2-mid
9 gaps | Benchmark: missing
ai21 | $10.00 | $10.00 | $10.00 | 8.2K | 8.2K
Reason: Unknown | Tools: Unknown | Vision: Unknown | Cache: Unknown
No score | No score | No public latency | No score
LiteLLM
J2 Ultra
j2-ultra
9 gaps | Benchmark: missing
ai21 | $15.00 | $15.00 | $15.00 | 8.2K | 8.2K
Reason: Unknown | Tools: Unknown | Vision: Unknown | Cache: Unknown
No score | No score | No public latency | No score
LiteLLM
Jamba Large 1.6
jamba-large-1.6
9 gaps | Benchmark: missing
ai21 | $2.00 | $8.00 | $6.80 | 256K | 256K
Reason: Unknown | Tools: Unknown | Vision: Unknown | Cache: Unknown
No score | No score | No public latency | No score
LiteLLM
Jamba Mini 1.6
jamba-mini-1.6
9 gaps | Benchmark: missing
ai21 | $0.20 | $0.40 | $0.36 | 256K | 256K
Reason: Unknown | Tools: Unknown | Vision: Unknown | Cache: Unknown
No score | No score | No public latency | No score
LiteLLM
Jamba Mini 1.7
jamba-mini-1.7
9 gaps | Benchmark: missing
ai21 | $0.20 | $0.40 | $0.36 | 256K | 256K
Reason: Unknown | Tools: Unknown | Vision: Unknown | Cache: Unknown
No score | No score | No public latency | No score
LiteLLM
Aion-1.0
aion-labs/aion-1.0
6 gaps | Benchmark: missing
aion-labs | $4.00 | $8.00 | $7.20 | 131.1K | 32.8K
Reason: Yes | Tools: No | Vision: No | Cache: Unknown
No score | No score | No public latency | No score
OpenRouter
Aion-1.0-Mini
aion-labs/aion-1.0-mini
6 gaps | Benchmark: missing
aion-labs | $0.70 | $1.40 | $1.26 | 131.1K | 32.8K
Reason: Yes | Tools: No | Vision: No | Cache: Unknown
No score | No score | No public latency | No score
OpenRouter
Aion-2.0
aion-labs/aion-2.0
5 gaps | Benchmark: missing
aion-labs | $0.80 | $1.60 | $1.44 | 131.1K | 32.8K
Reason: Yes | Tools: No | Vision: No | Cache: Yes
No score | No score | No public latency | No score
OpenRouter
Aion-RP 1.0 (8B)
aion-labs/aion-rp-llama-3.1-8b
6 gaps | Benchmark: missing
aion-labs | $0.80 | $1.60 | $1.44 | 32.8K | 32.8K
Reason: No | Tools: No | Vision: No | Cache: Unknown
No score | No score | No public latency | No score
OpenRouter
Magnum v4 72B
anthracite-org/magnum-v4-72b
6 gaps | Benchmark: missing
anthracite-org | $3.00 | $5.00 | $4.60 | 32.8K | 2K
Reason: No | Tools: No | Vision: No | Cache: Unknown
No score | No score | No public latency | No score
OpenRouter
Claude 4 Opus 20250514
claude-4-opus-20250514
5 gaps | Benchmark: missing
anthropic | $15.00 | $75.00 | $63.00 | 200K | 32K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
LiteLLM
Claude 4 Sonnet 20250514
claude-4-sonnet-20250514
5 gaps | Benchmark: missing
anthropic | $3.00 | $15.00 | $12.60 | 1M | 64K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
LiteLLM
Claude Opus 4 6 20260205
claude-opus-4-6-20260205
5 gaps | Benchmark: missing
anthropic | $5.00 | $25.00 | $21.00 | 1M | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
LiteLLM
Claude Opus 4 7 20260416
claude-opus-4-7-20260416
5 gaps | Benchmark: missing
anthropic | $5.00 | $25.00 | $21.00 | 1M | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
LiteLLM
Claude Opus 4.7 (Fast)
anthropic/claude-opus-4.7-fast
5 gaps | Benchmark: missing
anthropic | $30.00 | $150.00 | $126.00 | 1M | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
OpenRouter
Claude Opus 4.8 (Fast)
anthropic/claude-opus-4.8-fast
5 gaps | Benchmark: missing
anthropic | $10.00 | $50.00 | $42.00 | 1M | 128K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
OpenRouter
CodeLlama 34b Instruct Hf
anyscale/codellama/CodeLlama-34b-Instruct-hf
9 gaps | Benchmark: missing
anyscale | $1.00 | $1.00 | $1.00 | 4.1K | 4.1K
Reason: Unknown | Tools: Unknown | Vision: Unknown | Cache: Unknown
No score | No score | No public latency | No score
LiteLLM
CodeLlama 70b Instruct Hf
anyscale/codellama/CodeLlama-70b-Instruct-hf
9 gaps | Benchmark: missing
anyscale | $1.00 | $1.00 | $1.00 | 4.1K | 4.1K
Reason: Unknown | Tools: Unknown | Vision: Unknown | Cache: Unknown
No score | No score | No public latency | No score
LiteLLM
Gemma 7b It
anyscale/google/gemma-7b-it
8 gaps | Benchmark: missing
anyscale | $0.15 | $0.15 | $0.15 | 8.2K | 8.2K
Reason: Unknown | Tools: Yes | Vision: Unknown | Cache: Unknown
No score | No score | No public latency | No score
LiteLLM
Llama 2 13b Chat Hf
anyscale/meta-llama/Llama-2-13b-chat-hf
9 gaps | Benchmark: missing
anyscale | $0.25 | $0.25 | $0.25 | 4.1K | 4.1K
Reason: Unknown | Tools: Unknown | Vision: Unknown | Cache: Unknown
No score | No score | No public latency | No score
LiteLLM
Llama 2 70b Chat Hf
anyscale/meta-llama/Llama-2-70b-chat-hf
9 gaps | Benchmark: missing
anyscale | $1.00 | $1.00 | $1.00 | 4.1K | 4.1K
Reason: Unknown | Tools: Unknown | Vision: Unknown | Cache: Unknown
No score | No score | No public latency | No score
LiteLLM
Llama 2 7b Chat Hf
anyscale/meta-llama/Llama-2-7b-chat-hf
9 gaps | Benchmark: missing
anyscale | $0.15 | $0.15 | $0.15 | 4.1K | 4.1K
Reason: Unknown | Tools: Unknown | Vision: Unknown | Cache: Unknown
No score | No score | No public latency | No score
LiteLLM
Mixtral 8x22B Instruct V0.1
anyscale/mistralai/Mixtral-8x22B-Instruct-v0.1
8 gaps | Benchmark: missing
anyscale | $0.90 | $0.90 | $0.90 | 65.5K | 65.5K
Reason: Unknown | Tools: Yes | Vision: Unknown | Cache: Unknown
No score | No score | No public latency | No score
LiteLLM
Zephyr 7b Beta
anyscale/HuggingFaceH4/zephyr-7b-beta
9 gaps | Benchmark: missing
anyscale | $0.15 | $0.15 | $0.15 | 16.4K | 16.4K
Reason: Unknown | Tools: Unknown | Vision: Unknown | Cache: Unknown
No score | No score | No public latency | No score
LiteLLM
Coder Large
arcee-ai/coder-large
7 gaps | Benchmark: missing
arcee-ai | $0.50 | $0.80 | $0.74 | 32.8K | No public limit
Reason: No | Tools: No | Vision: No | Cache: Unknown
No score | No score | No public latency | No score
OpenRouter
Trinity Mini
arcee-ai/trinity-mini
6 gaps | Benchmark: missing
arcee-ai | $0.045 | $0.15 | $0.129 | 131.1K | 131.1K
Reason: Yes | Tools: Yes | Vision: No | Cache: Unknown
No score | No score | No public latency | No score
OpenRouter
Virtuoso Large
arcee-ai/virtuoso-large
6 gaps | Benchmark: missing
arcee-ai | $0.75 | $1.20 | $1.11 | 131.1K | 64K
Reason: No | Tools: Yes | Vision: No | Cache: Unknown
No score | No score | No public latency | No score
OpenRouter
Codex Mini
azure/codex-mini
5 gaps | Benchmark: missing
azure | $1.50 | $6.00 | $5.10 | 200K | 100K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
LiteLLM
Computer Use Preview
azure/computer-use-preview
5 gaps | Benchmark: missing
azure | $3.00 | $12.00 | $10.20 | 8.2K | 1K
Reason: Yes | Tools: Yes | Vision: Yes | Cache: No
No score | No score | No public latency | No score
LiteLLM
GPT 35 Turbo 16k 0613
azure/gpt-35-turbo-16k-0613
8 gaps | Benchmark: missing
azure | $3.00 | $4.00 | $3.80 | 16.4K | 4.1K
Reason: Unknown | Tools: Yes | Vision: Unknown | Cache: Unknown
No score | No score | No public latency | No score
LiteLLM
GPT 4 32k
azure/gpt-4-32k
9 gaps | Benchmark: missing
azure | $60.00 | $120.00 | $108.00 | 32.8K | 4.1K
Reason: Unknown | Tools: Unknown | Vision: Unknown | Cache: Unknown
No score | No score | No public latency | No score
LiteLLM
GPT 4 32k 0613
azure/gpt-4-32k-0613
9 gaps | Benchmark: missing
azure | $60.00 | $120.00 | $108.00 | 32.8K | 4.1K
Reason: Unknown | Tools: Unknown | Vision: Unknown | Cache: Unknown
No score | No score | No public latency | No score
LiteLLM
GPT 4 Turbo Vision Preview
azure/gpt-4-turbo-vision-preview
8 gaps | Benchmark: missing
azure | $10.00 | $30.00 | $26.00 | 128K | 4.1K
Reason: Unknown | Tools: Unknown | Vision: Yes | Cache: Unknown
No score | No score | No public latency | No score
LiteLLM
GPT 4.1 2025 04 14
azure/us/gpt-4.1-2025-04-14
6 gaps | Benchmark: missing
azure | $2.20 | $8.80 | $7.48 | 1M | 32.8K
Reason: Unknown | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
LiteLLM
GPT 4.1 Mini 2025 04 14
azure/us/gpt-4.1-mini-2025-04-14
6 gaps | Benchmark: missing
azure | $0.44 | $1.76 | $1.496 | 1M | 32.8K
Reason: Unknown | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
LiteLLM
GPT 4.1 Nano 2025 04 14
azure/us/gpt-4.1-nano-2025-04-14
6 gaps | Benchmark: missing
azure | $0.11 | $0.44 | $0.374 | 1M | 32.8K
Reason: Unknown | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
LiteLLM
GPT 4.5 Preview
azure/gpt-4.5-preview
6 gaps | Benchmark: missing
azure | $75.00 | $150.00 | $135.00 | 128K | 16.4K
Reason: Unknown | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
LiteLLM
GPT 4o 2024 08 06
azure/eu/gpt-4o-2024-08-06
6 gaps | Benchmark: missing
azure | $2.75 | $11.00 | $9.35 | 128K | 16.4K
Reason: Unknown | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
LiteLLM
GPT 4o 2024 08 06
azure/global-standard/gpt-4o-2024-08-06
6 gaps | Benchmark: missing
azure | $2.50 | $10.00 | $8.50 | 128K | 16.4K
Reason: Unknown | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
LiteLLM
GPT 4o 2024 08 06
azure/global/gpt-4o-2024-08-06
6 gaps | Benchmark: missing
azure | $2.50 | $10.00 | $8.50 | 128K | 16.4K
Reason: Unknown | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
LiteLLM
GPT 4o 2024 08 06
azure/us/gpt-4o-2024-08-06
6 gaps | Benchmark: missing
azure | $2.75 | $11.00 | $9.35 | 128K | 16.4K
Reason: Unknown | Tools: Yes | Vision: Yes | Cache: Yes
No score | No score | No public latency | No score
LiteLLM
Showing first 250 rows after filters for browser performance. Tighten filters to inspect the rest.
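
The third price in each row above appears to be the blended price per 1M tokens under the dashboard's current 20% input / 80% output weighting. The following is a minimal sketch of that arithmetic, not the dashboard's actual code: the function name `blended_price` and the `input_share` parameter are hypothetical, and the sample prices are copied from rows in this table.

```python
def blended_price(input_per_m: float, output_per_m: float,
                  input_share: float = 0.20) -> float:
    """Weighted average of input and output price per 1M tokens.

    input_share is the assumed fraction of tokens that are input tokens;
    the remainder is treated as output tokens (hypothetical parameter).
    """
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    return input_share * input_per_m + (1.0 - input_share) * output_per_m


# (input $/1M, output $/1M) pairs taken from rows in the table above.
rows = {
    "~anthropic/claude-haiku-latest": (1.00, 5.00),
    "gemini/gemini-flash-latest": (0.30, 2.50),
    "azure/us/gpt-4.1-mini-2025-04-14": (0.44, 1.76),
}

for model_id, (inp, out) in rows.items():
    print(f"{model_id}: ${blended_price(inp, out):.3f}")
```

With the default 20/80 split this reproduces the listed blended values ($4.20, $2.06, and $1.496). Passing a larger `input_share`, for example `blended_price(1.00, 5.00, input_share=0.80)`, approximates what moving the slider toward a prompt-heavy workload would do.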