
AI Model Pricing Guide 2026: What Every Model Costs on Crazyrouter (and How Much You Save)
AI Model Pricing Guide 2026: What Every Model Costs on Crazyrouter (and How Much You Save)#

AI model pricing changes fast. New models launch every month, providers adjust rates, and keeping track of what you're actually paying per token across multiple providers is a full-time job.
This guide covers 18 of the most popular AI models available through Crazyrouter — with official provider pricing, Crazyrouter's discounted rates, and what it all means for your monthly bill.
Last updated: April 27, 2026
Important: AI model pricing is subject to change. Providers may adjust rates at any time. Always check the latest pricing on the official provider pages and Crazyrouter's pricing page before making decisions. The prices listed here are accurate as of the publication date.
How Crazyrouter Pricing Works#
Crazyrouter offers unified API access to 300+ models from all major providers. Instead of managing separate accounts with OpenAI, Anthropic, Google, xAI, and others, you use one API key and one balance.
The key benefit: Crazyrouter passes through models at a discount ranging from 55% to 100% of official pricing (i.e., 45% off down to full price), depending on the model. Most flagship models from OpenAI, Anthropic, Google, and xAI are available at 55% of official price — that's a 45% discount.
Complete Pricing Table: 18 Models Compared#
Here's the full breakdown. All prices are in USD per 1 million tokens.

Premium Tier ($5+ input)#
| Model | Provider | Official Input | Official Output | Crazyrouter Input | Crazyrouter Output | Discount |
|---|---|---|---|---|---|---|
| claude-opus-4-7 | Anthropic | $5.00 | $25.00 | $2.75 | $13.75 | 55% |
| claude-opus-4-6 | Anthropic | $5.00 | $25.00 | $2.75 | $13.75 | 55% |
| grok-4.1-thinking | xAI | $5.00 | $25.00 | $2.75 | $13.75 | 55% |
These are the heaviest hitters — deep reasoning, complex analysis, and extended thinking models. At official rates, a million output tokens from Claude Opus 4.7 costs 13.75.
Mid-Tier (3 input)#
| Model | Provider | Official Input | Official Output | Crazyrouter Input | Crazyrouter Output | Discount |
|---|---|---|---|---|---|---|
| claude-sonnet-4-6 | Anthropic | $3.00 | $15.00 | $1.65 | $8.25 | 55% |
| claude-sonnet-4-5-20250929 | Anthropic | $3.00 | $15.00 | $1.65 | $8.25 | 55% |
| gpt-5.4 | OpenAI | $2.50 | $15.00 | $1.375 | $7.70 | 55% |
| gpt-4o | OpenAI | $2.50 | $10.00 | $1.375 | $5.50 | 55% |
| grok-4.1 | xAI | $3.00 | $15.00 | $1.65 | $8.25 | 55% |
| gemini-3.1-pro-preview | $2.00 | $12.00 | $1.10 | $6.60 | 55% | |
| gemini-3-pro-preview | $2.00 | $12.00 | $1.10 | $6.60 | 55% | |
| gpt-5.2 | OpenAI | $1.75 | $14.00 | $0.9625 | $7.70 | 55% |
| gpt-5 | OpenAI | $1.25 | $10.00 | $0.6875 | $5.50 | 55% |
| gpt-5.1-codex-max | OpenAI | $1.25 | $10.00 | $0.6875 | $5.50 | 55% |
This is where most production workloads live. Claude Sonnet 4.6 and GPT-5.4 are the workhorses for coding, chat, and general-purpose tasks. The Gemini 3 Pro models offer strong value at 1.10.
Budget Tier (Under $1 input)#
| Model | Provider | Official Input | Official Output | Crazyrouter Input | Crazyrouter Output | Discount |
|---|---|---|---|---|---|---|
| glm-5 | Zhipu AI | $0.60 | $2.08 | $0.60 | $2.08 | 100% |
| MiniMax-M2.7 | MiniMax | $0.30 | $1.20 | $0.30 | $1.20 | 100% |
| gpt-5-mini | OpenAI | $0.25 | $2.00 | $0.1375 | $1.10 | 55% |
| gemini-2.5-flash-lite | $0.10 | $0.40 | $0.055 | $0.22 | 55% | |
| gpt-5-nano | OpenAI | $0.05 | $0.40 | $0.0275 | $0.22 | 55% |
Budget models are perfect for high-volume tasks: classification, extraction, routing, summarization. GPT-5-nano through Crazyrouter costs just $0.0275 per million input tokens — that's essentially free for most use cases. MiniMax M2.7 and GLM-5 are passed through at full official pricing but are already extremely affordable.
Model-by-Model Breakdown#
1. claude-sonnet-4-6#
Provider: Anthropic | Released: February 2026 | Context: 200K tokens
Claude Sonnet 4.6 is Anthropic's latest mid-range model — fast, capable, and the go-to choice for most coding and analysis tasks. It supports both OpenAI-compatible and native Anthropic API formats through Crazyrouter.
| Official | Crazyrouter | |
|---|---|---|
| Input | $3.00/M | $1.65/M |
| Output | $15.00/M | $8.25/M |
| Cache Write (5min) | $3.75/M | — |
| Cache Hit | $0.30/M | — |
Best for: Coding assistance, document analysis, chat applications, tool use.
2. claude-opus-4-6#
Provider: Anthropic | Released: February 2026 | Context: 200K tokens
The Opus tier is Anthropic's most capable model line. Opus 4.6 excels at complex reasoning, nuanced writing, and tasks requiring deep understanding.
| Official | Crazyrouter | |
|---|---|---|
| Input | $5.00/M | $2.75/M |
| Output | $25.00/M | $13.75/M |
Best for: Complex reasoning, research, long-form content, tasks where quality matters more than speed.
3. claude-opus-4-7#
Provider: Anthropic | Released: April 2026 | Context: 200K tokens
The newest Opus model with a new tokenizer (may use up to 35% more tokens for the same text). Top-tier intelligence across all benchmarks.
| Official | Crazyrouter | |
|---|---|---|
| Input | $5.00/M | $2.75/M |
| Output | $25.00/M | $13.75/M |
Note: The new tokenizer means your effective cost per request may be higher than Opus 4.6 for the same input text. Factor this in when comparing.
Best for: The most demanding tasks — advanced reasoning, research synthesis, complex code generation.
4. claude-sonnet-4-5-20250929#
Provider: Anthropic | Released: September 2025 | Context: 200K tokens
The previous-generation Sonnet, still widely used and fully supported. Same pricing as Sonnet 4.6.
| Official | Crazyrouter | |
|---|---|---|
| Input | $3.00/M | $1.65/M |
| Output | $15.00/M | $8.25/M |
Best for: Production systems already built on Sonnet 4.5 that don't need to migrate yet.
5. gpt-5.4#
Provider: OpenAI | Released: April 2026 | Context: 270K tokens
OpenAI's latest flagship. Strong at coding and professional work, with automatic prompt caching at 10% of input price.
| Official | Crazyrouter | |
|---|---|---|
| Input | $2.50/M | $1.375/M |
| Output | $15.00/M | $7.70/M* |
| Cached Input | $0.25/M | — |
*Crazyrouter output pricing reflects the model ratio and completion ratio applied.
Best for: Coding, professional analysis, multimodal tasks.
6. gpt-5.2#
Provider: OpenAI | Released: December 2025 | Context: 200K tokens
A strong mid-generation model. Cheaper than GPT-5.4 with solid performance.
| Official | Crazyrouter | |
|---|---|---|
| Input | $1.75/M | $0.9625/M |
| Output | $14.00/M | $7.70/M |
Best for: Cost-conscious production workloads that don't need the absolute latest model.
7. gpt-5#
Provider: OpenAI | Released: August 2025 | Context: 400K tokens
The original GPT-5 — still OpenAI's best overall quality-to-price ratio for general tasks.
| Official | Crazyrouter | |
|---|---|---|
| Input | $1.25/M | $0.6875/M |
| Output | $10.00/M | $5.50/M |
Best for: General-purpose tasks, multimodal work, large context windows.
8. gpt-5.1-codex-max#
Provider: OpenAI | Released: November 2025 | Context: 200K tokens
The max-tier Codex model optimized for code generation and understanding. Higher output ratio reflects the model's tendency to generate longer, more detailed code.
| Official | Crazyrouter | |
|---|---|---|
| Input | $1.25/M | $0.6875/M |
| Output | $10.00/M | $5.50/M |
Best for: Code generation, code review, automated refactoring, developer tooling.
9. gpt-5-mini#
Provider: OpenAI | Released: August 2025 | Context: 400K tokens
Surprisingly capable for its price. GPT-5-mini delivers strong quality at a fraction of the flagship cost.
| Official | Crazyrouter | |
|---|---|---|
| Input | $0.25/M | $0.1375/M |
| Output | $2.00/M | $1.10/M |
Best for: High-volume chat, classification, summarization, budget-conscious applications.
10. gpt-5-nano#
Provider: OpenAI | Released: August 2025 | Context: 400K tokens
The cheapest model in OpenAI's lineup. Ideal for tasks where speed and cost matter more than peak intelligence.
| Official | Crazyrouter | |
|---|---|---|
| Input | $0.05/M | $0.0275/M |
| Output | $0.40/M | $0.22/M |
Best for: Classification, routing, extraction, high-volume preprocessing, intent detection.
11. gpt-4o#
Provider: OpenAI | Released: May 2024 | Context: 128K tokens
The previous-generation flagship. Still stable, well-understood, and widely integrated.
| Official | Crazyrouter | |
|---|---|---|
| Input | $2.50/M | $1.375/M |
| Output | $10.00/M | $5.50/M |
Best for: Existing integrations, stable production systems, multimodal tasks.
12. grok-4.1#
Provider: xAI | Released: October 2025 | Context: 256K tokens
xAI's standard model. Competitive with Claude Sonnet and GPT-5 on most benchmarks, with a large context window.
| Official | Crazyrouter | |
|---|---|---|
| Input | $3.00/M | $1.65/M |
| Output | $15.00/M | $8.25/M |
Best for: General-purpose tasks, X/Twitter data analysis, alternative to Claude Sonnet.
13. grok-4.1-thinking#
Provider: xAI | Released: October 2025 | Context: 256K tokens
The reasoning variant of Grok 4.1. Uses extended thinking for complex multi-step problems.
| Official | Crazyrouter | |
|---|---|---|
| Input | $5.00/M | $2.75/M |
| Output | $25.00/M | $13.75/M |
Best for: Complex reasoning, math, multi-step analysis, tasks requiring chain-of-thought.
14. gemini-3.1-pro-preview#
Provider: Google | Released: February 2026 | Context: 1M tokens
Google's latest Gemini Pro with a massive 1 million token context window. Strong across all benchmarks.
| Official | Crazyrouter | |
|---|---|---|
| Input | $2.00/M | $1.10/M |
| Output | $12.00/M | $6.60/M |
Best for: Long-document analysis, large codebases, tasks requiring massive context.
15. gemini-3-pro-preview#
Provider: Google | Released: November 2025 | Context: 1M tokens
The previous Gemini 3 Pro. Same pricing as 3.1, still fully supported.
| Official | Crazyrouter | |
|---|---|---|
| Input | $2.00/M | $1.10/M |
| Output | $12.00/M | $6.60/M |
Best for: Production systems already on Gemini 3 Pro.
16. gemini-2.5-flash-lite#
Provider: Google | Released: June 2025 | Context: 1M tokens
Ultra-cheap, ultra-fast. Google's most affordable model with a full 1M context window.
| Official | Crazyrouter | |
|---|---|---|
| Input | $0.10/M | $0.055/M |
| Output | $0.40/M | $0.22/M |
Best for: High-volume tasks, preprocessing, classification, anywhere cost is the primary concern.
17. MiniMax-M2.7#
Provider: MiniMax | Released: March 2026 | Context: 205K tokens
A strong Chinese AI model with competitive benchmark scores (87.4 GPQA). Available at full official pricing through Crazyrouter.
| Official | Crazyrouter | |
|---|---|---|
| Input | $0.30/M | $0.30/M |
| Output | $1.20/M | $1.20/M |
Best for: Budget-friendly general tasks, multilingual applications, Chinese language tasks.
18. glm-5#
Provider: Zhipu AI | Released: February 2026 | Context: 203K tokens
Zhipu AI's flagship model. Strong reasoning capabilities with competitive pricing. Available at full official pricing.
| Official | Crazyrouter | |
|---|---|---|
| Input | $0.60/M | $0.60/M |
| Output | $2.08/M | $2.08/M |
Best for: Chinese language tasks, reasoning, budget-conscious applications needing strong general intelligence.
Cost Comparison: Real-World Scenarios#
Let's put these numbers in context with three common workloads.

Scenario 1: Chatbot (1M input + 500K output tokens/day)#
| Model | Official Daily Cost | Crazyrouter Daily Cost | Monthly Savings |
|---|---|---|---|
| claude-sonnet-4-6 | $10.50 | $5.78 | $142 |
| gpt-5.4 | $10.00 | $5.23 | $143 |
| gpt-5-mini | $1.25 | $0.69 | $17 |
Scenario 2: Code Generation (500K input + 2M output tokens/day)#
| Model | Official Daily Cost | Crazyrouter Daily Cost | Monthly Savings |
|---|---|---|---|
| gpt-5.1-codex-max | $20.63 | $11.34 | $279 |
| claude-sonnet-4-6 | $31.50 | $17.33 | $425 |
| gpt-5.4 | $31.25 | $16.09 | $455 |
Scenario 3: High-Volume Classification (10M input + 1M output tokens/day)#
| Model | Official Daily Cost | Crazyrouter Daily Cost | Monthly Savings |
|---|---|---|---|
| gpt-5-nano | $0.90 | $0.50 | $12 |
| gemini-2.5-flash-lite | $1.40 | $0.77 | $19 |
| gpt-5-mini | $4.50 | $2.48 | $61 |
How to Choose the Right Model#
Maximize quality: Claude Opus 4.7 or GPT-5.4 — both are frontier-class. Opus 4.7 edges ahead on reasoning; GPT-5.4 is strong on coding.
Balance quality and cost: Claude Sonnet 4.6, GPT-5, or Gemini 3.1 Pro Preview. All deliver excellent results at 1.10/M input.
Minimize cost: GPT-5-nano (0.055/M). For slightly better quality at still-low prices, GPT-5-mini at $0.1375/M is hard to beat.
Need massive context: Gemini models offer 1M token context windows. GPT-5 family supports 400K. Both are available through Crazyrouter with the same discounts.
Getting Started#
- Sign up at crazyrouter.com
- Get your API key from the console
- Use the OpenAI-compatible endpoint — no code changes needed if you're already using the OpenAI SDK:
from openai import OpenAI
client = OpenAI(
api_key="your-crazyrouter-key",
base_url="https://crazyrouter.com/v1"
)
response = client.chat.completions.create(
model="claude-sonnet-4-6", # or any of the 300+ models
messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)
# Or use curl directly
curl https://crazyrouter.com/v1/chat/completions \
-H "Authorization: Bearer your-crazyrouter-key" \
-H "Content-Type: application/json" \
-d '{
"model": "gpt-5-mini",
"messages": [{"role": "user", "content": "Hello!"}]
}'
Price Disclaimer#
Prices listed in this article are accurate as of April 27, 2026. AI model pricing is volatile — providers frequently adjust rates as new models launch and competition evolves. Before committing to a model for production use:
- Check the official provider pricing page for the latest rates
- Review Crazyrouter's pricing page for current discounts
- Monitor your actual usage in the Crazyrouter console dashboard
- Consider using multiple models and routing based on task complexity to optimize costs
Crazyrouter's discount rates (55%–100% of official pricing) may also change as provider agreements evolve. The console always shows your real-time per-request cost.
Summary#
Across these 18 models, Crazyrouter saves you 10–45% on most flagship models compared to going direct. The biggest savings come from premium models like Claude Opus (13.75 per M output tokens) and reasoning models like Grok 4.1 Thinking.
For budget models like MiniMax M2.7 and GLM-5, Crazyrouter passes through at official pricing — but the value is in unified access. One API key, one balance, one integration for all 18 models (and 300+ more).
The real power move: use Crazyrouter's model routing to automatically pick the cheapest model that meets your quality threshold for each request. That's where the savings compound.

