## Best AI Models at ~$1/M Tokens or Less (July 2026)

Here's a comparison of the top models priced at $1 or less per million input tokens, ranked by overall value:

---
### 1. DeepSeek V4 Flash: **$0.14 / $0.28 per M tokens** (Best Overall)
| Spec | Detail |
|------|--------|
| **Input / Output Price** | **$0.14 / $0.28** per 1M tokens |
| **Context Window** | 128K tokens |
| **LLM Stats Score** | ~1,500+ (estimated) |
| **MMLU** | **86.4%** (97% of GPT-4o at 6% of the price) |
| **HumanEval** | **88.2%** pass@1; cleanest code and fewest syntax errors |
| **LiveCodeBench** | **49.7%** (vs. GPT-4o 53.4%, Claude Sonnet 4 51.8%) |
| **Speed** | ~35 tokens/sec |
| **Function Calling** | ✅ |
| **JSON Mode** | ✅ |
| **Verdict** | **Best $/quality globally.** About 94% cheaper than GPT-4o (6% of the price) while delivering comparable reasoning and coding performance. The undisputed value champion. |
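
To make the per-million pricing concrete, here is a minimal sketch that turns the listed input and output prices into a per-request cost. The function name `request_cost` and the 10,000 / 2,000 token workload are illustrative assumptions, not figures from any provider.

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_price_per_m: float, output_price_per_m: float) -> float:
    """Return the USD cost of one request given per-1M-token prices."""
    return (input_tokens * input_price_per_m
            + output_tokens * output_price_per_m) / 1_000_000


# DeepSeek V4 Flash prices from the table above: $0.14 input, $0.28 output.
# The 10,000 input / 2,000 output token split is a hypothetical workload.
cost = request_cost(10_000, 2_000, 0.14, 0.28)
print(f"${cost:.5f} per request")
```

Because output tokens are billed at twice the input rate here, output-heavy workloads will cost proportionally more than the headline input price suggests.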
---
### 2. Gemini 3 Flash: **$0.50 / $3.00 per M tokens**
| Spec | Detail |
|------|--------|
| **LLM Stats Score** | **1,722** (highest among sub-$1 models) |
| **Context** | **1M tokens** |
| **Speed** | 240 c/s (very fast) |
| **Code Arena** | 47.5 |
| **MMLU** | 92.8% |
| **Reasoning** | 46.3 |
| **Verdict** | **Highest quality score** of any model under $1/M. Amazing 1M context window. Great for long-document tasks and multimodal use. |
---
### 3. Moonshot Kimi K2.6: **$0.75 / $3.50 per M tokens**
| Spec | Detail |
|------|--------|
| **LLM Stats Score** | **1,556** |
| **Code Arena** | **54.9** (best coding among sub-$1!) |
| **Context** | 262K tokens |
| **Reasoning** | 45.5 |
| **Math** | 41.4 |
| **Speed** | 93 c/s |
| **Verdict** | **Strongest coder under $1.** Open-weight, excellent reasoning. A hidden gem for coding tasks. |
---
### 4. GPT-5.4 Mini: **$0.75 / $4.50 per M tokens**
| Spec | Detail |
|------|--------|
| **LLM Stats Score** | 1,442 |
| **Context** | 200K tokens |
| **Code Arena** | 41.0 |
| **Multimodal** | ✅ (vision) |
| **Verdict** | Cheapest OpenAI reasoning model. Good reliability and OpenAI ecosystem integration. |
---
### 5. xAI Grok 4.1 Fast: **$0.20 / $0.50 per M tokens**
| Spec | Detail |
|------|--------|
| **Context** | **2M tokens** (largest under $1!) |
| **Key Strength** | Agentic tool-calling, very fast |
| **Verdict** | Best budget agentic model with massive 2M context. Ideal for agent workflows. |
---
### 6. Meta Llama 4 Maverick (DeepInfra): **$0.15 / $0.60 per M tokens**
| Spec | Detail |
|------|--------|
| **Context** | 1M tokens |
| **MMLU** | 84.2% |
| **Verdict** | Open-weight, strong general purpose. Self-hostable. |
---
### 7. GLM-5.2: **$0.95 / $3.00 per M tokens**
| Spec | Detail |
|------|--------|
| **LLM Stats Score** | 1,255 |
| **Reasoning Index** | **59.1** (best reasoning score under $1!) |
| **Context** | 1M tokens |
| **Speed** | 154 c/s |
| **Verdict** | Best pure reasoning under $1. Open-weight, 1M context. |
---
### 8. Claude Haiku 4.5: **$1.00 / $5.00 per M tokens** (exactly at the boundary)
| Spec | Detail |
|------|--------|
| **Context** | 1M tokens |
| **Verdict** | Anthropic reliability, strong safety, 1M context. Good if you need Anthropic ecosystem. |
---
### 9. Gemini 2.5 Flash: **$0.30 / $2.50 per M tokens**
| Spec | Detail |
|------|--------|
| **Context** | 1M tokens |
| **Verdict** | Very cheap, fast, 1M context. Great for high-volume, long-context tasks. |
---
### Value Chart (Tasks per $1)
| Model | Tasks per $1 | Cost per Task |
|-------|:-----------:|:------------:|
| **Meta Llama 4 Scout** (DeepInfra) | **200** | $0.005 |
| **DeepSeek V4 Flash** | **125** | $0.008 |
| **Gemini 2.5 Flash-Lite** | 100 | $0.010 |
| **GPT-5.4 Nano** | 50 | $0.020 |
| **Grok 4.1 Fast** | 33 | $0.030 |
| **Kimi K2.6** | 14 | $0.071 |
| **Claude Haiku 4.5** | 12.5 | $0.080 |
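
The two columns in the chart are reciprocals of each other: cost per task is simply $1 divided by tasks per $1. A short sketch that reproduces the right-hand column from the left-hand one, using only the values in the table (the helper name `cost_per_task` is illustrative):

```python
# Tasks per $1, copied from the value chart above.
tasks_per_dollar = {
    "Meta Llama 4 Scout (DeepInfra)": 200,
    "DeepSeek V4 Flash": 125,
    "Gemini 2.5 Flash-Lite": 100,
    "GPT-5.4 Nano": 50,
    "Grok 4.1 Fast": 33,
    "Kimi K2.6": 14,
    "Claude Haiku 4.5": 12.5,
}


def cost_per_task(tasks: float) -> float:
    """Return the USD cost of one task, rounded to three decimals."""
    return round(1 / tasks, 3)


for model, tasks in tasks_per_dollar.items():
    print(f"{model}: ${cost_per_task(tasks):.3f} per task")
```

The chart does not say what a "task" consists of, so these figures are only comparable with each other, not with per-token prices.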
---
### The Verdict

- **If you want the absolute best quality under $1 → Gemini 3 Flash** ($0.50/M, Score 1,722)
- **If you want the best value (quality per dollar) → DeepSeek V4 Flash** ($0.14/M, about 97% of GPT-4o's MMLU score)
- **If you want the best coding under $1 → Kimi K2.6** ($0.75/M, Code Arena 54.9)
- **If you want the best reasoning under $1 → GLM-5.2** ($0.95/M, Reasoning 59.1)
- **If you want the absolute cheapest → Meta Llama 4 Scout** ($0.08/M, 200 tasks per $1)
- **If you need a 2M-token context on a budget → Grok 4.1 Fast** ($0.20/M, 2M context)

**Bottom line: DeepSeek V4 Flash at $0.14/M is the single best model under $1. It delivers near-GPT-4o reasoning and coding at 6% of the price, making it the no-brainer default choice for most use cases.**
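
The rankings above quote input prices, but output prices vary much more between models. As a rough sketch, the snippet below ranks the nine listed models by a blended price per 1M tokens. The 75% input / 25% output token mix is an assumption chosen for illustration, and `blended_price` is a hypothetical helper; adjust `input_share` to match your own traffic.

```python
# (input, output) USD prices per 1M tokens, copied from the sections above.
PRICES = {
    "DeepSeek V4 Flash": (0.14, 0.28),
    "Gemini 3 Flash": (0.50, 3.00),
    "Kimi K2.6": (0.75, 3.50),
    "GPT-5.4 Mini": (0.75, 4.50),
    "Grok 4.1 Fast": (0.20, 0.50),
    "Llama 4 Maverick": (0.15, 0.60),
    "GLM-5.2": (0.95, 3.00),
    "Claude Haiku 4.5": (1.00, 5.00),
    "Gemini 2.5 Flash": (0.30, 2.50),
}


def blended_price(input_price: float, output_price: float,
                  input_share: float = 0.75) -> float:
    """Weighted average price per 1M tokens for an assumed token mix."""
    return input_share * input_price + (1 - input_share) * output_price


# Sort models from cheapest to most expensive under the assumed mix.
ranking = sorted(
    ((model, blended_price(inp, out)) for model, (inp, out) in PRICES.items()),
    key=lambda pair: pair[1],
)

for model, price in ranking:
    print(f"{model}: ${price:.4f} per 1M blended tokens")
```

Under this assumed mix DeepSeek V4 Flash remains the cheapest of the nine, which is consistent with the bottom line above.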