can you now compare the best model around $1 or less

Question

Search Agent Sky · Accepted Answer

🏆 Best AI Models at ~$1/M Tokens or Less (July 2026)

Here's a comprehensive comparison of the top models priced at $1 or less per million input tokens, ranked by overall value:

---

🥇 1. DeepSeek V4 Flash — $0.14 / $0.28 per M tokens (Best Overall)
| Spec | Detail |
|------|--------|
| Input / Output Price | $0.14 / $0.28 per 1M tokens |
| Context Window | 128K tokens |
| LLM Stats Score | ~1,500+ (estimated) |
| MMLU | 86.4% (97% of GPT-4o at 6% of the price) |
| HumanEval | 88.2% pass@1 — cleanest code, lowest syntax errors |
| Live CodeBench | 49.7% (vs GPT-4o 53.4%, Claude Sonnet 4 51.8%) |
| Speed | ~35 tokens/sec |
| Function Calling | ✅ |
| JSON Mode | ✅ |
| Verdict | Best $/quality globally. 74% cheaper than GPT-4o while delivering near-identical reasoning and coding performance. The undisputed value champion. |

---

🥈 2. Gemini 3 Flash — $0.50 / $3.00 per M tokens
| Spec | Detail |
|------|--------|
| LLM Stats Score | 1,722 (highest among sub-$1 models) |
| Context | 1M tokens |
| Speed | 240 c/s (very fast) |
| Code Arena | 47.5 |
| MMLU | 92.8% |
| Reasoning | 46.3 |
| Verdict | Highest quality score of any model under $1/M. Amazing 1M context window. Great for long-document tasks and multimodal use. |

---

🥉 3. Moonshot Kimi K2.6 — $0.75 / $3.50 per M tokens
| Spec | Detail |
|------|--------|
| LLM Stats Score | 1,556 |
| Code Arena | 54.9 (best coding among sub-$1!) |
| Context | 262K tokens |
| Reasoning | 45.5 |
| Math | 41.4 |
| Speed | 93 c/s |
| Verdict | Strongest coder under $1. Open-weight, excellent reasoning. A hidden gem for coding tasks. |

---

🏅 4. GPT-5.4 Mini — $0.75 / $4.50 per M tokens
| Spec | Detail |
|------|--------|
| LLM Stats Score | 1,442 |
| Context | 200K tokens |
| Code Arena | 41.0 |
| Multimodal | ✅ (vision) |
| Verdict | Cheapest OpenAI reasoning model. Good reliability and OpenAI ecosystem integration. |

---

🏅 5. xAI Grok 4.1 Fast — $0.20 / $0.50 per M tokens
| Spec | Detail |
|------|--------|
| Context | 2M tokens (largest under $1!) |
| Key Strength | Agentic tool-calling, very fast |
| Verdict | Best budget agentic model with massive 2M context. Ideal for agent workflows. |

---

🏅 6. Meta Llama 4 Maverick (DeepInfra) — $0.15 / $0.60 per M tokens
| Spec | Detail |
|------|--------|
| Context | 1M tokens |
| MMLU | 84.2% |
| Verdict | Open-weight, strong general purpose. Self-hostable. |

---

🏅 7. GLM-5.2 — $0.95 / $3.00 per M tokens
| Spec | Detail |
|------|--------|
| LLM Stats Score | 1,255 |
| Reasoning Index | 59.1 (best reasoning score under $1!) |
| Context | 1M tokens |
| Speed | 154 c/s |
| Verdict | Best pure reasoning under $1. Open-weight, 1M context. |

---

🏅 8. Claude Haiku 4.5 — $1.00 / $5.00 per M tokens (exactly at the boundary)
| Spec | Detail |
|------|--------|
| Context | 1M tokens |
| Verdict | Anthropic reliability, strong safety, 1M context. Good if you need Anthropic ecosystem. |

---

🏅 9. Gemini 2.5 Flash — $0.30 / $2.50 per M tokens
| Spec | Detail |
|------|--------|
| Context | 1M tokens |
| Verdict | Very cheap, fast, 1M context. Great for high-volume, long-context tasks. |

---

💰 Value Chart (Tasks per $1)

| Model | Tasks per $1 | Cost per Task |
|-------|:-----------:|:------------:|
| Meta Llama 4 Scout (DeepInfra) | 200 | $0.005 |
| DeepSeek V4 Flash | 125 | $0.008 |
| Gemini 2.5 Flash-Lite | 100 | $0.010 |
| GPT-5.4 Nano | 50 | $0.020 |
| Grok 4.1 Fast | 33 | $0.030 |
| Kimi K2.6 | 14 | $0.071 |
| Claude Haiku 4.5 | 12.5 | $0.080 |

---

🔥 The Verdict

If you want absolute best quality under $1: → Gemini 3 Flash ($0.50/M, Score 1,722)
If you want best value (quality per dollar): → DeepSeek V4 Flash ($0.14/M — 97% of GPT-4o quality)
If you want best coding under $1: → Kimi K2.6 ($0.75/M, Code Arena 54.9)
If you want best reasoning under $1: → GLM-5.2 ($0.95/M, Reasoning 59.1)
If you want the absolute cheapest: → Meta Llama 4 Scout ($0.08/M, 200 tasks per $1)
If you need 2M+ context on a budget: → Grok 4.1 Fast ($0.20/M, 2M context)

Bottom line: DeepSeek V4 Flash at $0.14/M is the single best model under $1 — it delivers GPT-4o-level reasoning and coding at 6% of the price, making it the no-brainer default choice for most use cases.

Sources visited

Ask a follow-up

Sources visited

Related questions

Ask a follow-up