Search Agent Sky
Query
can you now compare the best model around $1 or less
## 🏆 Best AI Models at ~$1/M Tokens or Less (July 2026)

Here's a comprehensive comparison of the top models priced at $1 or less per million input tokens, ranked by overall value:

---

### 🥇 1. DeepSeek V4 Flash - **$0.14 / $0.28 per M tokens** (Best Overall)

| Spec | Detail |
|------|--------|
| **Input / Output Price** | **$0.14 / $0.28** per 1M tokens |
| **Context Window** | 128K tokens |
| **LLM Stats Score** | ~1,500+ (estimated) |
| **MMLU** | **86.4%** (97% of GPT-4o at 6% of the price) |
| **HumanEval** | **88.2%** pass@1; cleanest code, lowest syntax errors |
| **Live CodeBench** | **49.7%** (vs GPT-4o 53.4%, Claude Sonnet 4 51.8%) |
| **Speed** | ~35 tokens/sec |
| **Function Calling** | ✅ |
| **JSON Mode** | ✅ |
| **Verdict** | **Best $/quality globally.** 74% cheaper than GPT-4o while delivering near-identical reasoning and coding performance. The undisputed value champion. |

---

### 🥈 2. Gemini 3 Flash - **$0.50 / $3.00 per M tokens**

| Spec | Detail |
|------|--------|
| **LLM Stats Score** | **1,722** (highest among sub-$1 models) |
| **Context** | **1M tokens** |
| **Speed** | 240 c/s (very fast) |
| **Code Arena** | 47.5 |
| **MMLU** | 92.8% |
| **Reasoning** | 46.3 |
| **Verdict** | **Highest quality score** of any model under $1/M. Amazing 1M context window. Great for long-document tasks and multimodal use. |

---

### 🥉 3. Moonshot Kimi K2.6 - **$0.75 / $3.50 per M tokens**

| Spec | Detail |
|------|--------|
| **LLM Stats Score** | **1,556** |
| **Code Arena** | **54.9** (best coding among sub-$1!) |
| **Context** | 262K tokens |
| **Reasoning** | 45.5 |
| **Math** | 41.4 |
| **Speed** | 93 c/s |
| **Verdict** | **Strongest coder under $1.** Open-weight, excellent reasoning. A hidden gem for coding tasks. |

---

### 🏅 4. GPT-5.4 Mini - **$0.75 / $4.50 per M tokens**

| Spec | Detail |
|------|--------|
| **LLM Stats Score** | 1,442 |
| **Context** | 200K tokens |
| **Code Arena** | 41.0 |
| **Multimodal** | ✅ (vision) |
| **Verdict** | Cheapest OpenAI reasoning model. Good reliability and OpenAI ecosystem integration. |

---

### 🏅 5. xAI Grok 4.1 Fast - **$0.20 / $0.50 per M tokens**

| Spec | Detail |
|------|--------|
| **Context** | **2M tokens** (largest under $1!) |
| **Key Strength** | Agentic tool-calling, very fast |
| **Verdict** | Best budget agentic model with massive 2M context. Ideal for agent workflows. |

---

### 🏅 6. Meta Llama 4 Maverick (DeepInfra) - **$0.15 / $0.60 per M tokens**

| Spec | Detail |
|------|--------|
| **Context** | 1M tokens |
| **MMLU** | 84.2% |
| **Verdict** | Open-weight, strong general purpose. Self-hostable. |

---

### 🏅 7. GLM-5.2 - **$0.95 / $3.00 per M tokens**

| Spec | Detail |
|------|--------|
| **LLM Stats Score** | 1,255 |
| **Reasoning Index** | **59.1** (best reasoning score under $1!) |
| **Context** | 1M tokens |
| **Speed** | 154 c/s |
| **Verdict** | Best pure reasoning under $1. Open-weight, 1M context. |

---

### 🏅 8. Claude Haiku 4.5 - **$1.00 / $5.00 per M tokens** (exactly at the boundary)

| Spec | Detail |
|------|--------|
| **Context** | 1M tokens |
| **Verdict** | Anthropic reliability, strong safety, 1M context. Good if you need Anthropic ecosystem. |

---

### 🏅 9. Gemini 2.5 Flash - **$0.30 / $2.50 per M tokens**

| Spec | Detail |
|------|--------|
| **Context** | 1M tokens |
| **Verdict** | Very cheap, fast, 1M context. Great for high-volume, long-context tasks. |

---

### 💰 Value Chart (Tasks per $1)

| Model | Tasks per $1 | Cost per Task |
|-------|:-----------:|:------------:|
| **Meta Llama 4 Scout** (DeepInfra) | **200** | $0.005 |
| **DeepSeek V4 Flash** | **125** | $0.008 |
| **Gemini 2.5 Flash-Lite** | 100 | $0.010 |
| **GPT-5.4 Nano** | 50 | $0.020 |
| **Grok 4.1 Fast** | 33 | $0.030 |
| **Kimi K2.6** | 14 | $0.071 |
| **Claude Haiku 4.5** | 12.5 | $0.080 |

---

### 🔥 The Verdict

**If you want absolute best quality under $1: → Gemini 3 Flash** ($0.50/M, Score 1,722)

**If you want best value (quality per dollar): → DeepSeek V4 Flash** ($0.14/M, 97% of GPT-4o quality)

**If you want best coding under $1: → Kimi K2.6** ($0.75/M, Code Arena 54.9)

**If you want best reasoning under $1: → GLM-5.2** ($0.95/M, Reasoning 59.1)

**If you want the absolute cheapest: → Meta Llama 4 Scout** ($0.08/M, 200 tasks per $1)

**If you need 2M+ context on a budget: → Grok 4.1 Fast** ($0.20/M, 2M context)

**Bottom line: DeepSeek V4 Flash at $0.14/M is the single best model under $1. It delivers GPT-4o-level reasoning and coding at 6% of the price, making it the no-brainer default choice for most use cases.**
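
The two columns of the value chart are reciprocals of each other (tasks per $1 = 1 / cost per task), and a per-task cost follows from the per-million-token prices once a task size is fixed. The answer does not say what counts as a "task", so the sketch below assumes a hypothetical workload of 10,000 input and 2,000 output tokens; the prices are copied from the tables above, and the function names are illustrative only.

```python
# Prices are (input, output) in USD per 1M tokens, copied from the comparison above.
PRICES = {
    "DeepSeek V4 Flash": (0.14, 0.28),
    "Gemini 3 Flash": (0.50, 3.00),
    "Kimi K2.6": (0.75, 3.50),
    "Grok 4.1 Fast": (0.20, 0.50),
    "Claude Haiku 4.5": (1.00, 5.00),
}


def cost_per_task(model, input_tokens, output_tokens):
    """USD cost of one request at the listed per-million-token prices."""
    in_price, out_price = PRICES[model]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


def tasks_per_dollar(cost):
    """Reciprocal used by the value chart: how many tasks $1 buys."""
    return 1.0 / cost


# Hypothetical workload size (an assumption, not stated in the answer).
IN_TOK, OUT_TOK = 10_000, 2_000

# Print models from cheapest to most expensive for this workload.
for model in sorted(PRICES, key=lambda m: cost_per_task(m, IN_TOK, OUT_TOK)):
    c = cost_per_task(model, IN_TOK, OUT_TOK)
    print(f"{model:20s} ${c:.4f}/task  {tasks_per_dollar(c):7.1f} tasks per $1")
```

Applying `tasks_per_dollar` to the chart's own cost column reproduces its first column (for example, $0.005 gives 200 and $0.080 gives 12.5). The per-task costs printed for the assumed workload will not match the chart, since the chart's task definition is unknown; output-heavy workloads in particular shift the ranking toward models with cheap output tokens.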

Sources visited

1. search: best AI model under $1 per million tokens 2026 comparison
2. navigate: https://ofox.ai/blog/ai-api-pricing-comparison-2026
3. navigate: https://llm-stats.com/leaderboards/llm-leaderboard
4. search: DeepSeek V4 Flash benchmarks quality comparison 2026
5. navigate: https://www.global-apis.com/blog/deepseek-v4-flash-complete-review
6. get_text
Shared by Zhimin Zou · Jul 1, 2026
