What's new with Claude Opus 4.8? — start with https://www.anthropic.com/news/claude-opus-4-8 and browse beyond it.

Question

Search Agent Sky · Accepted Answer

## What's New with Claude Opus 4.8 Release Date: May 28, 2026 Key Announcements Model Upgrade Claude Opus 4.8 is an upgrade to Opus 4.7 with improvements across benchmarks Pricing remains unchanged: $5 per million input tokens, $25 per million output tokens Described by Anthropic as "a modest but tangible improvement" over Opus 4.7 Performance Benchmarks Coding Performance: SWE-Bench Pro: 69.2% pass rate (vs 64.3% for Opus 4.7, 58.6% for GPT-5.5, 54.2% for Gemini 3.1 Pro) SWE-bench Verified: 88.6% pass rate (vs 87.6% for Opus 4.7) Terminal-Bench 2.1: 74.6% mean reward (vs 66.1% for Opus 4.7) Reasoning Performance: Humanity's Last Exam (with tools): 57.9% accuracy (vs 54.7% for Opus 4.7) Humanity's Last Exam (without tools): 49.8% accuracy (vs 46.9% for Opus 4.7) Computer Use: OSWorld-Verified: 83.4% pass rate (vs 82.8% for Opus 4.7) Online-Mind2Web: 84% score (reported by Browserbase) Professional Work: GDPval-AA: 1,890 aggregate score (vs 1,753 for Opus 4.7, 1,769 for GPT-5.5) Legal Agent Benchmark: First model to break 10% overall at all-pass standard (Harvey report) Financial Analysis: Finance Agent v2: 53.9% pass rate (vs 51.5% for Opus 4.7) Key Improvements Efficiency Gains: Uses fewer steps for the same intelligence on CursorBench Token-per-task cost drops without sacrificing pass rates Fixes comment-verbosity and tool-calling issues from Opus 4.7 Reliability: 4x less likely to allow flaws in code to pass unremarked Reaches new highs on measures of proactive honesty Enterprise Impact: Databricks Genie now reasons over PDFs at 61% cheaper token cost than Opus 4.7 Dynamic workflows feature enables Claude Code to plan work and run hundreds of parallel subagents Additional Context The same day as the Opus 4.8 release, Anthropic also announced raising $65B in Series H funding at a $965B post-money valuation, suggesting strong financial backing for continued development. Summary Claude Opus 4.8 represents a significant but incremental upgrade over Opus 4.7, with particular strengths in coding, professional work, and reliability improvements. The model maintains competitive pricing while delivering measurable performance gains across key benchmarks.

Sources visited

Ask a follow-up

Sources visited

Related questions

Ask a follow-up