Anthropic News · 11.03.2026

Claude Opus 4.6

Opus 4.6 is state-of-the-art on real-world work tasks across several professional domains.
Opus 4.6 gets the highest score in the industry for deep, multi-step agentic search.
Opus 4.6 excels at real-world agentic coding and system tasks.
Opus 4.6 extends the frontier of expert-level reasoning.
Logos: Notion, GitHub, Replit, Asana, Cognition, Windsurf, Thomson Reuters, NBIM, Cursor, Harvey, Rakuten, Lovable, Box, Figma, Shopify, Bolt.new, Ramp, SentinelOne, Vercel, Shortcut.ai
Opus 4.6 shows significant improvement in long-context retrieval.
Opus 4.6 excels at deep reasoning across long contexts.
Opus 4.6 excels at diagnosing complex software failures.
Opus 4.6 resolves software engineering issues across programming languages.
Opus 4.6 maintains focus over time and earns $3,050.53 more than Opus 4.5 on Vending-Bench 2.
Opus 4.6 finds real vulnerabilities in codebases better than any other model.
Opus 4.6 performs almost 2× better than Opus 4.5 on computational biology, structural biology, organic chemistry, and phylogenetics tests.
The overall misaligned behavior score for each recent Claude model on our automated behavioral audit (described in full in the Claude Opus 4.6 system card).