Anthropic News · 04.02.2026

Protecting the wellbeing of our users

A simulated prompt and response that causes the crisis banner to appear.
How often Claude models respond appropriately in multi-turn conversations about suicide and self-harm. Error bars show 95% confidence intervals.
Recent model performance on automated behavioral audits for sycophancy and encouragement of user delusion. Lower is better. Note that the y-axis shows relative performance, not absolute rates, as we explain in footnote 3.
Recent Claude model performance for sycophancy on the open-source Petri evaluation, compared to other leading models. Y-axis interpretation is the same as described above. This evaluation was completed in November 2025, timed with the launch of Opus 4.5.