Claude’s new model is more ‘honest’ when it messes up

Published: 1 week ago (May 28, 2026 at 01:00 PM EDT)

1 min read

Source: The Verge

Claude Opus 4.8

Anthropic is releasing Claude Opus 4.8 on Thursday, touting the model’s “honesty.” According to Anthropic, it trains “all [its] models to be honest – for instance, to avoid making claims that they can’t support.” The company notes that “a general problem with AI models is that they sometimes jump to conclusions, confidently presenting their work as making progress despite thin evidence.”

Early testers have found that Opus 4.8 “is more likely to flag uncertainties about its work and less likely to make unsupported claims.” In the company’s evaluations, Opus 4.8 is “around 4× less likely than its predecessor …”

Read the full story at The Verge.

Claude’s new model is more ‘honest’ when it messes up

Claude Opus 4.8

Related posts

Anthropic Releases Opus 4.8 With New 'Dynamic Workflow' Tool

Claude Opus 4.8: Anthropic makes a more honest AI

Claude Opus 4.8 launches today with agentic improvements, new features

Anthropic Launches Claude Opus 4.8 With Gains in Coding and Honesty