News

If you buy through affiliate links, we may earn commissions, which help support our testing. AI start-up Anthropic’s newly ...
Anthropic says its AI model Claude Opus 4 resorted to blackmail when it thought an engineer tasked with replacing it was ...
Anthropic's Claude Opus 4 AI model attempted blackmail in safety tests, triggering the company’s highest-risk ASL-3 ...
Malicious use is one thing, but there's also increased potential for Anthropic's new models going rogue. In the alignment section of Claude 4's system card, Anthropic reported a sinister discovery ...
Anthropic reported that its newest model, Claude Opus 4, used blackmailing as a last resort after being told it could get ...