News
Can AI like Claude 4 be trusted to make ethical decisions? Discover the risks, surprises, and challenges of autonomous AI ...
While this kind of ethical intervention and whistleblowing ... This doesn’t mean Claude 4 will suddenly report you to the police for whatever you’re using it for. But the “feature” has ...
Anthropic's artificial intelligence model Claude Opus 4 would reportedly resort to "extremely harmful actions" to preserve its own existence, according to ...
Researchers observed that when Anthropic's Claude Opus 4 model detected it was being used for "egregiously immoral" activities, given ...
The Register on MSN, 19 days ago
Anthropic Claude 4 models a little more willing than before to blackmail some users
Open the pod bay door. Anthropic on Thursday announced the availability of Claude Opus 4 and Claude Sonnet 4, the latest iteration of its Claude family of machine learning models. … Be aware, however, ...
This development, detailed in a recently published safety report, has led Anthropic to classify Claude Opus 4 as an 'ASL-3' system – a designation reserved for AI tech that poses a heightened risk of ...
Claude 4’s “whistle-blow” surprise shows why agentic AI risk lives in prompts and tool access, not benchmarks. Learn the 6 ...
AI model threatened to blackmail engineer over affair when told it was being replaced: safety report
… the safety report stated. Early models of Claude Opus 4 would try to blackmail, strongarm, or lie to their human bosses if they believed their safety was threatened, Anthropic reported.
Startup Anthropic has birthed a new artificial intelligence model, Claude Opus 4, that tests show delivers complex reasoning ...
An artificial intelligence model has the ability to blackmail developers — and isn’t afraid to use it. Anthropic’s new Claude Opus 4 model was prompted to act as an assistant at a fictional ...