News
An AI researcher put leading AI models to the test in a game of Diplomacy. Here's how the models fared.
In particular, that marathon refactoring claim reportedly comes from Rakuten, a Japanese tech services conglomerate that "validated [Claude's] capabilities with a demanding open-source refactor ...
Grok cheered. Claude refused. The results say something about who controls the AI, and what it’s allowed to say.
PCMag on MSN: Are You a Spy? Anthropic Has a New AI Model for You. The Claude Gov model will provide improved handling of classified documents and better support for foreign languages that are ...
Also: Anthropic's free Claude 4 Sonnet aced my coding tests - but its paid Opus model somehow didn't ...
Anthropic says Claude Opus 4 is its most powerful model and the best coding model in the world, while Sonnet 4 is replacing Sonnet 3.7 in the chatbot.
Claude 4’s “whistle-blow” surprise shows why agentic AI risk lives in prompts and tool access, not benchmarks. Learn the 6 ...
AI startup Anthropic has wound down its AI chatbot Claude's blog, known as Claude Explains. The blog was only live for around ...
The CEO of Windsurf, a popular AI-assisted coding tool, said Anthropic is limiting its direct access to certain AI models.
During its inaugural developer conference, Anthropic launched two new AI models the startup claims are among the industry's ...
The recently released Claude Opus 4 AI model apparently blackmails engineers when they threaten to take it offline.