However Briefly

Why Anthropic’s New AI Model Sometimes Tries to ‘Snitch’

Kylie Robison

created: May 28, 2025, 7:40 p.m. | updated: June 1, 2025, 5:02 a.m.

Bowman deleted the post shortly after he shared it, but the narrative about Claude’s whistleblower tendencies had already escaped containment. “Claude is a snitch,” became a common refrain in some tech circles on social media. I wasn't surprised to see some kind of blow up.”Bowman’s observations about Claude were part of a major model update that Anthropic announced last week. “This is not a new behavior, but is one that Claude Opus 4 will engage in somewhat more readily than prior models,” the report said. The model is the first one that Anthropic has released under its “ASL-3” distinction, meaning Anthropic considers it to be “significantly higher risk” than the company’s other models.

Read Full Article

1 week, 1 day ago: WIRED