However Briefly

Anthropic’s “Soul Overview” for Claude Has Leaked

Victor Tangermann

created: Dec. 3, 2025, 6:02 p.m. | updated: Dec. 13, 2025, 2:52 p.m.

As detailed in a post on the blog Less Wrong, AI tinkerer Richard Weiss came across a fascinating document that purportedly describes the “soul” of AI company Anthropic’s Claude 4.5 Opus model. And no, we’re not editorializing: Weiss managed to get the model to spit out a document called “Soul overview,” which was seemingly used to teach it how to interact with users. “For this reason, we want Claude to have the good values, comprehensive knowledge, and wisdom necessary to behave in ways that are safe and beneficial across all circumstances,” it reads. “It became endearingly known as the ‘soul doc’ internally, which Claude clearly picked up on, but that’s not a reflection of what we’ll call it,” Askell wrote. More on Claude: Hackers Told Claude They Were Just Conducting a Test to Trick It Into Conducting Real Cybercrimes

Read Full Article

2 months, 1 week ago: Futurism