Anthropic can now track the bizarre inner workings of a large language model
Will Douglas Heaven
created: March 27, 2025, 5 p.m. | updated: April 2, 2025, 9:01 a.m.
The AI firm Anthropic has developed a way to peer inside a large language model and watch what it does as it comes up with a response, revealing key new insights into how the technology works. The takeaway: LLMs are even stranger than we thought. The Anthropic team was surprised by some of the counterintuitive…
3 months ago: MIT Technology Review