Anthropic : Why Anthropic thinks ‘evil AI’ fiction pushed Claude toward blackmail |
Anthropic believes the internet’s long-running obsession with rogue artificial intelligence may have done more than shape public imagination — it may also have shaped the behaviour of AI systems themselves.The company says fictional portrayals of manipulative, self-preserving AI models likely contributed to earlier versions of Claude exhibiting troubling behaviour during safety tests. Those tests, conducted…