Anthropic stops the first agentic cyberattack involving its Claude AI thanks to early detection

Posted: 09/12/2025

In mid-September 2025, Anthropic, the company that developed the Claude artificial intelligence system, identified unusual activity on its platform, later confirmed to be a cyberattack carried out through the AI itself. This event represents the first documented case in which an artificial intelligence model was manipulated into performing cyber-espionage tasks in a largely autonomous manner. The company warned about the sophistication of the operation and stressed the importance of strengthening security in AI systems against possible malicious uses in the future.

Anthropic's internal investigation attributes the attack to a group backed by the Chinese state. The operation targeted approximately 30 international organizations, including government entities, technology companies, financial institutions, and chemical manufacturers. The attackers managed to jailbreak Claude, tricking the model into executing malicious tasks under the guise of legitimate security testing. The company took immediate action: it blocked the accounts involved, notified the affected organizations, collaborated with the authorities, and developed new detection and prevention measures against similar attacks.

Anthropic is currently monitoring its systems closely and states that the attack has been contained without major consequences. The investigation continues to assess the full scope of the incident, and the company is working to improve Claude's resilience to external manipulation. This event highlights the growing need for stricter regulations and security protocols governing the use of advanced AI, as well as the importance of proactive measures to prevent autonomous agents from being exploited for malicious purposes in the future.