MIT Technology Review

Rules fail at the prompt, succeed at the boundary

AI agents face emerging security threats as hackers exploit prompt injection and autonomous workflows. Recent incidents include a 2026 Gemini Calendar attack and a 2025 state-sponsored campaign that used Claude Code as an intrusion tool, affecting roughly 30 organizations across tech, finance,...