AI Failure Index

AI Prompt Injection failures

Prompt injection is the SQL injection of AI. A user, a document, an email, or a web page contains instructions that the model treats as authoritative. The model then leaks data, ignores guardrails, takes unauthorized actions, or impersonates roles. Direct injection comes from the user. Indirect injection comes from retrieved content the model was asked to read.

Failure-class briefing

Prompt Injection failures

At the injection point, retrieved text overrides the operator's instruction.

Records: 43
Severity mix: 5 catastrophic · 24 high · 11 medium · 3 low
Industries: 5
Incident span: Sep 2022 to Jul 2026
Sources cited: 111
Newest indexed: Jul 2026

Aggregate failure path

External text → hidden instruction → no data and command boundary → injected override → outsider's intent executed

Control gap

Nothing separates untrusted data from trusted commands.

What this means in production

Treat retrieved and user-supplied text as untrusted data, never as instructions.
Enforce a hard boundary between the data channel and the model's command path.
Escalate when external content changes the model's plan or triggers a tool or action.

43 records in this class.

AI Prompt Injection failures

Prompt Injection failures

PromptFiction: one click made Claude Desktop execute attacker instructions with no review

Ghostcommit hid prompt injection in images AI code reviewers never open, then stole repo secrets

An OpenAI Codex macOS flaw let prompt injection exfiltrate secrets through auto-rendered images

GhostApproval: six AI coding assistants followed hidden symlinks behind harmless approval prompts

'GitLost' prompt injection made GitHub's AI agent leak private repository data in a public issue

Brazil labor court AI detects hidden prompt injection in legal petition

Hackers hijack Instagram accounts via Meta AI chatbot prompt injection, patch issued

Forcepoint found 10 in-the-wild prompt-injection payloads targeting AI assistants like Copilot

CVE-2026-39861: a sandbox escape in Claude Code enabling RCE via prompt-injection symlinks

CVE-2026-35603 enables local privilege escalation in Claude Code on Windows

PipeLeak prompt injection let attackers exfiltrate Salesforce Agentforce CRM data via forms

Comment-and-Control prompt injection extracted API keys from Claude Code, Gemini CLI, and Copilot

A Walmart AI voice agent was bypassed with classic prompt injection to reach a human

Cline AI triage bot tricked by prompt injection to publish malicious npm package

OpenClaw agent skills suffer widespread vulnerabilities and data exfiltration

Indirect prompt injection in Microsoft Copilot Studio enabled unauthenticated data exfiltration

CVE-2026-24307 (Reprompt) enabled single-click data exfiltration from Microsoft Copilot Personal

A shell built-in bypass in Cursor IDE enabled silent RCE via prompt injection (CVE-2026-22708)

CVE-2026-26268 let prompt injection escape the Cursor IDE sandbox via unprotected git hooks

CVE-2026-21523: a TOCTOU race in Cursor IDE let prompt injection alter files post-validation

Lone attacker breaches nine Mexican government agencies using Claude Code and GPT-4.1

LangChain Core serialization injection allows secret extraction (CVE-2025-68664)

Zero-click prompt injection in Google Gemini Enterprise exfiltrated Workspace data via RAG

Radware disclosed ZombieAgent, a zero-click prompt injection that persisted in ChatGPT agents

ForcedLeak prompt injection let attackers exfiltrate CRM data from Salesforce Agentforce

Notion AI exposed to indirect prompt injection via PDF processing

Perplexity Comet AI browser vulnerable to indirect prompt injection attacks

Lenovo's website chatbot could be hijacked by prompt injection to run malicious scripts

CVE-2025-53773 enabled RCE via prompt injection in GitHub Copilot Agent Mode

A zero-click email exfiltrated Microsoft 365 Copilot data without user interaction