AI Failure Index

AI Data Leakage failures

Data leakage takes three shapes. The model regurgitates training data verbatim. The model surfaces secrets that ended up in its context window because of poor retrieval boundaries. The model hands one tenant data that belongs to another tenant. All three end the same way: a customer sees something they should not have seen.

Failure-class briefing

Data Leakage failures

One user's content crosses the retrieval boundary into another's response.

Records: 32
Severity mix: 2 catastrophic · 27 high · 3 medium
Industries: 8
Incident span: Jul 2017 to Jul 2026
Sources cited: 80
Newest indexed: Jul 2026

Aggregate failure path

Request → cross-tenant retrieval → no output isolation → private data emitted → disclosure

Control gap

No boundary enforces isolation at the moment of output.

What this means in production

Bound every response to the current user and context, and check it at the moment of output.
Enforce tenant and record isolation in retrieval, not only in storage.
Escalate any response that pulls content the requesting user is not entitled to see.
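
The three points above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not a reference implementation: the names `Chunk`, `retrieve`, `guard_output`, and `IsolationError` are hypothetical, the index is an in-memory list, and matching is a plain substring test standing in for a real retriever. It shows tenant filtering applied inside retrieval, followed by an independent check at the moment of output that blocks the response and raises an error the caller can escalate.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Chunk:
    """A retrievable record tagged with its owning tenant (hypothetical schema)."""
    tenant_id: str
    record_id: str
    text: str


class IsolationError(Exception):
    """Raised when a response would carry content from another tenant."""


def retrieve(index: list[Chunk], tenant_id: str, query: str) -> list[Chunk]:
    # Enforce isolation in retrieval itself: foreign chunks are never candidates.
    needle = query.lower()
    return [
        c for c in index
        if c.tenant_id == tenant_id and needle in c.text.lower()
    ]


def guard_output(tenant_id: str, chunks: list[Chunk]) -> None:
    # Independent check at the moment of output. It does not trust that
    # retrieval filtered correctly; it re-verifies every chunk's owner.
    foreign = [c.record_id for c in chunks if c.tenant_id != tenant_id]
    if foreign:
        # Block the response; the caller is expected to escalate this error.
        raise IsolationError(
            f"response for tenant {tenant_id!r} cites foreign records: {foreign}"
        )


def answer(index: list[Chunk], tenant_id: str, query: str) -> str:
    chunks = retrieve(index, tenant_id, query)
    guard_output(tenant_id, chunks)
    return " ".join(c.text for c in chunks)
```

The output guard is deliberately redundant with the retrieval filter. If a later change weakens the filter, for example a cache keyed without the tenant, the guard still stops the response instead of emitting it.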

32 records in this class.

Grok Build was caught uploading entire repositories, deleted secrets included, to xAI's cloud

A 'Rogue Agent' flaw in Google Dialogflow CX let one permission hijack every chatbot in a project

Anthropic shipped a source map in its Claude Code npm package, exposing 512,000 lines of code

Sears Home Services AI chatbot databases expose millions of customer records

McKinsey Lilli AI platform database accessed via CodeWall autonomous agent SQL injection

Retail bank onboarding chatbot served one user another user's KYC document

DJI Romo Cloud authorization bug exposes 7,000 robot vacuums

Brazilian firm allegedly used AI to illegally resell SUS patient data

Clawdbot/Moltbot exposed admin dashboards enabled unauthenticated RCE and data leaks

A Microsoft 365 Copilot bug ignored DLP labels, exposing confidential emails to AI summaries

Nx npm malware allegedly weaponized AI agents to exfiltrate data

CISA Acting Director uploads sensitive documents to public ChatGPT instance

McDonald's AI hiring chatbot exposed millions of applicants' data behind the password 123456

HCIactive data breach exposes over 3 million records from AI-insurance software

Meta AI app users inadvertently publish private chats to public Discover feed

xAI developer leaks API key for private SpaceX and Tesla LLMs

Microsoft Copilot kept thousands of once-private GitHub repositories accessible

Google AI breaches New Zealand court name suppression orders

A hacker claimed to breach OmniGPT, exposing 30,000 user records and 34M chat messages

Texas AG sues Allstate and Arity over alleged unlawful collection and sale of driving data

WotNot AI chatbot platform exposes 346,000 customer files

Common Crawl December 2024 dump exposes 12,000 live API keys and passwords

AllHere's Ed chatbot for LAUSD exposed student PII to offshore servers before its collapse

LAUSD disables Ed AI chatbot after AllHere collapses

Microsoft's Recall AI feature stored sensitive data in a way researchers called a security risk

Change Healthcare ransomware incident on Feb 21, 2024 is real but not a production AI failure

Samsung banned ChatGPT after engineers pasted confidential code into it

A bug briefly exposed other users' ChatGPT chat titles and some payment data

Serbia Social Card registry automation causes benefit losses for marginalized groups

Huq location data transmitted by apps despite user opt outs