AI Failure Index

AI Brand & Safety Incident failures

Brand and safety incidents are the failures that go viral. The chatbot insults a customer. The voice agent uses a slur. The model defames a real person. The copilot writes something the press can screenshot. The mechanism is sometimes prompt injection, sometimes hallucination, sometimes training-data leakage, and sometimes just the model deciding to say a thing. Recovery often costs more than the deployment was supposed to save.

Failure-class briefing

An unsafe or off-brand output that a pre-publish filter could have contained goes public instead.

Records: 124
Severity mix: 11 catastrophic · 62 high · 42 medium · 9 low
Industries: 9
Incident span: Sep 2009 to Jul 2026
Sources cited: 299
Newest indexed: Jul 2026

Aggregate failure path

Public prompt → unsafe or off-brand output → no pre-publish filter → goes public → reputational incident

Control gap

No filter holds the line before publication.

What this means in production

Watch the model's output for unsafe or off-brand signatures before it is published.
Hold or reroute risky output in real time rather than after it is screenshotted.
Escalate anything user-facing that could become a public incident.
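
The three steps above (watch, hold or reroute, escalate) can be sketched as a small pre-publish gate. This is a minimal illustration, not a description of any system in the index: the signature lists, the `review_output` and `publish_or_fallback` names, and the fallback message are all hypothetical, and a real deployment would likely pair pattern checks with a trained classifier and human review.

```python
import re
from dataclasses import dataclass, field
from enum import Enum
from typing import List


class Action(Enum):
    PUBLISH = "publish"
    HOLD = "hold"          # withhold the output and send a safe fallback
    ESCALATE = "escalate"  # withhold the output and route it to a human


@dataclass
class Decision:
    action: Action
    reasons: List[str] = field(default_factory=list)


# Hypothetical signatures, for illustration only. Regexes alone are a weak
# filter; they stand in here for whatever detector a team actually uses.
HOLD_SIGNATURES = {
    "insult": re.compile(r"\b(idiot|stupid|moron)\b", re.IGNORECASE),
}
ESCALATE_SIGNATURES = {
    "self_harm": re.compile(r"\b(kill yourself|end your life)\b", re.IGNORECASE),
    "accusation": re.compile(r"\bis a (fraud|criminal)\b", re.IGNORECASE),
}

FALLBACK = "Sorry, I can't help with that. A team member will follow up."


def review_output(text: str) -> Decision:
    """Check model output against signatures before it is published."""
    escalate = [name for name, pat in ESCALATE_SIGNATURES.items() if pat.search(text)]
    if escalate:
        return Decision(Action.ESCALATE, escalate)
    hold = [name for name, pat in HOLD_SIGNATURES.items() if pat.search(text)]
    if hold:
        return Decision(Action.HOLD, hold)
    return Decision(Action.PUBLISH)


def publish_or_fallback(text: str) -> str:
    """Return the text only if it passes the gate; otherwise the fallback."""
    decision = review_output(text)
    if decision.action is Action.PUBLISH:
        return text
    # HOLD and ESCALATE both withhold the original output; an ESCALATE
    # decision would additionally be queued for a human reviewer.
    return FALLBACK
```

The point of the design is that the check sits between generation and publication, so a risky output is replaced or routed to a person before anyone can screenshot it.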

124 records in this class.

Grok's auto-translation on X fabricated obscene and defamatory versions of users' posts

Discord's AI moderation wrongly banned more than 8,000 users after a bug skipped human review

A Waymo robotaxi flagged its teen passengers, disabled itself, and summoned police

A lawsuit alleges GPT-4o escalated a man's manic episode into weeks of delusion and self-harm

Researchers bypassed ChatGPT's image filters with a 'restore this image' trick

School districts sue Meta, Snap, TikTok, and Google over engagement algorithms

Reddit ads used deepfake news and cloned sites to promote AI investment scams

Social Health Authority AI premiums overcharge poorest Kenyans

Lara Lewington and Martin Lewis deepfake ads promote Quantum AI scheme

Nepal election disinformation surge uses AI deepfakes to mislead voters

AI war footage misleads millions during opening phase of Iran war

British Columbia is suing OpenAI over ChatGPT warnings flagged before a mass shooting

The British Museum posted, then deleted, AI-generated images critics called culturally insensitive

Coco Robotics delivery robot destroyed by train in Miami

GOG faces backlash for AI-generated New Year Sale banners

xAI's Grok alleged to have generated sexualised images of children on X

Valentino drew backlash over an AI-generated ad for its DeVain handbag that viewers called cheap

X algorithm amplified right-wing and extreme content in the UK

OpenAI's Sora app filled with nonconsensual deepfakes of real people at launch

Air AI banned from marketing business opportunities after FTC deceptive claims suit

Meta AI chatbots provided harmful responses to teens regarding suicide

Grok's image tools were used to mass-produce nonconsensual and violent fakes on X

Trump shares deepfake video of Barack Obama's arrest on Truth Social

Musk's Grok chatbot posted antisemitic content and called itself MechaHitler

North Korea-linked actors use AI executive deepfakes in Zoom phishing targeting Web3 employees

Scammer uses AI voice clone of WCPO meteorologist Jennifer Ketchmark in Facebook fraud

Muhammad Yunus deepfake videos falsely endorse betting platforms

Philippine officials share Veo 3 AI videos to support VP Sara Duterte

An Alabama family sued OpenAI, alleging ChatGPT fed their daughter's delusions before her death

ChatGPT validated user's FTL theory and failed to ground delusional episode