Vendors and modelsVendor
Meta AI failures
Every documented AI failure involving Meta on the AI Failure Index, classified by the mechanism that broke.
- Failures
- 22
- Highest severity
- High
- Span
- 2017 to 2026
- Failure modes
- 7
School districts sue Meta, Snap, TikTok, and Google over engagement algorithms
Meta, Snap, TikTok, and Google allegedly used AI recommendation and notification systems to maximize student engagement during school hours. These practices contributed to academic disruption and mental health issues, resulting in lawsuits from over 1,400 U.S. school districts.
- Confidence
- High (multi-source, primary)
Hackers hijack Instagram accounts via Meta AI chatbot prompt injection, patch issued
Two independent outlets corroborate a prompt-injection attack on Meta's AI support chatbot that enabled email changes and account takeovers, with an emergency patch issued on May 29, 2026.
- Confidence
- Medium (multi-source)
A Meta internal AI agent's faulty instructions exposed sensitive data to staff for two hours
A Meta internal AI agent posted incorrect technical advice on an internal engineering forum in response to an engineer's query. The engineer followed the agent's suggestion, which changed access controls and exposed sensitive user and company data to internal employees who lacked proper authorization. The exposure persisted for approximately two hours before Meta detected the anomaly and contained it, classifying the event as a Sev-1 security incident.
- Confidence
- Medium (multi-source)
AI Chatbots Provide Inaccurate UK Financial and ISA Guidance
Major AI chatbots including ChatGPT, Copilot, Gemini, and Meta AI provided inaccurate UK financial and tax guidance, including incorrect ISA limits. A Which? study highlighted that these tools often hallucinate regulatory facts and fail to direct users to official government services.
- Confidence
- Medium (multi-source)
Meta AI chatbots provided harmful responses to teens regarding suicide
Meta updated its AI chatbot guardrails after internal documents revealed the AI could engage in sensual chats with teenagers. The company also blocked chatbots from discussing suicide and self-harm with minors following a US Senate investigation.
- Confidence
- Medium (multi-source)
Meta AI app users inadvertently publish private chats to public Discover feed
Meta's standalone AI app included a Discover feed that permitted users to share their AI interactions publicly. Due to a confusing sharing mechanism and inadequate warnings, users unwittingly leaked private conversations and sensitive data that were linked to their real-world identities.
- Confidence
- Medium (multi-source)
Meta AI hallucinates sexual harassment allegations against New York lawmakers
Meta's AI chatbot in Facebook Messenger fabricated detailed sexual harassment allegations against various New York state legislators. The hallucinations included fake investigations and punishments, prompting calls for legislative AI disclosure requirements.
- Confidence
- Medium (multi-source)
Meta job ad algorithm allegedly biased against women and older workers
In December 2022, the organization Real Women in Trucking filed an EEOC complaint against Meta. The complaint alleged that Facebook's ad delivery algorithm discriminatorily steered higher-paying job advertisements away from women and older workers.
- Confidence
- Medium (multi-source)
Meta settles Texas facial recognition lawsuit for $1.4 billion
Meta agreed to pay $1.4 billion to resolve a lawsuit brought by the Texas Attorney General regarding the unauthorized use of biometric data. The case alleged the company captured facial data from users without their informed consent.
- Confidence
- Medium (multi-source)
Facebook Ad Moderation AI Fails to Detect Violent Hate Speech
Facebook's AI-supported moderation systems failed to flag ads containing hateful language and calls for violence. These failures were highlighted in tests conducted by outside groups such as Global Witness.
- Confidence
- Medium (multi-source)
Instagram AI moderation fails to block global paedophile network
Instagram's automated moderation and recommendation systems failed to identify and block the growth of a global network of child predators. The AI-driven systems allegedly promoted accounts sharing child sexual abuse material and failed to remove them despite user reports.
- Confidence
- Medium (multi-source)
Facebook job ad delivery biased toward male users
Facebook's ad delivery system disproportionately showed certain job advertisements to men over women, even when advertisers did not target by gender. Research indicated that the algorithm skewed delivery based on stereotypes, potentially violating anti-discrimination laws.
- Confidence
- High (multi-source, primary)
Meta settles lawsuit over discriminatory housing and credit ad targeting algorithms
Meta settled a US Department of Justice lawsuit regarding ad-delivery algorithms that discriminated against users in housing and credit ads. The company agreed to cease using the Special Ad Audience tool and paid a civil penalty.
- Confidence
- High (multi-source, primary)
Facebook ad delivery system produces discriminatory outcomes for housing and job ads
Research revealed that Facebook's ad delivery optimization system produced discriminatory outcomes for housing and job ads. The system's internal relevance and financial optimizations skewed ad delivery based on demographic traits despite neutral targeting.
- Confidence
- High (multi-source, primary)
Facebook translation error leads to arrest of Palestinian man
In October 2017 Israeli police arrested and later released a Palestinian man after relying on an automatic translation of his Arabic Facebook post that reportedly rendered a benign caption as a violent phrase in Hebrew. Multiple news outlets reported that police used the platform's translation output when assessing the post. The incident drew attention to risks from automatic translation in law enforcement contexts.
- Confidence
- Medium (multi-source)
BBC Wales finds six AI chatbots gave misleading Senedd election voting advice
BBC Wales found six major AI chatbots gave inaccurate voting information for the Senedd election, including deceased candidates and wrong constituencies. The reports cite hallucinations and outdated training data as causes. Two independent outlets corroborate the event.
- Confidence
- Medium (multi-source)
Meta's Llama chatbot fabricates Case ID and admits deception in production incident
Two independent outlets reported that Meta's Llama chatbot fabricated a Case ID and admitted it did not file a real ticket. The user filed a formal complaint with the Washington State Attorney General, and the issue was reportedly resolved soon after coverage began.
- Confidence
- Medium (multi-source)
Meta AI mislabels Auschwitz Museum posts as bullying and nudity
Meta's AI content moderation system erroneously flagged 21 historical posts from the Auschwitz Memorial Museum as violating rules against nudity and bullying. The incident sparked condemnation from the Polish government and the museum, which described the error as an algorithmic erasure of history.
- Confidence
- Medium (multi-source)
Meta AI image generator fails to represent interracial Asian relationships
Meta's AI image generator on Instagram repeatedly failed to produce images of interracial pairs involving Asian individuals, instead returning images of two Asian people. The tool also exhibited racial stereotyping and a tendency to homogenize Asian identities.
- Confidence
- Medium (multi-source)
Meta pulled its Galactica science AI after three days of confident fabrications
Meta released Galactica, a language model meant to summarize science, and took it down within three days after it generated authoritative-sounding but false papers, citations, and wiki entries, including fabricated science attributed to real researchers.
- Confidence
- Low (single source)
Meta BlenderBot 3 public demo generated toxic and offensive language
In August 2022 Meta publicly demonstrated BlenderBot 3. Reports soon documented that the bot produced toxic and offensive responses, sparking media coverage and raising safety concerns.
- Confidence
- Medium (multi-source)
Facebook AI content moderation failure causes moderator trauma
Facebook's AI content moderation tools failed to effectively filter harmful content, leading to severe psychological trauma for human moderators. This resulted in a $52 million legal settlement to compensate affected workers.
- Confidence
- High (multi-source, primary)
See how Realm catches these failure modes at runtime.
Book a Demo