X (Twitter) AI failures

Every documented AI failure involving X (Twitter) on the AI Failure Index, classified by the mechanism that broke.

Failures: 7
Highest severity: High
Span: 2020 to 2026
Failure modes: 3

By failure mode

By severity

AI-generated images spread false Maduro capture claim on X

Following the alleged capture of Nicolás Maduro in January 2026, AI-generated images and videos circulated widely on X, falsely depicting his capture and subsequent celebrations. These visuals were identified as sophisticated fabrications, some detected with AI watermarks, and contributed to widespread misinformation.

Confidence: Medium (multi-source)

X3 sourcesPressPublicJan 2026

FI-0432Cross-industryHigh

Brand & Safety Incident

X algorithm amplified right-wing and extreme content in the UK

Investigations and academic research documented that X’s recommendation/feed algorithm systematically promoted right‑wing and, in many cases, extreme content to UK users. Sky News’ controlled experiment (reported via AIAAIC and GIJN) found a majority share of political posts shown to test accounts came from right‑wing or extreme accounts, and a 2026 peer‑reviewed Nature study found X’s algorithm promotes conservative content relative to a chronological feed. Multiple independent sources report these findings publicly.

Confidence: High (multi-source, primary)

X (formerly Twitter)4 sourcesPrimaryPublicNov 2025

FI-0643Cross-industryHigh

Hallucination

X's Grok AI generates fake Iran missile strike headline

Grok AI on X generated a false headline claiming that Iran had launched missile strikes on Tel Aviv. This misinformation was then promoted via X's trending Explore tab, misleading a large number of users.

Confidence: Medium (multi-source)

X2 sourcesPressPublicApr 2024

FI-0624Cross-industryHigh

Brand & Safety Incident

X fails to prevent viral spread of Bobbi Althoff deepfake pornography

Nonconsensual AI-generated pornography of Bobbi Althoff went viral on X in February 2024. The incident underscored critical gaps in the platform's ability to detect and moderate nonconsensual intimate imagery.

Confidence: Medium (multi-source)

X (Twitter)2 sourcesPressPublicFeb 2024

FI-0573Cross-industryHigh

Brand & Safety Incident

Twitter automated moderation linked to surge in harmful content

Twitter shifted to AI-driven content moderation after significantly reducing its human moderation staff, leading to a reported surge in hate speech. The transition highlighted the limitations of automated systems in managing nuanced harmful content without human oversight.

Confidence: High (multi-source, primary)

Twitter3 sourcesPrimaryPublicDec 2022

FI-0600Cross-industryHigh

Brand & Safety Incident

Twitter Bots Comprised Nearly Half of COVID-19 Discussion Accounts

A Carnegie Mellon University study found that bots comprised up to 60 percent of Twitter accounts participating in COVID-19 discussions. These bots were used to amplify calls to reopen the country, undermining the integrity of public discourse.

Confidence: High (multi-source, primary)

Twitter2 sourcesPrimaryPublicMay 2020

FI-0459SaaSMedium

Agentic Action Error

Twitter Japan suspends accounts of critics of Prime Minister Suga

In June-July 2021 multiple accounts critical of Prime Minister Suga were temporarily frozen by Twitter Japan and later restored. Twitter Japan told reporters the incidents were caused by its AI-powered account-flagging system misidentifying accounts as hijacked or spam. The events drew public criticism and media coverage but no public regulatory enforcement action is documented in the cited sources.

Confidence: Medium (multi-source)

Twitter Japan (Twitter, Inc.)3 sourcesPressPublicJun 2021

See how Realm catches these failure modes at runtime.

Book a Demo