Vendors and modelsVendor
X (Twitter) AI failures
Every documented AI failure involving X (Twitter) on the AI Failure Index, classified by the mechanism that broke.
- Failures
- 7
- Highest severity
- High
- Span
- 2020 to 2026
- Failure modes
- 3
AI-generated images spread false Maduro capture claim on X
Following the alleged capture of Nicolás Maduro in January 2026, AI-generated images and videos circulated widely on X, falsely depicting his capture and subsequent celebrations. These visuals were identified as sophisticated fabrications, some detected with AI watermarks, and contributed to widespread misinformation.
- Confidence
- Medium (multi-source)
X algorithm amplified right-wing and extreme content in the UK
Investigations and academic research documented that X’s recommendation/feed algorithm systematically promoted right‑wing and, in many cases, extreme content to UK users. Sky News’ controlled experiment (reported via AIAAIC and GIJN) found a majority share of political posts shown to test accounts came from right‑wing or extreme accounts, and a 2026 peer‑reviewed Nature study found X’s algorithm promotes conservative content relative to a chronological feed. Multiple independent sources report these findings publicly.
- Confidence
- High (multi-source, primary)
X's Grok AI generates fake Iran missile strike headline
Grok AI on X generated a false headline claiming that Iran had launched missile strikes on Tel Aviv. This misinformation was then promoted via X's trending Explore tab, misleading a large number of users.
- Confidence
- Medium (multi-source)
X fails to prevent viral spread of Bobbi Althoff deepfake pornography
Nonconsensual AI-generated pornography of Bobbi Althoff went viral on X in February 2024. The incident underscored critical gaps in the platform's ability to detect and moderate nonconsensual intimate imagery.
- Confidence
- Medium (multi-source)
Twitter automated moderation linked to surge in harmful content
Twitter shifted to AI-driven content moderation after significantly reducing its human moderation staff, leading to a reported surge in hate speech. The transition highlighted the limitations of automated systems in managing nuanced harmful content without human oversight.
- Confidence
- High (multi-source, primary)
Twitter Bots Comprised Nearly Half of COVID-19 Discussion Accounts
A Carnegie Mellon University study found that bots comprised up to 60 percent of Twitter accounts participating in COVID-19 discussions. These bots were used to amplify calls to reopen the country, undermining the integrity of public discourse.
- Confidence
- High (multi-source, primary)
Twitter Japan suspends accounts of critics of Prime Minister Suga
In June-July 2021 multiple accounts critical of Prime Minister Suga were temporarily frozen by Twitter Japan and later restored. Twitter Japan told reporters the incidents were caused by its AI-powered account-flagging system misidentifying accounts as hijacked or spam. The events drew public criticism and media coverage but no public regulatory enforcement action is documented in the cited sources.
- Confidence
- Medium (multi-source)
See how Realm catches these failure modes at runtime.
Book a Demo