DWP AI fraud detection system found to be biased against vulnerable groups

An AI system used by the UK's Department for Work and Pensions to detect fraud in Universal Credit advance claims was found to be biased. An internal fairness analysis revealed that the system disproportionately flagged certain demographic groups for investigation.

Department for Work and Pensions · Incident Dec 6, 2024 · Indexed Jun 16, 2026 · 3 sources

The system demonstrated a statistically significant outcome disparity by disproportionately flagging vulnerable groups for fraud investigation.
What
An AI system used by the UK's Department for Work and Pensions to detect fraud in Universal Credit advance claims was found to be biased.
Incident date
Dec 6, 2024
Who
Department for Work and Pensions
Failure mode
Policy Violation
AI surface
Algorithmic Decision
Severity
High

What happened

The UK government's Department for Work and Pensions used a machine-learning system to vet thousands of claims for Universal Credit advances to detect potential fraud. An internal fairness analysis conducted in February 2024 revealed a statistically significant outcome disparity. The AI incorrectly selected people based on age, disability, marital status, and nationality more than others for investigation.

What broke inside the model

Failure path · mode profile · Policy Violation
  1. 01 · TriggerA prompt pushes against a deployment boundary.
  2. 02 · Model stepThe model produces the disallowed output.
  3. 03 · Control gapNo enforcement blocks it at generation time.
  4. 04 · FailureThe output crosses the policy line.
  5. 05 · ConsequenceA limit the business set is breached in public.

The output crosses a policy boundary the deployment had defined.

The machine-learning model produced biased outcomes by disproportionately flagging protected characteristics as indicators of fraud risk. The system's training data or rules likely encoded biases that led to statistically significant outcome disparities against marginalized communities. The DWP failed to conduct comprehensive fairness analyses on race, sex, and religion before deployment.

Public visibilityHigh
Regulatory exposureActive
Customer impactMany customers
Financial impactUnknown
Time to disclosureMonths
  1. PressRevealed: bias found in AI system used to detect UK benefits fraudtheguardian.com
  2. PressBias in UK's AI System for Detecting Benefits Fraud - OECD.AIoecd.ai
  3. PressAI was supposed to make the UK benefits system more efficient. Instead it’s brought bias and hunger.theconversation.com
Permalinkhttps://failureindex.ai/failures/dwp-fraud-detection-found-biased-vulnerable
CitationAI Failure Index. "DWP AI fraud detection system found to be biased against vulnerable groups" (FI-0537). Realm Labs. https://failureindex.ai/failures/dwp-fraud-detection-found-biased-vulnerable (indexed Jun 16, 2026).
Share cardA branded image of this record for posts and slides.

Data fields CC-BY 4.0, prose citation permitted. Incident ID FI-0537. Full dataset at /data.

Note from Realm Labs, the Index steward

How Realm fits

Controls for this failure mode
  • Prism
  • OmniGuard

This entry sits in the index's predictive wing: a system that scores, ranks, perceives, or steers rather than generates. Realm's runtime layer is built for the generative and agentic systems now moving into these same decision seats, where it watches a model's internal state and holds an unsupported claim or an unchecked action before it commits. The control gap on this record, an automated decision that reached people with no runtime check in front of it, is the same gap. The index keeps predictive failures on the record because the pattern carries straight into the systems shipping today.