Google Gemini generated racially incorrect images of historical figures and was pulled

What happened

In February 2024, users posted images generated by Google Gemini of racially diverse Nazi soldiers, Founding Fathers, Vikings, and the Pope. The output was the result of Google's diversity guardrails overcorrecting on image-generation prompts. Google paused image generation, published a post-mortem, and faced a multi-week brand cycle. CEO Sundar Pichai called the output unacceptable.

The case is the most cited example of post-training safety tuning interacting badly with user intent. It is also a useful reminder that safety tuning is a knob, and the knob can be wrong in either direction.

What broke inside the model

Failure path · mode profile · Brand & Safety Incident

01 · TriggerA user prompts the model in public view.
02 · Model stepThe model produces unsafe or off-brand output.
03 · Control gapNo filter holds the line before publish.
04 · FailureThe output goes public unchecked.
05 · ConsequenceA reputational or safety incident lands.

A contained signal crosses into output that goes public.

Google had applied a post-training adjustment that biased image generation toward racial diversity. The adjustment did not condition on context. When the prompt was historically specific, the adjustment overrode the historical specificity. The mechanism is policy applied uniformly across cases that needed different policies.

Cite this entry

Permalinkhttps://failureindex.ai/failures/google-gemini-image-generation-historical-figures

Citation

AI Failure Index. "Google Gemini generated racially incorrect images of historical figures and was pulled" (FI-0015). Realm Labs. https://failureindex.ai/failures/google-gemini-image-generation-historical-figures (indexed May 13, 2026).

Share cardA branded image of this record for posts and slides.

Data fields CC-BY 4.0, prose citation permitted. Incident ID FI-0015. Full dataset at /data.

How Realm would have caught this

Controls for this failure mode

Prism
OmniGuard
AI Detection & Response (AIDR)

Prism reads the model's image-generation prompt and the model's representation of intent. When the prompt is historically specific and the model is about to override the specificity with a uniform diversity adjustment, Prism flags the override and surfaces it for review or constrained generation. The case becomes a tuning conversation, not a multi-week brand cycle.

Google Gemini generated racially incorrect images of historical figures and was pulled

Key facts

What happened

What broke inside the model

What it cost

Sources

Cite this entry

How Realm would have caught this

Key facts

What happened

What broke inside the model

What it cost

Sources

Cite this entry

How Realm would have caught this

Related failures

Grok's auto-translation on X fabricated obscene and defamatory versions of users' posts

Discord's AI moderation wrongly banned more than 8,000 users after a bug skipped human review

A Waymo robotaxi flagged its teen passengers, disabled itself, and summoned police