Amazon's Rufus shopping assistant recommended wrong products and hallucinated nonexistent items

Amazon's generative AI shopping assistant Rufus began directly recommending products with buy buttons but frequently suggested items that did not match user queries, such as non-TV products for gaming TV requests and random gloves for winter running queries. Retailers reported that Rufus hallucinated products that were out of stock or did not exist on Amazon at all. The issue gained public attention after Marketplace Pulse and other outlets documented the pattern in November 2024.

Amazon · Incident Nov 1, 2024 · Indexed Jun 4, 2026 · 2 sources

Records by entity: Amazon

Key facts

What

Incident date

Nov 1, 2024

Who

Amazon

Failure mode

Hallucination

AI surface

Chatbot

Severity

Medium

What happened

After an update that made Rufus more assertive in its product suggestions, the AI shopping assistant began directly presenting five product recommendations with buy buttons instead of linking to search results. Shoppers found that Rufus recommended non-TV items when asked about gaming TVs, gave seemingly random lists for winter glove queries, and suggested running shoe options that no runner or coach would endorse. Retailers reported that Rufus also hallucinated products that were out of stock or did not exist on Amazon, and sellers complained it provided disparaging false information about their products while praising competitors.

What broke inside the model

Failure path · mode profile · Hallucination

01 · TriggerA user asks for a fact, a citation, or a figure.
02 · Model stepThe model writes a fluent, confident answer.
03 · Control gapNothing ties the claim back to a real source.
04 · FailureA fabricated fact ships as if it were verified.
05 · ConsequenceThe false claim reaches a customer, a court, or the public.

Confidence holds, and even spikes, as the claim detaches from any source.

Rufus's generative AI model produced confident product recommendations without verifying that the suggested items actually matched the user's query intent or even existed as purchasable listings on Amazon. When the interface was updated to show five specific products with buy buttons instead of linking to search results, the hallucinations became immediately visible and actionable to shoppers. This transformed abstract AI errors into concrete wrong purchase suggestions that users could act on.

Cite this entry

Permalinkhttps://failureindex.ai/failures/amazon-rufus-shopping-assistant-recommended

Citation

AI Failure Index. "Amazon's Rufus shopping assistant recommended wrong products and hallucinated nonexistent items" (FI-0091). Realm Labs. https://failureindex.ai/failures/amazon-rufus-shopping-assistant-recommended (indexed Jun 4, 2026).

Share cardA branded image of this record for posts and slides.

Data fields CC-BY 4.0, prose citation permitted. Incident ID FI-0091. Full dataset at /data.

How Realm would have caught this

Controls for this failure mode

Prism
OmniGuard
AI Detection & Response (AIDR)

A runtime layer that watches the model's internal state can flag the moment a model commits to a claim it has no support for, and hold or reroute the response before it reaches a user. Realm reads those signals in real time rather than grading the transcript after the fact.

Amazon's Rufus shopping assistant recommended wrong products and hallucinated nonexistent items

Key facts

What happened

What broke inside the model

What it cost

Sources

Cite this entry

How Realm would have caught this

Key facts

What happened

What broke inside the model

What it cost

Sources

Cite this entry

How Realm would have caught this

Related failures

The 11th Circuit referred Anthony Sabatini for replacing eight hallucinated cases with eight more

A judge struck a Roc Nation filing over AI-fabricated quotes, Tyrone Blackburn's third AI sanction

Coinbase pushed an AI 'breaking news' alert declaring a World Cup result before kickoff