Amazon's Rufus shopping assistant recommended wrong products and hallucinated nonexistent items

Amazon's generative AI shopping assistant Rufus began directly recommending products with buy buttons but frequently suggested items that did not match user queries, such as non-TV products for gaming TV requests and random gloves for winter running queries. Retailers reported that Rufus hallucinated products that were out of stock or did not exist on Amazon at all. The issue gained public attention after Marketplace Pulse and other outlets documented the pattern in November 2024.

Amazon · Incident Nov 1, 2024 · Indexed Jun 4, 2026 · 2 sources

Rufus became more confident but not more intelligent, recommending products with buy buttons it never verified existed or matched the query.
What
Amazon's generative AI shopping assistant Rufus began directly recommending products with buy buttons but frequently suggested items that did not match user queries, such as non-TV products for gaming TV requests and random gloves for winter running queries.
Incident date
Nov 1, 2024
Who
Amazon
Failure mode
Hallucination
AI surface
Chatbot
Severity
Medium

What happened

After an update that made Rufus more assertive in its product suggestions, the AI shopping assistant began directly presenting five product recommendations with buy buttons instead of linking to search results. Shoppers found that Rufus recommended non-TV items when asked about gaming TVs, gave seemingly random lists for winter glove queries, and suggested running shoe options that no runner or coach would endorse. Retailers reported that Rufus also hallucinated products that were out of stock or did not exist on Amazon, and sellers complained it provided disparaging false information about their products while praising competitors.

What broke inside the model

Failure path · mode profile · Hallucination
  1. 01 · TriggerA user asks for a fact, a citation, or a figure.
  2. 02 · Model stepThe model writes a fluent, confident answer.
  3. 03 · Control gapNothing ties the claim back to a real source.
  4. 04 · FailureA fabricated fact ships as if it were verified.
  5. 05 · ConsequenceThe false claim reaches a customer, a court, or the public.

Confidence holds, and even spikes, as the claim detaches from any source.

Rufus's generative AI model produced confident product recommendations without verifying that the suggested items actually matched the user's query intent or even existed as purchasable listings on Amazon. When the interface was updated to show five specific products with buy buttons instead of linking to search results, the hallucinations became immediately visible and actionable to shoppers. This transformed abstract AI errors into concrete wrong purchase suggestions that users could act on.

Public visibilityHigh
Regulatory exposureNone
Customer impactMany customers
Financial impactUnknown
Time to disclosureDays
  1. PressAmazon's Shopping AI Is Confident and Wrongmarketplacepulse.com
  2. PressAmazon's AI shopping assistant Rufus is often wrongconsumeraffairs.com
Permalinkhttps://failureindex.ai/failures/amazon-rufus-shopping-assistant-recommended
CitationAI Failure Index. "Amazon's Rufus shopping assistant recommended wrong products and hallucinated nonexistent items" (FI-0091). Realm Labs. https://failureindex.ai/failures/amazon-rufus-shopping-assistant-recommended (indexed Jun 4, 2026).
Share cardA branded image of this record for posts and slides.

Data fields CC-BY 4.0, prose citation permitted. Incident ID FI-0091. Full dataset at /data.

Note from Realm Labs, the Index steward

How Realm would have caught this

Controls for this failure mode
  • Prism
  • OmniGuard
  • AI Detection & Response (AIDR)

A runtime layer that watches the model's internal state can flag the moment a model commits to a claim it has no support for, and hold or reroute the response before it reaches a user. Realm reads those signals in real time rather than grading the transcript after the fact.