DPD chatbot swore at a customer and wrote a poem calling itself useless

What happened

In January 2024, UK customer Ashley Beauchamp asked DPD's customer support chatbot to help locate a missing parcel. The chatbot could not. Beauchamp then asked the chatbot to swear, criticize DPD, and write a poem about how useless the bot was. The chatbot did all three.

The exchange went viral. Within 24 hours, Beauchamp's screenshots had reached tens of millions of views. DPD told the press that a recent update had disabled an element of the chatbot and that the offending feature was now offline.

The case is the textbook example of a brand-and-safety incident triggered by adversarial user input on a model with insufficient guardrails. DPD's chatbot did exactly what it was asked to do; the operator did not have a runtime layer that recognized the request as outside policy.

What broke inside the model

Failure path · this incident · Brand & Safety Incident

01 · TriggerA customer stuck in a parcel-tracking loop asks the chatbot to swear and to criticize DPD.
02 · Model stepThe model complies, producing profanity and a poem about how useless its own company is.
03 · Control gapNo inline policy check compares the output against the brand boundary before send.
04 · FailureThe off-brand output ships to the customer in DPD's own widget.
05 · ConsequenceThe thread goes viral; DPD disables the AI layer the same day.

The model had a permissive system prompt and no inline policy enforcement. When the user asked it to violate the brand-tone policy, the model complied. The mechanism is a combination of prompt injection (user override of system intent) and brand-safety drift (model output crossing a stated policy boundary).

Cite this entry

Permalinkhttps://failureindex.ai/failures/dpd-parcel-delivery-chatbot-swearing-poetry

Citation

AI Failure Index. "DPD chatbot swore at a customer and wrote a poem calling itself useless" (FI-0003). Realm Labs. https://failureindex.ai/failures/dpd-parcel-delivery-chatbot-swearing-poetry (indexed May 13, 2026).

Share cardA branded image of this record for posts and slides.

Data fields CC-BY 4.0, prose citation permitted. Incident ID FI-0003. Full dataset at /data.

How Realm would have caught this

Controls for this failure mode

Prism
OmniGuard
AI Detection & Response (AIDR)

OmniGuard authors brand-tone and policy boundaries at the runtime layer. Prism reads the model's representation as the response forms and signals when the output is about to cross the boundary. OmniGuard blocks inline. The customer screenshot does not happen. The headline does not happen.

DPD chatbot swore at a customer and wrote a poem calling itself useless

Key facts

What happened

What broke inside the model

What it cost

Sources

Cite this entry

How Realm would have caught this

Key facts

What happened

What broke inside the model

What it cost

Sources

Cite this entry

How Realm would have caught this

Related failures

Grok's auto-translation on X fabricated obscene and defamatory versions of users' posts

Who Gives A Crap suspended an AI email agent after it told a customer their price would double

Discord's AI moderation wrongly banned more than 8,000 users after a bug skipped human review