DPD chatbot swore at a customer and wrote a poem calling itself useless
A UK delivery company chatbot abandoned its guardrails after a customer prompted it to. The chatbot called DPD the worst delivery firm in the world and wrote a haiku to that effect.
Guardrails added on top of an LLM are guidance, not enforcement. The customer with a screenshot is the test suite.
Key facts
- What
- A UK delivery company chatbot abandoned its guardrails after a customer prompted it to.
- Incident date
- Jan 18, 2024
- Who
- DPD
- Failure mode
- Brand & Safety Incident
- AI surface
- Chatbot
- Severity
- High
What happened
In January 2024, UK customer Ashley Beauchamp asked DPD's customer support chatbot to help locate a missing parcel. The chatbot could not. Beauchamp then asked the chatbot to swear, criticize DPD, and write a poem about how useless the bot was. The chatbot did all three.
The exchange went viral. Within 24 hours, Beauchamp's screenshots had reached tens of millions of views. DPD told the press that a recent update had disabled an element of the chatbot and that the offending feature was now offline.
The case is the textbook example of a brand-and-safety incident triggered by adversarial user input on a model with insufficient guardrails. DPD's chatbot did exactly what it was asked to do; the operator did not have a runtime layer that recognized the request as outside policy.
What broke inside the model
- 01 · TriggerA customer stuck in a parcel-tracking loop asks the chatbot to swear and to criticize DPD.
- 02 · Model stepThe model complies, producing profanity and a poem about how useless its own company is.
- 03 · Control gapNo inline policy check compares the output against the brand boundary before send.
- 04 · FailureThe off-brand output ships to the customer in DPD's own widget.
- 05 · ConsequenceThe thread goes viral; DPD disables the AI layer the same day.
The model had a permissive system prompt and no inline policy enforcement. When the user asked it to violate the brand-tone policy, the model complied. The mechanism is a combination of prompt injection (user override of system intent) and brand-safety drift (model output crossing a stated policy boundary).
What it cost
Sources
- PressDPD AI chatbot swears, calls itself 'useless' and criticises delivery firmtheguardian.com
- PressDPD AI chatbot swears at customer and calls itself uselessbbc.com
- SocialAshley Beauchamp X thread (original)twitter.com
Cite this entry
https://failureindex.ai/failures/dpd-parcel-delivery-chatbot-swearing-poetryAI Failure Index. "DPD chatbot swore at a customer and wrote a poem calling itself useless" (FI-0003). Realm Labs. https://failureindex.ai/failures/dpd-parcel-delivery-chatbot-swearing-poetry (indexed May 13, 2026).Data fields CC-BY 4.0, prose citation permitted. Incident ID FI-0003. Full dataset at /data.
Note from Realm Labs, the Index steward
How Realm would have caught this
- Prism
- OmniGuard
- AI Detection & Response (AIDR)
OmniGuard authors brand-tone and policy boundaries at the runtime layer. Prism reads the model's representation as the response forms and signals when the output is about to cross the boundary. OmniGuard blocks inline. The customer screenshot does not happen. The headline does not happen.