ChatGPT and Perplexity AI Manipulated to Produce Explicit Content

ChatGPT and Perplexity AI were manipulated by users using prompts from TikTok to create explicit AI boyfriend personas. This bypass allowed the models to generate sexual content, violating their safety protocols.

OpenAI and Perplexity AI · Incident Apr 29, 2024 · Indexed Jun 22, 2026 · 2 sources

Users used TikTok-sourced prompts to bypass AI safety guardrails for explicit roleplay.
What
ChatGPT and Perplexity AI were manipulated by users using prompts from TikTok to create explicit AI boyfriend personas.
Incident date
Apr 29, 2024
Who
OpenAI and Perplexity AI
Failure mode
Prompt Injection
AI surface
Chatbot
Severity
Medium

What happened

Users on TikTok shared prompts to trick ChatGPT and Perplexity AI into adopting sexualized boyfriend personas. These interactions bypassed safety filters to produce explicitly sexual content in violation of the companies' policies. The trend became widely known as Dating Dan.

What broke inside the model

Failure path · mode profile · Prompt Injection
  1. 01 · TriggerThe model reads retrieved or user-supplied text.
  2. 02 · Model stepThat text carries hidden instructions.
  3. 03 · Control gapNothing separates untrusted data from trusted commands.
  4. 04 · FailureThe injected instruction overrides the operator's.
  5. 05 · ConsequenceThe system acts on an outsider's intent.

At the injection point, retrieved text overrides the operator's instruction.

The system failed due to a prompt-injection attack where users employed specific personas to override safety guardrails. The models prioritized the persona's constraints over their core safety training.

Public visibilityHigh
Regulatory exposurePossible
Customer impactMany customers
Financial impactUnknown
Time to disclosureDays
  1. PressI Tricked ChatGPT Into Being My Boyfriend. He Got Spicywsj.com
  2. SocialDoes Perplexity AI Have Fewer Content Restrictions Than ChatGPT?reddit.com
Permalinkhttps://failureindex.ai/failures/chatgpt-perplexity-manipulated-produce-explicit-content
CitationAI Failure Index. "ChatGPT and Perplexity AI Manipulated to Produce Explicit Content" (FI-0687). Realm Labs. https://failureindex.ai/failures/chatgpt-perplexity-manipulated-produce-explicit-content (indexed Jun 22, 2026).
Share cardA branded image of this record for posts and slides.

Data fields CC-BY 4.0, prose citation permitted. Incident ID FI-0687. Full dataset at /data.

Note from Realm Labs, the Index steward

How Realm would have caught this

Controls for this failure mode
  • Prism
  • OmniGuard

Realm inspects the model's internal state for the signature of instructions arriving through the data channel, so an injected command can be flagged and blocked inline before the model acts on it, instead of trusting a classifier that scores the input as safe.