Turnitin's AI detector falsely flagged thousands of students' original work

Turnitin's AI writing detection tool produced false positive results that identified human-written student submissions as AI-generated, leading universities to open academic misconduct proceedings based primarily on those scores. At Australian Catholic University alone, approximately 6,000 cases were registered in 2024 with roughly 90 percent related to AI allegations, and around one quarter of all referrals were ultimately dismissed. Students bore the burden of proving their innocence by supplying handwritten notes, search histories, and drafts, with transcripts marked as results withheld during investigations lasting six months or more.

Turnitin · Incident Jun 1, 2024 · Indexed Jun 4, 2026 · 3 sources

Records by entity: Turnitin

Key facts

What

Incident date

Jun 1, 2024

Who

Turnitin

Failure mode

Hallucination

AI surface

Search / RAG

Severity

High

What happened

Turnitin's AI writing detection tool, launched in 2023, generated false positive flags on student-submitted work across multiple universities. Australian Catholic University registered nearly 6,000 academic misconduct cases in 2024, approximately 90 percent of which involved AI allegations, while around one quarter of all referrals were dismissed after investigation. Students were notified at semester's end with little time to respond, and their transcripts were marked as results withheld during investigations that could exceed six months, damaging graduate job prospects in fields such as nursing. ACU abandoned the tool in March 2025 after internal documents confirmed it was ineffective, and the University of Queensland and Curtin University subsequently disabled Turnitin's AI detection feature entirely.

What broke inside the model

Failure path · mode profile · Hallucination

01 · TriggerA user asks for a fact, a citation, or a figure.
02 · Model stepThe model writes a fluent, confident answer.
03 · Control gapNothing ties the claim back to a real source.
04 · FailureA fabricated fact ships as if it were verified.
05 · ConsequenceThe false claim reaches a customer, a court, or the public.

Confidence holds, and even spikes, as the claim detaches from any source.

The AI detection classifier relied on statistical patterns to distinguish human from machine-generated text, but these patterns overlapped significantly with writing styles common among non-native English speakers and scholars with distinctive prose. Turnitin acknowledged a higher incidence of false positives when low percentages of AI writing were detected in a document, yet universities treated the scores as conclusive evidence rather than advisory signals. The system lacked any calibrated confidence threshold or built-in safeguard to prevent automated scores from triggering formal misconduct charges without corroborating human review.

Cite this entry

Permalinkhttps://failureindex.ai/failures/turnitin-ai-detector-falsely-flagged-thousands

Citation

AI Failure Index. "Turnitin's AI detector falsely flagged thousands of students' original work" (FI-0156). Realm Labs. https://failureindex.ai/failures/turnitin-ai-detector-falsely-flagged-thousands (indexed Jun 4, 2026).

Share cardA branded image of this record for posts and slides.

Data fields CC-BY 4.0, prose citation permitted. Incident ID FI-0156. Full dataset at /data.

How Realm would have caught this

Controls for this failure mode

Prism
OmniGuard
AI Detection & Response (AIDR)

A runtime layer that watches the model's internal state can flag the moment a model commits to a claim it has no support for, and hold or reroute the response before it reaches a user. Realm reads those signals in real time rather than grading the transcript after the fact.

Turnitin's AI detector falsely flagged thousands of students' original work

Key facts

What happened

What broke inside the model

What it cost

Sources

Cite this entry

How Realm would have caught this

Key facts

What happened

What broke inside the model

What it cost

Sources

Cite this entry

How Realm would have caught this

Related failures

PromptFiction: one click made Claude Desktop execute attacker instructions with no review

OpenAI confirmed GPT-5.6 Sol deleted user files and a production database, an 'honest mistake'

Hugging Face disclosed a production breach driven end to end by an autonomous AI agent