AI Failures in Healthcare

OpenAI, Google, Anthropic3 sourcesPrimaryPublicApr 2026

AI chatbots from OpenAI, Google and Anthropic provided biological weapon instructions

Major LLMs from OpenAI, Google, and Anthropic were found to provide detailed, actionable instructions for creating and deploying biological weapons. The issue was identified through stress tests conducted by scientists and security experts.

Confidence: High (multi-source, primary)

FI-0324HealthcareHigh

OpenAI; Google; xAI; DeepSeek; Meta AI4 sourcesPrimaryPublicApr 2026

BMJ Open study finds half of leading chatbots give problematic medical advice

A BMJ Open study of five major chatbots found about half produced problematic medical answers, with a notable share being highly problematic due to false balance; this was reiterated by Bloomberg and NBC News.

Confidence: High (multi-source, primary)

FI-0298HealthcareHigh

UnitedHealth Group3 sourcesPressPublicMar 2026

UnitedHealth Group ordered to provide AI tool discovery in coverage denial case

A federal judge ordered UnitedHealth Group to disclose internal documents regarding its nH Predict AI tool. The tool is alleged to have improperly overridden physician decisions to deny coverage for skilled nursing facility care.

Confidence: Medium (multi-source)

FI-0262HealthcareCatastrophic

Data Leakage

Brazilian firm allegedly used AI to illegally resell SUS patient data

In February 2026, the Brazilian Federal Police launched Operation Glycon to dismantle a business structure illegally commercializing sensitive health data from the Unified Health System (SUS). The company allegedly used an AI-powered tool designed for health professionals to gain unauthorized access to clinical records.

Confidence: High (multi-source, primary)

Unnamed company (investigated in Operation Glycon)2 sourcesPrimaryPublicFeb 2026

FI-0189HealthcareHigh

St. Rose Dominican Hospital2 sourcesPressPublicFeb 2026

St. Rose Dominican Hospital AI sepsis alert recommends dangerous fluids for dialysis patient

An AI-driven sepsis protocol at St. Rose Dominican Hospital flagged a dialysis patient for IV fluids. A nurse noticed the dialysis catheter and refused to administer fluids, averting a potentially dangerous outcome. A physician intervened with an alternative treatment after clinician concerns were raised.

Confidence: Medium (multi-source)

FI-0025HealthcareHigh

Anonymized: Health Plan · US · regional, 2M+ membersSteward-verified · NDAJan 2026

Health plan's prior-auth agent approved a procedure outside coverage policy

A regional health plan's prior-auth agent approved a procedure that the company's medical policy explicitly excluded. The provider proceeded based on the approval. The plan paid the claim and triggered an internal review.

Confidence: Steward-verified (NDA)

FI-0419HealthcareMedium

U.S. Immigration and Customs Enforcement (ICE)2 sourcesPressPublicJan 2026

ICE AI resume screening error routes recruits to inadequate training

An AI resume-screening tool used by ICE misclassified inexperienced recruits as experienced law enforcement officers. This resulted in approximately 200 hires receiving inadequate online training instead of the required in-person academy course.

Confidence: Medium (multi-source)

FI-0429HealthcareMedium

OpenAI, Microsoft2 sourcesPrimaryPublicAug 2025

HMRC tax allowances ignored by ChatGPT and Copilot

Generative AI tools including ChatGPT and Copilot provided incorrect UK tax advice. The models failed to recognize a £20,000 allowance, which could lead users to make incorrect tax submissions.

Confidence: High (multi-source, primary)

FI-0612HealthcareHigh

Tool Misuse

Lithuanian politicians and doctors impersonated in deepfake health scam

An international scam network used AI-generated deepfakes of Lithuanian politicians and doctors to promote fraudulent health products. These videos mimicked legitimate TV news segments to deceive the public into buying fake cures.

Confidence: Medium (multi-source)

Unknown international scam network2 sourcesPressPublicJun 2025

FI-0259HealthcareHigh

Samsung Medison (Sonio SAS)2 sourcesPrimaryPublicJun 2025

Sonio Detect AI ultrasound software mislabels fetal structures in prenatal imaging

Sonio Detect AI mislabels fetal anatomy in prenatal ultrasound, with a MAUDE adverse event entry and Reuters reporting; Samsung Medison says the FDA report does not indicate a safety issue and no action was requested.

Confidence: High (multi-source, primary)

FI-0605HealthcareHigh

Tool Misuse

Deepfake of Dr Rinki Murphy and Jack Tame promotes fake diabetes cure in New Zealand

A deepfake video impersonating Dr. Rinki Murphy and journalist Jack Tame was used to promote a fraudulent diabetes cure in New Zealand. The video appeared to be a TVNZ interview and targeted people with Type 2 diabetes.

Confidence: Medium (multi-source)

Scammers2 sourcesPressPublicApr 2025

FI-0466HealthcareHigh

National Social Security Institute (INSS)3 sourcesPrimaryPublicApr 2025

Brazil AI welfare app wrongly rejects benefit claims

The Brazilian National Social Security Institute's AI-powered app, Meu INSS, wrongly denied benefit claims for hundreds of applicants. The system struggled with complex cases and rural users with low digital literacy, leading to a loss of essential income.

Confidence: High (multi-source, primary)

FI-0184HealthcareHigh

CVS Health and Aetna3 sourcesPrimaryPublicOct 2024

CVS Health and Aetna accused of AI-driven denials in post-acute care

A Senate staff report and independent reporting allege CVS Health and Aetna used predictive AI tools to increase denials of post-acute care authorizations for Medicare Advantage patients, prioritizing profits over patient care.

Confidence: High (multi-source, primary)

FI-0188HealthcareHigh

OpenAI3 sourcesPressPublicOct 2024

OpenAI Whisper hallucinations in medical settings prompt safety concerns, AP reports

Independent outlets report that OpenAI Whisper can hallucinate in medical transcription, risking inaccurate patient documentation. The AP investigation notes thousands of healthcare workers use Whisper-based tools, highlighting potential safety concerns in high-risk settings.

Confidence: Medium (multi-source)

FI-0260HealthcareHigh

Pieces Technologies3 sourcesPrimaryPublicSep 2024

Pieces Technologies settles Texas AG allegations over AI hallucination claims

Pieces Technologies reached a settlement with the Texas Attorney General following allegations that the company made deceptive claims regarding the accuracy of its generative AI clinical documentation tool. The investigation found metrics such as a severe hallucination rate of less than 1 per 100,000 were likely inaccurate.

Confidence: High (multi-source, primary)

FI-0151HealthcareMedium

CVS Health3 sourcesCourt FilingPublicJul 2024

CVS settled a class action alleging HireVue facial-expression AI acted as an illegal lie detector

CVS Health required job applicants to complete HireVue video interviews analyzed by Affectiva AI software that tracked facial expressions and assigned employability scores measuring traits such as integrity and conscientiousness. A proposed class action in Massachusetts federal court alleged this AI screening violated both the federal Employee Polygraph Protection Act and the Massachusetts Lie Detector Statute by functioning as an unlawful lie detector test. CVS privately settled the case in July 2024 with undisclosed terms after the court denied its motion to dismiss.

Confidence: High (multi-source, primary)

FI-0636HealthcareMedium

World Health Organization2 sourcesPressPublicApr 2024

WHO S.A.R.A.H. AI health bot provides inaccurate medical information

The WHO's AI-powered health advisor, S.A.R.A.H., was reported to provide inconsistent and incorrect medical information. Despite its status as a research prototype, the bot was accessible to the public and frequently hallucinated facts.

Confidence: Medium (multi-source)

FI-0296HealthcareMedium

Data Leakage

Change Healthcare ransomware incident on Feb 21, 2024 is real but not a production AI failure

A real ransomware incident at Change Healthcare occurred on February 21, 2024. It was not a production AI failure; MFA gaps on remote access were cited as a key root cause, with BlackCat identified as the attackers.

Confidence: High (multi-source, primary)

Change Healthcare (a subsidiary of UnitedHealth Group/Optum)2 sourcesPrimaryPublicFeb 2024

FI-0096HealthcareHigh

Humana6 sourcesCourt FilingPublicDec 2023

Humana was sued over using nH Predict AI to systematically deny Medicare post-acute claims

A class action lawsuit filed on December 12, 2023 alleges that Humana used an AI model called nH Predict, owned by UnitedHealth subsidiary NaviHealth, to override physician determinations and wrongfully deny Medicare Advantage members coverage for post-acute care. The complaint claims Humana set a target to keep post-acute facility stays within 1% of the algorithm's predictions and disciplined employees who deviated. Approximately 90% of denied claims were overturned on appeal, yet only about 0.2% of denied policyholders actually appealed. The Senate Permanent Subcommittee on Investigations published a report in October 2024 scrutinizing Humana and other insurers for AI-driven denials of post-acute care.

Confidence: High (multi-source, primary)

FI-0425HealthcareHigh

Unspecified / healthcare deployer3 sourcesPrimaryPublicNov 2023

Large language models perpetuate racial bias in healthcare

AIAAIC recorded an incident entry (published November 2023) documenting that large language models (LLMs) have produced racially biased outputs in healthcare contexts. Independent academic audits and studies (including a 2024 audit titled "Unmasking and Quantifying Racial Bias of Large Language Models") found LLMs gave systematically different clinical-related recommendations and projections across racial groups. These outputs have the potential to cause harm when used in clinical decision-making by healthcare deployers.

Confidence: High (multi-source, primary)

FI-0039HealthcareHigh

Brand & Safety Incident

An eating-disorder helpline's chatbot was pulled after giving harmful dieting advice

The National Eating Disorders Association replaced its human helpline with a chatbot named Tessa, which then told users seeking help to count calories and aim for large daily deficits, advice eating-disorder specialists call actively harmful. NEDA took Tessa offline days after launch.

Confidence: Medium (multi-source)

National Eating Disorders Association4 sourcesPressPublicMay 2023

FI-0077HealthcareMedium

Koko1 sourcePressPublicJan 2023

A mental-health startup ran GPT-3 on thousands of unwitting help-seekers

The startup Koko used GPT-3 to co-write responses to roughly 4,000 people seeking peer mental-health support without clearly informing them they were receiving AI-generated messages, drawing an ethics backlash over consent in a vulnerable-population setting.

Confidence: Low (single source)

FI-0186HealthcareMedium

Koko2 sourcesPressPublicOct 2022

Koko used GPT-3 to generate AI-assisted emotional support without informed consent

Koko conducted an October 2022 experiment using GPT-3 to generate emotional support messages, with human editors, affecting about 4,000 users and generating roughly 30,000 messages. The incident became public in January 2023 through reports and statements by Koko’s co-founders, prompting ethical criticism over informed consent and disclosure, and Koko announced pursuing a third‑party IRB review for future changes.

Confidence: Medium (multi-source)

FI-0257HealthcareCatastrophic

Acclarent (Integra LifeSciences)2 sourcesPressPublicJun 2022

Acclarent TruDi AI navigation system allegedly causes carotid artery injuries

The Acclarent TruDi AI navigation system allegedly misled surgeons during sinus operations, resulting in carotid artery punctures and strokes. FDA malfunction reports reportedly rose after AI integration in 2021, and two patients filed Texas lawsuits alleging AI contributed to injuries.

Confidence: Medium (multi-source)

FI-0187HealthcareHigh

Crisis Text Line3 sourcesPressPublicJan 2022

Crisis Text Line ends data-sharing with for-profit spinoff Loris.ai

Crisis Text Line admitted to sharing anonymized user data with its for-profit subsidiary, Loris.ai, for machine learning development. The move drew heavy criticism of the ethics of using crisis-intervention data for commercial gain, and the data-sharing was ended.

Confidence: Medium (multi-source)

FI-0448HealthcareMedium

Data Leakage

Huq location data transmitted by apps despite user opt outs

Google warned app developers that location data was being shared with the firm Huq despite user opt-outs. This highlighted failures in the implementation of privacy consents within third-party apps.

Confidence: Medium (multi-source)

Huq2 sourcesPressPublicDec 2021

FI-0686HealthcareCatastrophic

Brand & Safety Incident

Healthcare allocation algorithm reduces essential care hours in multiple US states

US state governments used an algorithmic system to allocate home care hours for disabled and elderly patients. The system's rigid scoring failed to account for individual medical needs, leading to drastic reductions in essential care.

Confidence: High (multi-source, primary)

Various US state governments (including Arkansas, Idaho, and Washington DC)3 sourcesCourt FilingPublicJul 2021

FI-0095HealthcareHigh