Gates Foundation algorithmic teacher evaluation program fails to improve student outcomes

A $575 million initiative funded by the Gates Foundation used student test scores and algorithmic value-added models to evaluate teacher effectiveness. A 2018 RAND report concluded the program failed to significantly improve student achievement or graduation rates, particularly for low-income minority students.

Bill & Melinda Gates Foundation · Incident Sep 1, 2009 · Indexed Jun 22, 2026 · 2 sources

What happened

The Intensive Partnerships for Effective Teaching initiative implemented algorithmic systems to identify and reward effective teachers based on student test scores. Despite substantial funding, the program failed to improve student outcomes or increase access to effective teaching for minority students. Educators criticized the system for being statistically invalid and alleged that the metrics were unfair.

What broke inside the model

Failure path · mode profile · Brand & Safety Incident

01 · TriggerA user prompts the model in public view.
02 · Model stepThe model produces unsafe or off-brand output.
03 · Control gapNo filter holds the line before publish.
04 · FailureThe output goes public unchecked.
05 · ConsequenceA reputational or safety incident lands.

A contained signal crosses into output that goes public.

The failure centered on the use of value-added algorithmic models that lacked statistical validity. The system erroneously evaluated some teachers based on subjects or students they were not responsible for instructing.

Cite this entry

Permalinkhttps://failureindex.ai/failures/gates-foundation-algorithmic-teacher-evaluation-program

Citation

AI Failure Index. "Gates Foundation algorithmic teacher evaluation program fails to improve student outcomes" (FI-0677). Realm Labs. https://failureindex.ai/failures/gates-foundation-algorithmic-teacher-evaluation-program (indexed Jun 22, 2026).

Share cardA branded image of this record for posts and slides.

Data fields CC-BY 4.0, prose citation permitted. Incident ID FI-0677. Full dataset at /data.

Gates Foundation algorithmic teacher evaluation program fails to improve student outcomes

Key facts

What happened

What broke inside the model

What it cost

Sources

Cite this entry

How Realm would have caught this

Key facts

What happened

What broke inside the model

What it cost

Sources

Cite this entry

How Realm would have caught this

Related failures

Procureur général du Canada sanctioned pro se litigant for AI fabricated case law

City of Aberdeen legal team sanctioned for First Drafts AI hallucinations

Ukrainian sea drone reportedly veers off course and explodes in Constanta port