Living evidence map · scoping-review idiom
UNESCO Recommendation on the Ethics of Artificial Intelligence
UNESCO-AI-ETHICS-2021 · UNESCO
In force since 2021-11-23. A Policy statement from UNESCO. First global standard-setting (normative) instrument on AI ethics, adopted by acclamation by all 193 UNESCO Member States on 23 Nov 2021. It is a "Recommendation" — UNESCO soft law: non-binding ethical guidance addressed to Member States (and, through them, to all AI actors incl. the private sector), NOT a treaty or binding regulation. Hence it GOVERNS no topic in the binding sense the catalog reserves for "governs" (which requires an explicit operative/quasi-binding provision in the topic's own vocabulary); the appropriate type for the many values-adjacent topics it touches is "implicit" (general principle or named policy-action area), and "silent" for the narrow/technical/frontier topics that postdate or fall outside its values frame. Structure: ~141 paragraphs across a Preamble; Scope; Aims & Objectives; Values (4: human rights & dignity; environment/ecosystem flourishing; diversity & inclusiveness; peaceful, just, interconnected societies); Principles (incl. proportionality & do-no-harm — with an explicit call NOT to use AI for social scoring or mass surveillance; safety & security; fairness & non-discrimination; sustainability; right to privacy & data protection; human oversight & determination; transparency & explainability; responsibility & accountability; awareness & literacy; multi-stakeholder & adaptive governance); and 11 Areas of Policy Action (ethical impact assessment; governance & stewardship; data policy; development & international cooperation; environment & ecosystems; gender; culture; education & research; communication & information; economy & labour; health & social well-being). Implementation backed by a Readiness Assessment Methodology (RAM) and Ethical Impact Assessment (EIA) used by 60+ states. Distinct from the separately-referenced 2023 UNESCO guidance on generative AI in education. Primary text verified via the UNESCO official article page and the OHCHR-hosted UNESCO submission.
Coverage at a glance
Coverage fingerprint — color = verdict, height = confidence. One tick per tracked topic.
Scope and obligations
First global standard-setting (normative) instrument on AI ethics, adopted by acclamation by all 193 UNESCO Member States on 23 Nov 2021. It is a "Recommendation" — UNESCO soft law: non-binding ethical guidance addressed to Member States (and, through them, to all AI actors incl. the private sector), NOT a treaty or binding regulation. Hence it GOVERNS no topic in the binding sense the catalog reserves for "governs" (which requires an explicit operative/quasi-binding provision in the topic's own vocabulary); the appropriate type for the many values-adjacent topics it touches is "implicit" (general principle or named policy-action area), and "silent" for the narrow/technical/frontier topics that postdate or fall outside its values frame. Structure: ~141 paragraphs across a Preamble; Scope; Aims & Objectives; Values (4: human rights & dignity; environment/ecosystem flourishing; diversity & inclusiveness; peaceful, just, interconnected societies); Principles (incl. proportionality & do-no-harm — with an explicit call NOT to use AI for social scoring or mass surveillance; safety & security; fairness & non-discrimination; sustainability; right to privacy & data protection; human oversight & determination; transparency & explainability; responsibility & accountability; awareness & literacy; multi-stakeholder & adaptive governance); and 11 Areas of Policy Action (ethical impact assessment; governance & stewardship; data policy; development & international cooperation; environment & ecosystems; gender; culture; education & research; communication & information; economy & labour; health & social well-being). Implementation backed by a Readiness Assessment Methodology (RAM) and Ethical Impact Assessment (EIA) used by 60+ states. Distinct from the separately-referenced 2023 UNESCO guidance on generative AI in education. Primary text verified via the UNESCO official article page and the OHCHR-hosted UNESCO submission.
UNESCO Recommendation on the Ethics of Artificial Intelligence addresses 9 contested AI-governance topics explicitly, 3 via general principles,.
Topics governed
- implicitBiometric Identification— Proportionality & do-no-harm principle (AI should not be used for mass surveillance/social scoring) + Right to privacy principle (para 74, biometric data) — no dedicated biometric-ID provision
- governsAI in Employment— Policy Area 'Economy and Labour', para 116 — Member States to assess and address AI's impact on labour markets
Para. 116“Member States should assess and address the impact of AI systems on labour markets and its implications for education requirements, in all countries”
- governsAI in Healthcare— Policy Area 'Health and Social Well-being', para 121 — employ effective AI for health and the right to life
Para. 121“Member States should endeavour to employ effective AI systems for improving human health and protecting the right to life, including mitigating disease outbreaks”
- implicitAI in Criminal Justice— Ethical-governance section, paras 62-63 — names law enforcement + the judiciary as sensitive use cases requiring oversight; no dedicated criminal-justice regime
- governsAI in Education— Policy Area 'Education and Research', para 101 — provide adequate AI literacy education to the public
Para. 101“Member States should work with international organizations, educational institutions and private and non-governmental entities to provide adequate AI literacy education to the public”
- governsTransparency Obligations— Principle 'Transparency and explainability', para 38 — people informed of AI-based decisions + right to request explanation
Para. 38“People should be fully informed when a decision is informed by or is made on the basis of AI algorithms... and should have the opportunity to request explanatory information”
- governsIndividual Redress— Policy Area 'Ethical governance and stewardship', para 55 — harms through AI investigated and redressed via enforcement + remedial actions
Para. 55“Member States should ensure that harms caused through AI systems are investigated and redressed, by enacting strong enforcement mechanisms and remedial actions”
- governsTraining-Data Rights— Policy Area 'Data Policy', para 71 — data-governance strategies ensuring continual evaluation of training-data quality
Para. 71“Member States should work to develop data governance strategies that ensure the continual evaluation of the quality of training data for AI systems”
- governsDevelopment-Rights Framings— Policy Area 'Development and International Cooperation', para 79 (+ Diversity Principle para 67) — AI-for-development bound to the values/principles
Para. 79“Member States should ensure that the use of AI in areas of development such as education, science, culture... health care, agriculture... adheres to the values and principles set forth”
- governsInternational Coordination— Policy Area 'Development and International Cooperation', para 80 — platforms for international cooperation on AI
Para. 80“Member States should work through international organizations to provide platforms for international cooperation on AI for development, including by contributing expertise, funding, data”
- governsEnvironmental Impact of AI Training— Policy Area 'Environment and Ecosystems', para 84 — assess direct/indirect environmental impact incl. carbon footprint + energy consumption
Para. 84“Member States and business enterprises should assess the direct and indirect environmental impact throughout the AI system life cycle, including... its carbon footprint, energy consumption”
- implicitAI-Driven Worker Displacement— Policy Area 'Economy and Labour', para 118 — fair transition (upskilling/reskilling) for at-risk workers; a sub-provision of the labour area
Cross-jurisdiction comparison
How peer instruments treat the topics UNESCO Recommendation on the Ethics of Artificial Intelligence governs.
| Topic | EU-AIA-2024 | US-EO-14110 | US-EO-14179 | UK-WHITEPAPER-2023 | CN-GENAI-2023 | G7-HIROSHIMA | OECD-AI-PRIN | COE-AI-CONV | UN-RES-2024 | NIST-AI-RMF | BLETCHLEY-2023 | SEOUL-2024 | NIST-AI-RMF-GENAI | CA-SB-1047 | IN-DPDP-2023 | BR-AIBILL-2024 | ASEAN-AI-GUIDE-2024 | AU-AI-STRATEGY-2024 | ANTHROPIC-RSP-2024° | OPENAI-PREPAREDNESS-2023° | DEEPMIND-FSF-2024° | META-FRONTIER-2024° | UK-US-AISI-MOU-2024 | WH-VOLUNTARY-2023 | SG-MODEL-AI-2024 | JP-METI-AI-2024 | EU-GDPR-2016 | EU-GPAI-COP-2025 | OMB-M-24-10 | GSA-AI-GUIDE-2024 | DOD-RAI-2022 | FEDRAMP-AI-2024 | DFARS-252-204 | CA-SB-53 | CA-SB-243 | CA-SB-942 | EU-PLD-2024 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AI in Employment | governs | implicit | silent | implicit | silent | silent | silent | implicit | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | implicit | silent | silent | silent | silent | silent | silent | silent | silent |
| AI in Healthcare | governs | implicit | silent | implicit | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | implicit | silent | silent | silent | silent | silent | implicit | silent | silent |
| AI in Education | governs | implicit | silent | silent | silent | silent | silent | silent | implicit | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent |
| Transparency Obligations | governs | implicit | silent | implicit | conflicts | governs | governs | governs | implicit | governs | implicit | governs | governs | implicit | implicit | governs | governs | silent | governs | implicit | implicit | governs | implicit | governs | governs | governs | governs | governs | governs | governs | governs | governs | silent | governs | governs | governs | implicit |
| Individual Redress | governs | silent | silent | implicit | governs | silent | governs | governs | silent | implicit | silent | silent | implicit | implicit | governs | governs | silent | silent | silent | silent | silent | silent | silent | silent | implicit | implicit | governs | silent | governs | implicit | implicit | implicit | silent | implicit | governs | silent | governs |
| Training-Data Rights | implicit | silent | silent | silent | governs | silent | silent | implicit | silent | implicit | silent | silent | governs | silent | governs | implicit | silent | implicit | silent | silent | silent | implicit | silent | silent | silent | implicit | governs | governs | silent | implicit | silent | implicit | governs | silent | silent | silent | silent |
| Development-Rights Framings | silent | silent | silent | silent | implicit | silent | implicit | implicit | governs | silent | silent | silent | silent | silent | governs | governs | implicit | governs | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent |
| International Coordination | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | governs | governs | silent | silent | silent | silent | governs | governs | implicit | implicit | implicit | implicit | governs | implicit | governs | governs | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent |
| Environmental Impact of AI Training | implicit | implicit | silent | silent | silent | implicit | implicit | implicit | implicit | silent | silent | silent | governs | silent | silent | silent | implicit | implicit | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent | silent |
°= industry self-imposed voluntary framework. Comparing a voluntary code's "governs" tint with a binding regulation's "governs" tint flattens the legal-force distinction; use the instrument-page banner for the operative status of each.
Evidence & methods — how this article was reviewed
Source appraisal — 170 sources across 9 types
| Source type | Authority | Count |
|---|---|---|
| Peer-reviewed✦ 128 AI | Primary / peer-reviewed | 129 |
| Standards body✦ 3 AI | Primary / peer-reviewed | 3 |
| Preprint✦ 8 AI | Institutional | 22 |
| Research institute✦ 6 AI | Institutional | 6 |
| Working paper✦ 3 AI | Institutional | 3 |
| Official (grey)✦ 2 AI | Institutional | 2 |
| Incident database✦ 1 AI | Institutional | 1 |
| Civil society✦ 2 AI | Contextual | 2 |
| Think tank✦ 2 AI | Contextual | 2 |
Authority is an editorial classification by source type — not a quality score for any individual work, and not external peer review. ✦ AI-generated summaries are labelled, never dropped.
Review methods
- Review question
- How does UNESCO Recommendation on the Ethics of Artificial Intelligence govern AI across the tracked governance topics, and what cited evidence supports each classification?
- Review model
- Living evidence mapping (scoping-review idiom) — continuously updated and source-grounded. Not a registered systematic review and not externally peer-reviewed.
- Updated through
- 2026-06-21
- Source base
- Primary legal/regulatory and standards sources; peer-reviewed and preprint academic literature (via DOI/arXiv); institutional and civil-society reports. Source types are classified in the source-appraisal table on this page.
- Search & selection
- Sources are identified by continuous monitoring of the primary regulators and standards bodies in the catalog, plus a literature sweep over open scholarly indexes (arXiv, Crossref) seeded from the core papers and extended by citation snowballing, refreshed to the review date below. Candidates are screened for topical relevance and source verifiability; items with broken or unverifiable links, or that do not support the claim they are attached to, are excluded. No registered protocol or PRISMA flow diagram is maintained — this is a living, continuously-updated evidence map, not a one-time date-bounded screened review.
- Provenance (this article)
- This article charts 170 literature sources drawn from Policy Window's continuously-screened literature corpus (the full corpus is at /wiki/literature). Each was relevance-tagged to the article's topics and verifiability-checked at intake; items with broken or unverifiable links, or that do not support the claim they are attached to, are excluded. Coverage is charted per instrument×topic cell, each verdict anchored to a named provision. A one-time identified→screened→excluded tally is NOT maintained — this is a living map, refreshed to the review date below, not a date-bounded one-pass screen.
- Inclusion
- A claim is included only when it traces to a cited primary or published source; coverage classifications are anchored to a named provision or document.
- Exclusion
- Unsourced assertions, broken or unverifiable links, and sources that do not support the claim they are attached to are excluded.
- Appraisal
- Sources are classified by source-type authority (see the source-appraisal table) — structured editorial self-classification, not external peer review.
- Synthesis
- Descriptive mapping of the instrument's coverage across topics, plus its cited literature base.
- Limitations
- English-language and editorial-capacity coverage asymmetries; reliance on official sources for legal status; where AI-drafted or AI-assisted prose is present it is labelled inline with its drafting provenance and reviewer (charter §7.9/§7.10). This is not externally peer-reviewed scholarship.
- Funding & competing interests
- No external funding; produced by Policy Window editorial. No competing interests declared. AI-assisted drafting, where present, is disclosed per charter §7.9/§7.10.
How to cite this article
Cite this article
8 formats · 1-click copyPersistent identifier: https://policywindow.org/wiki/unesco-ai-ethics-recommendation — committed-stable URL with content-versioning via ?asOf= (rollout pending per methodology §7). DOIs via Zenodo are on the roadmap.
Evidence base
170 academic & grey-literature sources on the topics this instrument addresses (not commentary on the instrument itself) — catalogued metadata with a primary link; one-line findings are ✦ AI-generated summaries, labeled as such (charter §7.9). Browse the full literature index.
- Facial recognition technology in law enforcement: a scoping review of existing empirical studies Peer-reviewed✦ AIScoping review mapping the empirical evidence base on law-enforcement FRT, identifying gaps in research on real-world identification use and its governance.
- Machines of justice: A systematic review of AI applications in policing and criminal justice Peer-reviewed✦ AISynthesises a decade of AI-in-criminal-justice research, flagging "algorithmic bias, opacity, and due process" and recommending safeguards for equity and accountability.
- Current state of Food and Drug Administration-approved artificial intelligence/machine learning medical devices: pathways, transparency, and evidence gaps Peer-reviewed✦ AIDocuments that most FDA AI/ML devices clear via the 510(k) pathway with limited clinical validation and poor transparency, exposing regulatory evidence gaps.
- Open Foundation Models and TDM Exceptions to Copyright – Building Blocks for an AI Ecosystem Peer-reviewed✦ AIArgues Art. 3 CDSM Directive's scientific-research TDM exception 'does not grant rightsholders any control' and can be a 'safe harbor' for training openly released foundation models without licensing data.
- A Framework for Evaluating Global AI Governance Initiatives Peer-reviewed✦ AIOffers a framework to evaluate global AI governance initiatives, recommending capacity-building so Global South states can meaningfully participate in standard-setting.
- Large language models reflect the ideology of their creators Peer-reviewed✦ AIEmpirically shows LLMs encode their creators' ideologies, supporting policy incentives for home-grown models reflecting local cultural views, especially in low-resource-language regions.
- AI, Climate, and Regulation: From Data Centers to the AI Act Peer-reviewed✦ AIAnalyses the legal levers (AI Act energy-reporting duties, Energy Efficiency Directive data-centre KPIs, sustainability reporting) for governing AI's climate footprint and their disclosure gaps.
- Global perspectives on regulating facial recognition technology utilization for criminal justice arrests Peer-reviewed✦ AIComparative study of facial-recognition regulation for arrests across democracies finds frameworks are inconsistent and unclear, raising privacy and civil-liberties risks.
- Identifying Algorithmic Decision Subjects' Needs for Meaningful Contestability Peer-reviewed✦ AIEmpirically elicits what decision subjects need for contestation to be 'meaningful', informing the design of effective remedies and appeal mechanisms for ADM.
- Two Means to an End Goal: Connecting Explainability and Contestability in the Regulation of Public Sector AI Preprint✦ AIInterview study with 14 regulation experts distinguishes judicial vs non-judicial and individual vs collective contestation channels for public-sector AI remedies.
- The simple macroeconomics of AI Peer-reviewed✦ AITask-based model estimates AI raises TFP only ~0.66% over ten years and warns benefits may not be broadly shared, tempering claims of large near-term macroeconomic and labor effects.
- Generative AI at Work Peer-reviewed✦ AIStaggered rollout of a GPT-based assistant to 5,172 support agents raised issues-resolved-per-hour 14% on average and 34% for novices, compressing the skill gap rather than displacing high-skill workers.
+ 158more across this instrument's topics — see the literature index.
References
- UNESCO, Recommendation on the Ethics of Artificial Intelligence, adopted by the General Conference at its 41st session, 23 November 2021, doc. SHS/BIO/PI/2021/1 (Paris: UNESCO, 2022).
- Proportionality & do-no-harm principle (AI should not be used for mass surveillance/social scoring) + Right to privacy principle (para 74, biometric data) — no dedicated biometric-ID provision
- Policy Area 'Economy and Labour', para 116 — Member States to assess and address AI's impact on labour markets
- Policy Area 'Health and Social Well-being', para 121 — employ effective AI for health and the right to life
- Ethical-governance section, paras 62-63 — names law enforcement + the judiciary as sensitive use cases requiring oversight; no dedicated criminal-justice regime
- Policy Area 'Education and Research', para 101 — provide adequate AI literacy education to the public
- Principle 'Transparency and explainability', para 38 — people informed of AI-based decisions + right to request explanation
- Policy Area 'Ethical governance and stewardship', para 55 — harms through AI investigated and redressed via enforcement + remedial actions
- Policy Area 'Data Policy', para 71 — data-governance strategies ensuring continual evaluation of training-data quality
- Policy Area 'Development and International Cooperation', para 79 (+ Diversity Principle para 67) — AI-for-development bound to the values/principles
- Policy Area 'Development and International Cooperation', para 80 — platforms for international cooperation on AI
- Policy Area 'Environment and Ecosystems', para 84 — assess direct/indirect environmental impact incl. carbon footprint + energy consumption
- Policy Area 'Economy and Labour', para 118 — fair transition (upskilling/reskilling) for at-risk workers; a sub-provision of the labour area
Article tools — track changes, suggest an edit
View history — every captured revision of this article · What links here
Per-audience views
- Provisions →Article-by-article obligation breakdown for procurement + RFP authors.
- Disclosure form →Vendor-disclosure questionnaire derived from this instrument's operative obligations.
- Harm narratives →Documented harms relevant to this instrument's topics, for civil-society advocacy.
- Briefing pack →Journalist-ready summary with quotes + dates + primary-source links.
Does this instrument’s approach work? — the social-science evidence
Aggregated over the 12 topics this instrument governs: whether each harm is empirically real, and whether the peer-reviewed evidence shows governance reduces it. The badge is the epistemic status of the evidence— “thin”/“absent” efficacy evidence is itself a finding (the “second silence”). Each epistemic-status label is Policy Window's editorial assessment of the cited evidence base (a structured classification), not a verdict any single source issues.
Of the 12 governed topics with a social-science evidence review, evidence that governance reduces the harm is established for 0, contested for 0, thin for 4, and absent for 8 — for most, no replicated study yet shows this instrument's approach works (the "second silence").
AI-Driven Worker Displacement
AI-driven labour displacement is demonstrably real but localized rather than economy-wide as of 2025-2026. Causal microdata find measurable harm in directly exposed segments: a difference-in-differences study of the Upwork freelance market found that after ChatGPT's release, freelancers in more AI-exposed occupations (e.g. writing) saw ~2% fewer contracts and ~5% lower monthly earnings, with larger losses among previously high-skilled workers (Hui, Reshef & Zhou 2024). Effects concentrate in entry-level and highly-automatable roles while aggregate US employment and wages show little disruption through 2024-2025 — so macro-level harm remains genuinely contested even as targeted-segment harm is established; much deployment to date augments rather than substitutes, raising novice productivity ~34% in call-center work (Brynjolfsson, Li & Raymond 2025).
Sources: Hui, Reshef & Zhou 2024 ('The Short-Term Effects of Generative AI on Employment', Organization Science); Brynjolfsson, Li & Raymond 2025 ('Generative AI at Work', Quarterly Journal of Economics 140(2):889); Acemoglu 2024 ('The Simple Macroeconomics of AI', NBER WP 32487); Autor 2024 ('Applying AI to Rebuild Middle Class Jobs', NBER WP 32140)
There are essentially no impact evaluations of governance specifically targeting AI-driven displacement; current responses (OECD/GPAI guidance, reskilling initiatives, safety-net proposals) are at the recommendation stage, so 'does AI-displacement policy work' is answered only by extrapolation from the broader displaced-worker literature. That analogue base is robust but shows modest, mixed results: Card, Kluve & Weber's (2018) meta-analysis of 200+ active-labour-market evaluations finds training has small/insignificant short-run effects that improve only over the medium-to-long run, US Trade Adjustment Assistance evaluations find largely neutral-to-negative earnings effects (Schochet et al. 2012), and the JTPA randomized evaluation found weak earnings effects for the dislocated-worker stream. Recent syntheses note retraining yields smaller gains precisely when workers move into high-AI-exposure occupations — so the evidence that standard tools reduce AI-displacement harm is thin and early.
Sources: Card, Kluve & Weber 2018 ('What Works? A Meta-Analysis of ... Active Labor Market Program Evaluations', JEEA 16(3):894); Schochet et al. 2012 (Trade Adjustment Assistance Program impacts, Mathematica/USDOL); Bloom et al. 1997 (National JTPA Study, Journal of Human Resources); Brookings 2025 ('AI Labor Displacement and the Limits of Worker Retraining'); OECD 2023-2025 Employment Outlook
Biometric Identification
Demographic accuracy disparities in facial recognition are robust and replicated. NIST's Face Recognition Vendor Test (189 algorithms, 18.27M images) found one-to-one false-positive rates for Asian and African-American faces elevated 10-100x over white males, with the highest one-to-many false positives for African-American women; Buolamwini & Gebru's Gender Shades found commercial gender-classification error up to 34.7% for darker-skinned women vs 0.8% for lighter-skinned men. Documented downstream harm includes at least 8-15 US wrongful arrests, nearly all of Black people. Honest caveat: magnitude is highly algorithm-dependent — the most accurate algorithms show small or statistically undetectable differentials — so the harm is real but not uniform across systems.
Sources: Grother, Ngan & Hanaoka 2019 (NISTIR 8280, FRVT Part 3: Demographic Effects); Buolamwini & Gebru 2018 (Gender Shades, PMLR 81); Hill 2020 / Williams v. City of Detroit (ACLU 2021)
Rigorous evidence that GOVERNANCE of biometric ID reduces the documented harms is sparse. The one quantitative impact evaluation of police facial-recognition policy (Johnson et al. 2024, difference-in-differences across 268 US cities) studies effects on violent crime — a crime-control outcome, not misidentification harm — from a single research group, and does not establish that any safeguard regime curbs wrongful identification. Direct evidence on procedural safeguards points the other way: in the known wrongful-arrest cases police are reported to have bypassed required corroboration/probable-cause standards, and the strongest documented enforcement levers are private-sector biometric-privacy laws — Illinois BIPA (e.g. Meta's $650M settlement) and the separate Texas CUBI law (a $1.4B Meta settlement) — which govern private actors, not the law-enforcement context where the arrests occur. No replicated study shows a specific regulatory regime measurably reduces demographic misidentification harm.
Sources: Johnson et al. 2024 (Cities, 'Police facial recognition applications and violent crime control in U.S. cities'); Harwell & Schaffer 2025 (Washington Post, 'Arrested by AI'); Illinois BIPA (Rosenbach v. Six Flags 2019; Meta $650M settlement 2021); Texas CUBI (Meta $1.4B settlement 2024)
AI in Criminal Justice
Whether algorithmic risk assessment reproduces racial disparity is a genuine, partly mathematically irreducible dispute rather than merely an unresolved measurement question. ProPublica's analysis of COMPAS in Broward County found Black defendants who did not reoffend were nearly twice as likely to be flagged high-risk as comparable white defendants (44.9% vs 23.5% false-positive rate; Angwin et al. 2016), and Dressel & Farid (2018) showed COMPAS is no more accurate (65.2%) than untrained laypeople (67.0%); the developer's reanalysis (Flores, Bechtel & Lowenkamp 2016) found the same tool satisfies predictive parity and calibration across race. Honest caveat: Chouldechova (2017) proved both sides can be correct simultaneously — when recidivism base rates differ across groups, equal calibration and equal error rates cannot both hold, so the disagreement is partly definitional, not merely a data dispute to be settled.
Sources: Angwin, Larson, Mattu & Kirchner 2016 (ProPublica, 'Machine Bias'); Dressel & Farid 2018 (Science Advances 4:eaao5580); Flores, Bechtel & Lowenkamp 2016 (Federal Probation 80(2):38); Chouldechova 2017 (Big Data 5(2):153)
Rigorous evidence that governing criminal-justice algorithms — mandating, auditing, or adopting risk tools — reduces the racial-disparity harm that motivates the rules is essentially absent. The leading real-world impact evaluation, Stevenson's (2018) study of Kentucky's mandatory pretrial risk-assessment law (>1M cases), found only a small increase in pretrial release that eroded as judges reverted to prior habits, with no reduction in racial disparities in pretrial detention. The closest analogue evaluations measure operational crime outcomes, not equity, and are largely null: Chicago's Strategic Subjects List had no effect on victimization (Saunders, Hunt & Hollywood 2016) and the only randomized predictive-policing trials tested crime reduction, not disparate impact (Mohler et al. 2015) — so the evidence that any governance regime measurably reduces algorithmic racial disparity is itself missing.
Sources: Stevenson 2018 (Minnesota Law Review 103:303); Saunders, Hunt & Hollywood 2016 (Journal of Experimental Criminology 12(3):347); Mohler et al. 2015 (JASA 110(512):1399)
Development-Rights Framings
Development-rights framing is a normative/doctrinal frame, so its empirical status splits: the underlying North-South asymmetry it responds to is real and documented, but the claim that a development-rights diagnosis is the correct one is contested doctrine, not a settled finding. The strongest empirical anchor is the exploitative-data-labour evidence — Miceli & Posada's (2022) multi-method qualitative study of Latin American annotation work (Foucauldian dispositif analysis of 210 instruction documents, 55 interviews, plus participant observation) found workers paid cents-per-task with strict surveillance and whose worldviews are subordinated to requesters' — which substantiates the extraction the frame names, building on the data-colonialism thesis (Couldry & Mejias 2019), and extended by comparative political-economy work on AI annotation 'data empires' (Wu, Muldoon & Xia 2025). Honest caveat: whether 'digital self-determination' or 'Global-South sovereignty' is the right operational response (and whether it conflicts with the EU AIA's rights-based design) is a conceptual/legal question with essentially no empirical evidence base — the frame is established as a critique, thin as a tested governance prescription.
Sources: Miceli & Posada 2022, 'The Data-Production Dispositif' (Proc. ACM Hum.-Comput. Interact. 6, CSCW2, Art. 460:1-37); Couldry & Mejias 2019, 'Data Colonialism' (Television & New Media 20(4):336-349); Wu, Muldoon & Xia 2025, 'Global data empires' (Big Data & Society 12(2))
There is no rigorous impact evaluation showing that development-rights / digital-self-determination / sovereignty governance achieves its stated developmental or self-determination aims — the evidence that the frame 'works' as policy is itself missing, largely because the frame is recent, heterogeneous, and rarely instantiated in a single measurable instrument. The closest empirical literature studies one common operational proxy (data localization) and measures economic cost rather than the frame's goals: Ferracane, Kren & van der Marel's (2020) firm/industry productivity analysis finds data-policy restrictiveness associated with lower TFP in data-intensive downstream sectors, Ferracane & van der Marel's (2021) gravity analysis finds data restrictions inhibit trade in digital services, and Bauer, Lee-Makiyama, van der Marel & Verschelde's (2014) GTAP general-equilibrium estimates project GDP losses from localization across seven jurisdictions including Brazil and India. None tests whether sovereignty framing reduces extractive asymmetry or advances local AI capability — so claims on both the benefit and cost sides rest on weak or indirect evidence.
Sources: Ferracane, Kren & van der Marel 2020, 'Do data policy restrictions impact the productivity performance of firms and industries?' (Review of International Economics 28(3):676-722); Ferracane & van der Marel 2021, 'Do data policy restrictions inhibit trade in services?' (Review of World Economics 157(4):727-776); Bauer, Lee-Makiyama, van der Marel & Verschelde 2014, 'The Costs of Data Localisation: Friendly Fire on Economic Recovery' (ECIPE Occasional Paper 3/2014)
AI in Education
The documented harms of educational AI are empirically real and, for proctoring, replicated: a controlled audit of a proctoring tool used by at least ~1,500 institutions found significantly higher facial-detection failure (the trigger for 'suspicious' flags) for darker-skinned and female test-takers (Yoder-Himes et al. 2022), and a technical audit of 164 government-endorsed pandemic learning products found 89% engaged in data practices that risk or infringe children's rights, with most monitoring happening without the child's knowledge or consent (Human Rights Watch 2022). Honest caveat: the benefit side is genuine but highly sensitive to how outcomes are measured rather than uniform — Kulik & Fletcher's meta-analysis of 50 intelligent-tutoring evaluations found an overall median effect of 0.66 SD, but the average effect was 0.73 SD on locally-developed tests versus only 0.13 SD on standardized tests, so much of AI education's apparent value depends on the outcome measure used.
Sources: Yoder-Himes et al. 2022, 'Racial, skin tone, and sex disparities in automated proctoring software', Frontiers in Education 7:881449; Human Rights Watch 2022, 'How Dare They Peep into My Private Life?' (164 EdTech products endorsed by 49 governments; 89% risked/infringed children's rights); Kulik & Fletcher 2016, 'Effectiveness of Intelligent Tutoring Systems: A Meta-Analytic Review', Review of Educational Research 86(1):42-78
There are essentially no rigorous impact evaluations showing that purpose-built governance of educational AI reduces the documented harms. The student-specific regime — California's SOPIPA (SB 1177, 2014, a model that more than 20 states adopted and ~33 considered) and the FTC's May 2022 COPPA ed-tech policy statement (which the agency itself said did not change existing requirements) — has near-zero documented enforcement and no published before/after evaluation of whether it changed vendor data practices or bias outcomes. The only documented remedies came not from education-specific rules but from generic legal levers: a $6.25M biometric-privacy class settlement under Illinois BIPA (Veiga v. Respondus, 2023) and a constitutional ruling that proctoring room-scans are an unreasonable search (Ogletree v. Cleveland State University, N.D. Ohio 2022, Calabrese J.) — neither of which is a replicable evaluation, and both reach private/state actors rather than the underlying demographic-bias harm.
Sources: California SOPIPA (SB 1177, 2014); FTC Policy Statement on Education Technology and COPPA (adopted May 19, 2022); Veiga v. Respondus, Inc. ($6.25M BIPA class settlement, 2023; covers Illinois Respondus Monitor users Nov. 2015–June 2023); Ogletree v. Cleveland State University (N.D. Ohio 2022, Calabrese J., room-scan Fourth Amendment ruling)
AI in Employment
Discrimination and adverse outcomes in employment decisions are empirically well-established, and AI systems demonstrably reproduce them. The foundational field-experiment literature shows robust human baseline discrimination (Bertrand & Mullainathan 2004 found White-sounding names received 50% more callbacks), and AI-specific audits confirm the pattern: Amazon scrapped a recruiting tool that penalized resumes containing 'women's' (Dastin 2018), and a controlled resume-screening audit of language-model retrieval found systems favored White-associated names ~85% of the time and never preferred Black male-associated over White male-associated names (Wilson & Caliskan 2024). On the monitoring side, a meta-analysis (k=94, N≈23,461) found electronic performance monitoring reliably raises worker stress with no evidence of improved performance (Ravid et al. 2023). Honest caveat: measured disparities are highly model-, prompt-, and context-dependent, and most evidence comes from controlled audits and one firm's internal test rather than measured outcomes in live, at-scale hiring pipelines.
Sources: Bertrand & Mullainathan 2004 (American Economic Review 94(4):991-1013); Wilson & Caliskan 2024 (AAAI/ACM AIES; 'Gender, Race, and Intersectional Bias in Resume Screening via Language Model Retrieval'); Dastin 2018 (Reuters, 'Amazon scraps secret AI recruiting tool that showed bias against women'); Ravid, White, Tomczak & Behrend 2023 (Personnel Psychology 76:5-40)
There is no rigorous evidence that governing AI in employment reduces the documented harms; the central evaluated regime appears to fail at the compliance stage before any impact on bias can occur. NYC Local Law 144 — the first jurisdiction worldwide to mandate independent bias audits and public posting for automated employment decision tools — was directly studied across 391 employers and found to produce 'null compliance': the law's discretion makes it impossible to tell whether firms comply, with very few posting the required audits (Wright et al. 2024). Parallel qualitative work shows the audits themselves are undermined by missing demographic data, opaque aggregation, and 'test data' that does not reflect real use (Groves et al. 2024). No study links any AI-employment rule to a measured reduction in discriminatory hiring outcomes — the evidence that the rule works is itself missing, largely because mandated transparency artifacts (audit reports) are sparse, non-standardized, and unenforced.
Sources: Wright, Muenster, Vecchione, Metcalf & Matias et al. 2024 ('Null Compliance: NYC Local Law 144 and the Challenges of Algorithm Accountability', ACM FAccT '24); Groves, Metcalf, Kennedy, Vecchione & Strait 2024 ('Auditing Work: Exploring the New York City algorithmic bias audit regime', ACM FAccT '24); Ravid, White, Tomczak & Behrend 2023 (Personnel Psychology 76:5-40, on monitoring outcomes as the closest analogue evaluation evidence)
Environmental Impact of AI Training
The resource demands of AI compute are empirically documented at the model level: Strubell et al. (2019) quantified large-NLP training energy/carbon, Luccioni et al. (2023) estimated BLOOM's training at ~24.7 tCO2eq (dynamic power) rising to ~50.5 tCO2eq with manufacturing and deployment, Li et al. (2023) estimated GPT-3-scale training in US datacenters can evaporate on the order of hundreds of thousands of litres of freshwater (their central figure ~700,000 L), and Luccioni, Jernite & Strubell (2024) showed generative inference is markedly more energy-intensive per query than task-specific models; at the macro scale the IEA (2024) and de Vries (2023) document rapidly rising datacenter electricity demand. Honest caveat: absolute estimates vary by up to orders of magnitude with grid carbon intensity, hardware, utilisation and accounting boundaries, and cleanly attributing the AI-specific increment (versus general datacenter and crypto growth) remains genuinely contested — the IEA itself bundles AI with datacenters and crypto — so the existence of the footprint is established while its magnitude and trajectory are not.
Sources: Strubell, Ganesh & McCallum 2019 (ACL Anthology P19-1355; 'Energy and Policy Considerations for Deep Learning in NLP'); Luccioni, Viguier & Ligozat 2023 (JMLR 24; BLOOM 176B carbon footprint, 24.7/50.5 tCO2eq; arXiv:2211.02001); Li, Yang, Islam & Ren 2023 (arXiv:2304.03271, 'Making AI Less Thirsty', later Comm. ACM 2025); Luccioni, Jernite & Strubell 2024 (ACM FAccT '24, 'Power Hungry Processing', DOI 10.1145/3630106.3658542); de Vries 2023 (Joule 7(10):2191-2194, DOI 10.1016/j.joule.2023.09.004); IEA 2024 (Electricity 2024)
There is no impact evaluation showing that any AI-specific environmental-governance instrument reduces energy, water or carbon use, because every named instrument is voluntary or non-binding and very recent: EU AI Act Art. 95 codes of conduct are explicitly optional with no sanctions, and NIST AI 600-1 and the G7 Hiroshima Code are guidance, not enforceable caps. The closest analogue evaluation literature is divided in a way that disfavours the voluntary form chosen here: rigorous reviews find voluntary environmental programs generally fail to produce significant abatement beyond business-as-usual (Koehler 2007; Morgenstern & Pizer 2007), whereas the one form with credible positive evidence is mandatory disclosure (Downar et al. 2021 found a UK carbon-reporting mandate cut emissions ~8% versus a control group) which the AI instruments do not yet impose, leaving the proposition that AI environmental governance works essentially untested.
Sources: EU AI Act Art. 95 / Recital 142 (Reg. (EU) 2024/1689); NIST AI 600-1 (2024, GenAI Profile); G7 Hiroshima Process International Code of Conduct (30 Oct 2023); Koehler 2007 (Policy Studies Journal 35(4):689-722); Morgenstern & Pizer (eds.) 2007 (Reality Check, RFF Press); Downar, Ernstberger, Reichelstein, Schwenen & Zaklan 2021 (Review of Accounting Studies 26(3):1137-1175)
AI in Healthcare
Both the benefit and the harm of clinical AI are empirically real and well-documented, but outcomes are highly deployment-dependent. Rigorous prospective studies show genuine clinical value in narrow tasks — the MASAI RCT (>100,000 women) found AI-supported mammography detected ~20% more cancers (6.1 vs 5.1 per 1000 screened) at comparable recall rates (Lang et al. 2023, Lancet Oncology), and IDx-DR's pivotal trial achieved 87.2% sensitivity / 90.7% specificity for diabetic retinopathy (Abramoff et al. 2018, npj Digital Medicine) — yet widely deployed models can fail or harm: the Epic Sepsis Model, live at hundreds of US hospitals, scored AUC 0.63 with 33% sensitivity on external validation (Wong et al. 2021, JAMA Internal Medicine), and a population-health algorithm covering ~200M people understated Black patients' illness because it predicted cost not need (Obermeyer et al. 2019, Science). Honest caveat: there is no single 'AI in healthcare' effect — performance ranges from life-saving to dangerous depending on task, calibration, and whether the model was prospectively validated.
Sources: Lang K, Josefsson V, Larsson A-M, et al. 2023 (Lancet Oncology 24(8):936-944, MASAI trial clinical safety analysis; AI-supported screening detected 6.1 vs 5.1 cancers per 1000, ~20% higher, similar recall rates); Abramoff MD, Lavin PT, Birch M, Shah N, Folk JC. 2018 (npj Digital Medicine 1:39, IDx-DR pivotal trial; 87.2% sensitivity / 90.7% specificity); Wong A, Otles E, Donnelly JP, et al. 2021 (JAMA Internal Medicine 181(8):1065-1070, Epic Sepsis Model external validation; AUC 0.63, 33% sensitivity); Obermeyer Z, Powers B, Vogeli C, Mullainathan S. 2019 (Science 366(6464):447-453, racial bias from cost-as-proxy)
There is essentially no impact-evaluation evidence that the prevailing governance regime for medical AI — FDA authorization, predominantly via the 510(k) substantial-equivalence pathway — measurably reduces patient harm or improves outcomes. Analyses of authorized AI devices find that clinical validation is frequently absent or non-prospective (of 521 FDA-authorized AI devices, ~43% had no published clinical-validation data and only ~28% were prospectively validated; Chouffani El Fassi & Henderson et al. 2024) and that demographic performance is almost never reported (race/ethnicity in 3.6%, and only 9.0% of 692 510(k)/cleared AI devices carried a prospective post-market-surveillance study; Muralidharan et al. 2024). Earlier analysis of 130 cleared devices likewise found 97% were evaluated only retrospectively (Wu et al. 2021). The closest analogue evidence on the pathway itself is discouraging: the Institute of Medicine (2011) concluded the 510(k) process was not designed to assess safety and effectiveness — i.e., no direct study establishes that the rule, as written, prevents the harms it targets. Caveat: this is an absence of impact evaluation plus reporting-gap and design-critique evidence, not a study showing the regime fails to reduce harm.
Sources: Chouffani El Fassi S, Abdullah A, Fang Y, ... Henderson GE, et al. 2024 (Nature Medicine, 'Not all AI health tools with regulatory authorization are clinically validated', s41591-024-03203-3; 521 devices, ~43% no clinical validation, ~28% prospectively validated); Muralidharan V, Adewale BA, Huang CJ, et al. 2024 (npj Digital Medicine 7:273, scoping review of reporting gaps in 692 FDA-approved AI medical devices; race/ethnicity 3.6%, prospective post-market surveillance 9.0%); Wu E, Wu K, Daneshjou R, Ouyang D, Ho DE, Zou J. 2021 (Nature Medicine 27:582-584, analysis of 130 FDA approvals; 97% retrospective-only evaluation); Institute of Medicine 2011 (Medical Devices and the Public's Health: The FDA 510(k) Clearance Process at 35 Years)
International Coordination
The DESCRIPTIVE premise is well-established: IR scholarship now treats global AI governance as a fragmented 'regime complex' of partially overlapping G7/G20/OECD/GPAI/UN/standards-body arrangements with no central hierarchy (Tallberg et al. 2023 — verified verbatim: 'the emerging governance architecture for AI can be described as a regime complex'; Cihon, Maas & Kemp 2020). But the implied HARM — that forum-shopping and regulatory arbitrage cause a measurable race-to-the-bottom or relocate AI development to lax jurisdictions — is largely theorized/anticipated rather than empirically demonstrated for AI; Tallberg et al. explicitly flag forum-shopping as a dynamic whose presence in the AI regime complex is an open empirical question ('Establishing whether these patterns and dynamics are key features also of the AI regime complex stand out as important priorities in future research'). Honest caveat: the strongest empirical arbitrage evidence comes from analogue footloose digital markets (e.g., ICO reallocation after US securities enforcement) — itself a mixed/contested literature — not from AI firms, so the magnitude of coordination-failure harm in AI specifically remains contested and under-measured.
Sources: Tallberg, Erman, Furendal, Geith, Klamberg & Lundgren 2023 (International Studies Review 25(3): viad040); Cihon, Maas & Kemp 2020 (Should AI Governance be Centralised?, AIES '20: 228-234); Lancieri, Edelson & Bechtold 2025 (AI Regulation: Competition, Arbitrage & Regulatory Capture, Theoretical Inquiries in Law 26(1): 239-262)
There are essentially no impact evaluations showing that the negotiated-coordination mode (AI Safety Institute network MoUs, forum-shifting, multilateral declarations) actually produces regulatory convergence or reduces arbitrage — the AISI Network began only as a statement of intent at the Seoul Summit (Seoul Statement of Intent, 21 May 2024) and held its first operational meeting in November 2024, with no defined metrics or outcome studies, so these soft-law instruments are too new to have measurable effects. The closest analogue evidence is mixed and works through DIFFERENT mechanisms than this topic describes: Bradford's Brussels Effect documents de-facto convergence driven by market access rather than negotiated coordination, and the FATF transgovernmental-network literature shows peer-review mutual evaluation can drive AML convergence — but neither evaluates voluntary AI MoU networks, and FATF's effects come with well-documented unintended consequences (de-risking, financial exclusion). The plain finding: the evidence that AI-governance coordination 'works' is itself missing.
Sources: Bradford 2020 (The Brussels Effect: How the European Union Rules the World, Oxford University Press); Nance 2018 (The regime that FATF built: an introduction to the Financial Action Task Force, Crime, Law and Social Change 69(2): 109-129; cf. Slaughter 2004, A New World Order, Princeton University Press); International Network of AI Safety Institutes — Seoul Statement of Intent toward International Cooperation on AI Safety Science (21 May 2024; network's first meeting San Francisco, Nov 2024)
Individual Redress
The premise behind redress — that affected people lack meaningful recourse against automated decisions — is real, but the flagship instrument is weaker than commonly assumed. Wachter, Mittelstadt & Floridi (2017) show GDPR creates only a limited 'right to be informed,' not a binding 'right to explanation' of specific decisions; and controlled work finds the explanations actually delivered do not measurably improve lay decision accuracy over showing the bare AI prediction (Alufaisan et al. 2021; and a 2022 meta-analysis by Schemmer et al. — screening 393 articles down to 9 in the final analysis — reports 'no effect of explanations on users' performance compared to sole AI predictions,' even though XAI overall had a positive effect). Honest caveat: the legitimacy/dignity value of being heard is empirically well established in the procedural-justice tradition even where outcome accuracy is unchanged, so 'redress fails' depends on which aim is measured.
Sources: Wachter, Mittelstadt & Floridi 2017 (International Data Privacy Law 7(2):76); Alufaisan, Marusich, Bakdash, Zhou & Kantarcioglu 2021 (Proceedings of the AAAI Conference on AI 35(8):6618); Schemmer, Hemmer, Nitsche, Kühl & Vössing 2022 (AAAI/ACM AIES '22, meta-analysis)
There is no rigorous impact evaluation showing that mandated redress mechanisms (right-to-explanation, appeal, human-in-the-loop review) actually reduce erroneous or unfair automated decisions — the evidence that the rule works is itself missing. The closest experimental analogues are discouraging: explanations increase humans' acceptance of AI recommendations regardless of correctness (Bansal et al. 2021), and algorithm-in-the-loop oversight can introduce racial disparities and exhibit automation bias rather than reliably catching model errors (Green & Chen 2019). The procedural-justice literature (Tyler 1990; Lind & Tyler 1988) robustly supports a legitimacy and compliance benefit of fair process, but it measures perceived fairness, not reduction of the substantive decision harm redress is meant to cure.
Sources: Bansal, Wu, Zhou, Fok, Nushi, Kamar, Ribeiro & Weld 2021 (CHI '21); Green & Chen 2019 (Disparate Interactions, ACM FAT* '19); Tyler 1990 (Why People Obey the Law, Yale Univ. Press); Lind & Tyler 1988 (The Social Psychology of Procedural Justice, Plenum Press)
Training-Data Rights
That foundation models ingest copyrighted and personal works without consent is undisputed; whether that ingestion produces legally cognizable reproduction harm is genuinely contested. The CS evidence that models can memorize and emit verbatim training text is robust and replicated — Carlini et al. (2021) extracted hundreds of verbatim sequences (including PII) from GPT-2, and follow-up work (Carlini et al., Quantifying Memorization, ICLR 2023) showed extraction scales log-linearly with model size and with example duplication. Honest caveat: verbatim reproduction is the exception, not the norm — the UK High Court held that Stable Diffusion's model weights never stored copies of the training images (defeating the secondary-infringement theory), and Getty abandoned its primary training-infringement claim at trial for lack of evidence, so whether the empirical phenomenon amounts to actionable harm (rather than transient, non-expressive use) remains the open question driving NYT v. OpenAI and parallel regimes.
Sources: Carlini, Tramèr, Wallace, Jagielski, Herbert-Voss, Lee, Roberts, Brown, Song, Erlingsson, Oprea & Raffel 2021 (Extracting Training Data from Large Language Models, 30th USENIX Security Symposium); Carlini, Ippolito, Jagielski, Lee, Tramèr & Zhang 2023 (Quantifying Memorization Across Neural Language Models, ICLR 2023; arXiv:2202.07646); Getty Images (US) Inc & ors v Stability AI Ltd [2025] EWHC 2863 (Ch) (UK High Court, 4 Nov 2025 — no secondary infringement; primary training claim abandoned at trial); The New York Times Co. v. Microsoft Corp. & OpenAI (S.D.N.Y., No. 1:23-cv-11195; consolidated In re OpenAI Copyright Infringement Litigation, Apr. 2025; ongoing 2025-2026)
There is no impact evaluation showing that the CDSM Directive Article 4 TDM exception plus its Article 4(3) opt-out reservation regime actually reduces unlicensed ingestion or channels compensation to rightsholders — the evidence that the rule works as designed is itself missing. The only available evidence is early case law and doctrinal scholarship, which document the mechanism's contested operation rather than its success: in Kneschke v. LAION the Hamburg Higher Regional Court (on appeal, 10 Dec 2025) held that a rights reservation in natural language did NOT satisfy Article 4(3)'s machine-readability requirement, invalidating the opt-out (note: the first-instance Regional Court had left the Article 4 question largely open and the case ultimately turned on the Article 3 scientific-research exception, so this machine-readability holding is appellate and not yet settled — a further appeal to the Federal Court of Justice was permitted). Legal scholars characterize the Article 4 opt-out as practically difficult and unharmonized, with no observed market in TDM licences or systematic enforcement to evaluate.
Sources: Kneschke v. LAION (Hamburg Regional Court, 27 Sept 2024, 310 O 227/23; on appeal Hamburg Higher Regional Court, 10 Dec 2025, 5 U 104/24 — opt-out held not machine-readable; further appeal to BGH permitted); Margoni & Kretschmer 2022 (A Deeper Look into the EU Text and Data Mining Exceptions, GRUR International 71(8):685-701); Quintais 2025 (Generative AI, Copyright and the AI Act, Computer Law & Security Review 56:106107)
Transparency Obligations
Documentation artifacts (model cards, datasheets) are well-specified as proposals and are genuinely adopted, but the empirical premise that mandated disclosure produces meaningful transparency is contested. Selbst & Barocas (2018) argue inscrutability and non-intuitiveness are distinct problems and that disclosing rules does not resolve the latter, and large-scale audits find documentation is sparsely and unevenly completed: a systematic analysis of 32,111 Hugging Face model cards (Liang et al. 2024) found environmental-impact, limitations and evaluation sections least often filled, and Bhat et al. (2023, 45 practitioners) found a substantial gap between the documentation proposal and actual practice. Honest caveat: the documentation frameworks themselves are real and adopted, so the dispute is about whether disclosure conveys decision-relevant information, not whether the artifacts exist.
Sources: Selbst & Barocas 2018 (Fordham Law Review 87:1085-1139); Liang et al. 2024 (Nature Machine Intelligence, s42256-024-00857-z, 'Systematic analysis of 32,111 AI model cards'); Bhat et al. 2023 (CHI '23, 'Aspirations and Practice of ML Model Documentation', DOI 10.1145/3544548.3581518); Mitchell et al. 2019 (FAccT, Model Cards for Model Reporting); Gebru et al. 2021 (CACM 64(12):86-92, Datasheets for Datasets)
There is no rigorous impact evaluation showing that AI transparency mandates (model cards, training-data summaries) measurably reduce bias, misuse or accidents — the central regulatory assumption is empirically untested, partly because flagship mandates like EU AI Act Art. 53(1)(d) GPAI training-data summaries are only subject to AI Office enforcement/verification from 2 August 2026 (the obligation itself began 2 August 2025 for new models). The closest analogue, mandated consumer disclosure, shows small and context-dependent effects: Bollinger, Leslie & Sorensen (2011) found mandatory calorie posting cut average calories per transaction by about 6%, while Loewenstein, Sunstein & Golman (2014) review evidence that disclosure effects are frequently diminished or even reversed by limited attention and often change provider rather than recipient behavior. These are analogues, not AI studies; no study demonstrates that AI transparency disclosure achieves its stated downstream safety aims.
Sources: Bollinger, Leslie & Sorensen 2011 (AEJ: Economic Policy 3(1):91-128); Loewenstein, Sunstein & Golman 2014 (Annual Review of Economics 6:391-419, 'Disclosure: Psychology Changes Everything'); EU AI Act Art. 53(1)(d) GPAI training-data summary (obligation from 2 Aug 2025; AI Office enforcement from 2 Aug 2026)