Anthropic Responsible Scaling Policy (RSP) v2 is a voluntary code from the United States, adopted and effective on 2024-10-15. Current status: in force. A first-mover industry safety framework, it introduces the AI Safety Level (ASL) capability-tier vocabulary subsequently adapted by the OpenAI Preparedness Framework and the Google DeepMind Frontier Safety Framework (FSF). Version 2 (October 2024) refines the ASL-3 and ASL-4 capability thresholds, mandates pre-deployment capability evaluations, and commits to a Frontier Red Team. Anthropic is a signatory to the Seoul Frontier AI Safety Commitments, and the policy is cited by name in EU AI Office GPAI Code of Practice drafts.
Scope and obligations
Anthropic Responsible Scaling Policy (RSP) v2 addresses three contested AI-governance topics explicitly and two via general principles.
Topics governed
- Foundation Models / GPAI (governs): RSP v2 §2. The ASL framework applies to frontier model releases.
- Compute-Threshold Reporting (implicit): RSP v2 capability evaluations are triggered by capability rather than pure compute; compute is one signal.
- Transparency Obligations (governs): RSP v2 §5. Public publication of safety determinations and capability evaluation methodology.
- Catastrophic & Existential Risk (governs): RSP v2 §3. The ASL-3 and ASL-4 capability thresholds explicitly target CBRN uplift and autonomous replication.
- International Coordination (implicit): Anthropic is a Seoul Frontier AI Safety Commitments signatory and coordinates with the US and UK AISIs on capability evaluation.
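The "three explicitly, two via general principles" count can be reproduced from the list above. The following is a minimal sketch, assuming the coverage labels are stored in a plain dictionary; the names `TOPIC_COVERAGE` and `tally_coverage` are hypothetical and not part of any Policy Window tooling.

```python
from collections import Counter

# Coverage labels copied from the "Topics governed" list.
# The dictionary layout is an illustrative assumption.
TOPIC_COVERAGE = {
    "Foundation Models / GPAI": "governs",
    "Compute-Threshold Reporting": "implicit",
    "Transparency Obligations": "governs",
    "Catastrophic & Existential Risk": "governs",
    "International Coordination": "implicit",
}


def tally_coverage(coverage):
    """Count how many topics fall under each coverage label."""
    return Counter(coverage.values())


counts = tally_coverage(TOPIC_COVERAGE)
print(counts["governs"], counts["implicit"])  # prints: 3 2
```

Only the three topics labelled "governs" carry forward into the cross-jurisdiction comparison.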
Cross-jurisdiction comparison
How peer instruments treat the topics Anthropic Responsible Scaling Policy (RSP) v2 governs.
| Topic | EU-AIA-2024 | US-EO-14110 | US-EO-14179 | UK-WHITEPAPER-2023 | CN-GENAI-2023 | G7-HIROSHIMA | OECD-AI-PRIN | COE-AI-CONV | UN-RES-2024 | NIST-AI-RMF | BLETCHLEY-2023 | SEOUL-2024 | NIST-AI-RMF-GENAI | CA-SB-1047 | IN-DPDP-2023 | BR-AIBILL-2024 | ASEAN-AI-GUIDE-2024 | AU-AI-STRATEGY-2024 | OPENAI-PREPAREDNESS-2023 | DEEPMIND-FSF-2024 | META-FRONTIER-2024 | UK-US-AISI-MOU-2024 | WH-VOLUNTARY-2023 | SG-MODEL-AI-2024 | JP-METI-AI-2024 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Foundation Models / GPAI | governs | governs | silent | implicit | governs | governs | implicit | implicit | silent | governs | governs | governs | governs | governs | implicit | governs | implicit | silent | governs | governs | governs | governs | governs | governs | governs |
| Transparency Obligations | governs | implicit | silent | implicit | conflicts | governs | governs | governs | implicit | governs | implicit | governs | governs | implicit | implicit | governs | governs | silent | implicit | implicit | governs | implicit | governs | governs | governs |
| Catastrophic & Existential Risk | implicit | governs | silent | implicit | silent | governs | silent | silent | implicit | implicit | governs | governs | governs | governs | silent | governs | silent | silent | governs | governs | governs | implicit | implicit | silent | silent |
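With 25 peer instruments per row, the table is easier to read as per-topic totals. The sketch below is one possible way to tally the rows, assuming they are available as pipe-delimited strings copied from the table above; `parse_row`, `ROWS`, and `tallies` are hypothetical names used only for illustration.

```python
from collections import Counter


def parse_row(row):
    """Split one pipe-delimited table row into (topic, [statuses])."""
    cells = [cell.strip() for cell in row.strip().strip("|").split("|")]
    return cells[0], cells[1:]


# Data rows copied verbatim from the comparison table.
ROWS = [
    "| Foundation Models / GPAI | governs | governs | silent | implicit | governs | governs | implicit | implicit | silent | governs | governs | governs | governs | governs | implicit | governs | implicit | silent | governs | governs | governs | governs | governs | governs | governs |",
    "| Transparency Obligations | governs | implicit | silent | implicit | conflicts | governs | governs | governs | implicit | governs | implicit | governs | governs | implicit | implicit | governs | governs | silent | implicit | implicit | governs | implicit | governs | governs | governs |",
    "| Catastrophic & Existential Risk | implicit | governs | silent | implicit | silent | governs | silent | silent | implicit | implicit | governs | governs | governs | governs | silent | governs | silent | silent | governs | governs | governs | implicit | implicit | silent | silent |",
]

# Map each topic to a Counter of status labels across the peer instruments.
tallies = {}
for row in ROWS:
    topic, statuses = parse_row(row)
    tallies[topic] = Counter(statuses)

for topic, tally in tallies.items():
    print(topic, dict(tally))
```

On these rows the tally shows Foundation Models / GPAI governed by 17 of the 25 peer instruments, Transparency Obligations by 13 (with one marked "conflicts"), and Catastrophic & Existential Risk by 10.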
How to cite this article
APA 7
Policy Window. (2024). Anthropic Responsible Scaling Policy (RSP) v2 [Wiki article]. https://policywindow.org/wiki/anthropic-rsp
Chicago 17
Policy Window. 2024. "Anthropic Responsible Scaling Policy (RSP) v2." Wiki article. https://policywindow.org/wiki/anthropic-rsp.
BibTeX
@misc{policywindow-anthropic-rsp,
title = {Anthropic Responsible Scaling Policy (RSP) v2},
author = {Policy Window},
year = {2024},
howpublished = {Anthropic Responsible Scaling Policy v2 (Oct 2024)},
url = {https://policywindow.org/wiki/anthropic-rsp},
note = {Primary source: https://www.anthropic.com/news/announcing-our-updated-responsible-scaling-policy}
}
Related instruments
- EU AI Act · EU
- Executive Order 14110 on Safe, Secure, Trustworthy AI · US
- G7 Hiroshima AI Process Code of Conduct · G7
- NIST AI Risk Management Framework · US
- Bletchley Declaration on AI Safety · global
- Seoul Declaration on Safe, Innovative and Inclusive AI · global
- NIST AI RMF Generative AI Profile · US
- California SB-1047: Safe and Secure Innovation for Frontier AI Models Act · US
- Brazil AI Bill (PL 2338/2023) · BR
- OpenAI Preparedness Framework · US
- Google DeepMind Frontier Safety Framework · US
- Meta Frontier AI Framework · US
- White House Voluntary AI Commitments · US
- Singapore Model AI Governance Framework for Generative AI · SG
- Japan METI AI Guidelines for Business · JP