Anthropic Responsible Scaling Policy (RSP) v2
ANTHROPIC-RSP-2024 · US
Anthropic Responsible Scaling Policy (RSP) v2 is a voluntary code from the US, adopted on 2024-10-15 and effective the same day. Current status: in force. It is a first-mover industry safety framework that introduces the AI Safety Level (ASL) capability-tier vocabulary, which OpenAI's Preparedness Framework and DeepMind's Frontier Safety Framework (FSF) subsequently adapted. v2 (Oct 2024) refines the ASL-3/ASL-4 capability thresholds, mandates pre-deployment capability evaluations, and commits to a Frontier Red Team. Anthropic is a signatory of the Seoul Frontier AI Safety Commitments, and the RSP is cited by name in drafts of the EU AI Office GPAI Code of Practice.
NOTE (iter-314): the RSP is a versioned, evolving artefact. This row pins v2 (Oct 2024) as the load-bearing reference, but Anthropic publishes incremental updates on its policy page. Citers tracking specific ASL-4 threshold language should confirm it against the current version on anthropic.com; the catalog re-pins on the next Coverage Games event.
Scope and obligations
Anthropic Responsible Scaling Policy (RSP) v2 addresses 4 contested AI-governance topics explicitly and 6 via general principles.
Topics governed
- governs · Foundation Models / GPAI: RSP v2 §2, ASL framework applies to frontier model releases
- implicit · Compute-Threshold Reporting: RSP v2 capability evaluations are triggered by capability rather than pure compute; compute is one signal
- governs · Transparency Obligations: RSP v2 §5, public publication of safety determinations and capability eval methodology
- governs · Catastrophic & Existential Risk: RSP v2 §3, ASL-3 / ASL-4 capability thresholds explicitly target CBRN uplift and autonomous replication
- implicit · International Coordination: Seoul Frontier AI Safety Commitments signatory; coordinates with US and UK AISIs on capability evaluation
- governs · Agentic AI Governance: RSP v2, ASL thresholds include 'autonomous AI replication' and agentic capability evaluations
- implicit · Open-Weight Frontier Release: RSP applies to Anthropic's models, which are closed-weight; the framework does not address third-party open release
- implicit · Synthetic Content Provenance: deployment-stage controls would include content provenance where the capability tier requires it
- implicit · AI in Elections: Anthropic's election-integrity acceptable-use policies apply to deployments; not in the RSP itself
- implicit · Compute + Model-Weight Export Controls: ASL-3+ tiers include model-weight access controls (recipient-restriction analog)
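The split between explicitly governed and implicitly covered topics can be checked mechanically. The following is a minimal Python sketch, assuming the labels in the list above are transcribed by hand into a dictionary; the names `COVERAGE` and `tally` are hypothetical and not part of the Policy Window catalog.

```python
from collections import Counter

# Coverage labels transcribed by hand from the "Topics governed" list.
# The dictionary name and layout are assumptions made for illustration.
COVERAGE = {
    "Foundation Models / GPAI": "governs",
    "Compute-Threshold Reporting": "implicit",
    "Transparency Obligations": "governs",
    "Catastrophic & Existential Risk": "governs",
    "International Coordination": "implicit",
    "Agentic AI Governance": "governs",
    "Open-Weight Frontier Release": "implicit",
    "Synthetic Content Provenance": "implicit",
    "AI in Elections": "implicit",
    "Compute + Model-Weight Export Controls": "implicit",
}


def tally(coverage: dict) -> Counter:
    """Count how many topics carry each coverage label."""
    return Counter(coverage.values())


counts = tally(COVERAGE)
print(counts["governs"], counts["implicit"])  # prints: 4 6
```

The tally matches the "4 explicitly, 6 via general principles" summary for this instrument.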
Cross-jurisdiction comparison
How peer instruments treat the topics Anthropic Responsible Scaling Policy (RSP) v2 governs.
| Topic | EU-AIA-2024 | US-EO-14110 | US-EO-14179 | UK-WHITEPAPER-2023 | CN-GENAI-2023 | G7-HIROSHIMA | OECD-AI-PRIN | COE-AI-CONV | UN-RES-2024 | NIST-AI-RMF | BLETCHLEY-2023 | SEOUL-2024 | NIST-AI-RMF-GENAI | CA-SB-1047 | IN-DPDP-2023 | BR-AIBILL-2024 | ASEAN-AI-GUIDE-2024 | AU-AI-STRATEGY-2024 | OPENAI-PREPAREDNESS-2023° | DEEPMIND-FSF-2024° | META-FRONTIER-2024° | UK-US-AISI-MOU-2024 | WH-VOLUNTARY-2023 | SG-MODEL-AI-2024 | JP-METI-AI-2024 | NYC-LL-144-2021 | CO-SB-24-205 | IL-HB-3773-2024 | EU-GDPR-2016 | EU-GPAI-COP-2025 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Foundation Models / GPAI | governs | governs | silent | implicit | governs | governs | implicit | implicit | silent | governs | governs | governs | governs | governs | implicit | governs | implicit | silent | governs | governs | governs | governs | governs | governs | governs | silent | silent | silent | silent | governs |
| Transparency Obligations | governs | implicit | silent | implicit | conflicts | governs | governs | governs | implicit | governs | implicit | governs | governs | implicit | implicit | governs | governs | silent | implicit | implicit | governs | implicit | governs | governs | governs | silent | silent | silent | governs | governs |
| Catastrophic & Existential Risk | implicit | governs | silent | implicit | silent | governs | silent | silent | implicit | implicit | governs | governs | governs | governs | silent | governs | silent | silent | governs | governs | governs | implicit | implicit | silent | silent | silent | silent | silent | silent | governs |
| Agentic AI Governance | implicit | silent | silent | silent | implicit | implicit | silent | implicit | silent | implicit | implicit | governs | governs | silent | silent | implicit | silent | silent | governs | governs | implicit | implicit | silent | silent | silent | silent | silent | silent | silent | silent |
° = industry self-imposed voluntary framework. Comparing a voluntary code's "governs" tint with a binding regulation's "governs" tint flattens the legal-force distinction; use the instrument-page banner for the operative status of each instrument.
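One way to keep the ° distinction visible when reading a row is to separate marked from unmarked columns before comparing cells. The sketch below is illustrative only: it copies six columns of the Agentic AI Governance row from the table above, and the function name `split_by_marker` is hypothetical.

```python
# A subset of header cells and the matching cells of the
# "Agentic AI Governance" row, copied from the comparison table.
HEADER = [
    "SEOUL-2024",
    "NIST-AI-RMF-GENAI",
    "CA-SB-1047",
    "OPENAI-PREPAREDNESS-2023°",
    "DEEPMIND-FSF-2024°",
    "META-FRONTIER-2024°",
]
AGENTIC_ROW = ["governs", "governs", "silent", "governs", "governs", "implicit"]

MARK = "°"  # footnote marker: industry self-imposed voluntary framework


def split_by_marker(header, row, label="governs"):
    """Return (unmarked, marked) instrument IDs whose cell equals `label`."""
    unmarked, marked = [], []
    for name, cell in zip(header, row):
        if cell != label:
            continue
        if name.endswith(MARK):
            marked.append(name.rstrip(MARK))
        else:
            unmarked.append(name)
    return unmarked, marked


unmarked, marked = split_by_marker(HEADER, AGENTIC_ROW)
print("unmarked:", unmarked)
print("industry self-imposed:", marked)
```

An unmarked column is not necessarily binding; the marker only flags industry self-imposed frameworks, so the instrument-page banner remains the source for operative status.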
How to cite this article
APA 7
Policy Window. (2024). Anthropic Responsible Scaling Policy (RSP) v2 [Wiki article — Instrument]. https://policywindow.org/wiki/anthropic-rsp
Chicago 17
Policy Window. 2024. "Anthropic Responsible Scaling Policy (RSP) v2." Wiki article (Instrument). https://policywindow.org/wiki/anthropic-rsp.
BibTeX
@misc{policywindow-anthropic-rsp,
title = {Anthropic Responsible Scaling Policy (RSP) v2},
author = {Policy Window},
year = {2024},
howpublished = {Anthropic Responsible Scaling Policy v2 (Oct 2024)},
url = {https://policywindow.org/wiki/anthropic-rsp},
note = {Primary source: https://www.anthropic.com/news/announcing-our-updated-responsible-scaling-policy}
}
References
- Anthropic Responsible Scaling Policy v2 (Oct 2024)
Cite this article
Persistent identifier: https://policywindow.org/wiki/anthropic-rsp, a committed-stable URL with content versioning via ?asOf= (rollout pending per methodology §7). DOIs via Zenodo are on the roadmap.
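A citer who wants to pin a specific revision could append the `?asOf=` parameter to the persistent URL. The sketch below is an assumption-laden illustration: the source names the parameter but not its value format, so the ISO date used here is a guess, and the feature is described as not yet rolled out. The helper name `pinned_url` is hypothetical.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

BASE_URL = "https://policywindow.org/wiki/anthropic-rsp"


def pinned_url(base: str, as_of: str) -> str:
    """Append an asOf query parameter to the persistent URL.

    ASSUMPTION: the value is an ISO 8601 date; the catalog does not
    document the accepted format, and the feature is pending rollout.
    """
    parts = urlsplit(base)
    query = dict(parse_qsl(parts.query))  # keep any existing parameters
    query["asOf"] = as_of
    return urlunsplit(parts._replace(query=urlencode(query)))


print(pinned_url(BASE_URL, "2024-10-15"))
```

Until the rollout is confirmed, recording the access date alongside the plain URL is the safer citation practice.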