1. Responsible Scaling and ASLs
This section explains how Anthropic’s Responsible Scaling Policy (RSP v2.2) and ASL tiers structure the way we think about risk. Instead of treating “frontier model” as a single category, RSP ties capability growth to specific safety and security thresholds.
In RSP v2.2, Anthropic organises frontier model risk into capability bands (ASL tiers). Each band is associated with requirements for evaluations, red-teaming, monitoring, and security controls, and with a commitment not to train or deploy systems that exceed those safeguards. In other words, scaling model capabilities and scaling safety measures are linked.
For counsel, the practical consequence is that discussions about duties and foreseeable misuse later in this pack are anchored to those tiers. When a scenario assumes a certain capability level or deployment pattern, you can ask: which ASL tier does this correspond to, and have the associated commitments been met?
How to use this in legal reasoning
- Use ASL tiers as a shorthand for “what safety bar Anthropic has committed to” for a given class of system.
- When assessing a scenario in the Foreseeable Misuse or Penumbral packs, check whether the assumed safeguards line up with the relevant ASL tier.
- In live matters, use RSP language to ground conversations about acceptable risk and escalation duties when capabilities increase.