Alignment Tax
The potential reduction in model capabilities or performance that results from implementing safety measures, alignment techniques or responsible AI constraints.
In Plain Language
The trade-off between making AI safe and making it capable. Adding safety features sometimes makes the AI slightly less powerful or flexible; that's the "tax" you pay for safety.
Why This Matters
Understanding the alignment tax helps governance teams make informed trade-offs between AI capability and safety. Your governance framework should explicitly address how these trade-offs are evaluated and who has authority to make them.
