Core views on AI safety: when, why, what, and how

Anthropic

Anthropic, March 8, 2023

Abstract

AI progress may lead to transformative AI systems in the next decade, but we do not yet understand how to make such systems safe and aligned with human values. In response, we are pursuing a variety of research directions aimed at better understanding, evaluating, and aligning AI systems.
