AI “safety” vs “control” vs “alignment”

Paul Christiano

AI Alignment, November 19, 2016

Abstract

AI safety, control, and alignment are increasingly specific problems related to the risks posed by AI, especially powerful AI. AI safety encompasses all these problems and more. AI control ensures that AI systems attempt to do the right thing by preventing them from pursuing the wrong one. Value alignment entails understanding how to create AI systems that align with human preferences. – AI-generated abstract.

AI “safety” vs “control” vs “alignment”

Abstract

PDF