works
Joe Carlsmith On “first critical tries” in AI alignment online AI Alignment Forum

On “first critical tries” in AI alignment

Joe Carlsmith

LessWrong, June 5, 2024

Abstract

AI Alignment Forum

PDF

First page of PDF