works
Holden Karnofsky Why would AI "aim" to defeat humanity? online Today’s AI development methods risk training AIs to be deceptive, manipulative and ambitious. This might not be easy to fix as it comes up.

Why would AI "aim" to defeat humanity?

Holden Karnofsky

Cold Takes, November 29, 2022

Abstract

Today’s AI development methods risk training AIs to be deceptive, manipulative and ambitious. This might not be easy to fix as it comes up.

PDF

First page of PDF