works
Geoffrey Irving and Amanda Askell AI safety needs social scientists article Properly aligning advanced AI systems with human values will require resolving many uncertainties related to the psychology of human rationality, emotion, and biases. These can only be resolved empirically through experimentation — if we want to train AI to do what humans want, we need to study humans.

AI safety needs social scientists

Geoffrey Irving and Amanda Askell

Distill, vol. 4, no. 2, 2019

Abstract

Properly aligning advanced AI systems with human values will require resolving many uncertainties related to the psychology of human rationality, emotion, and biases. These can only be resolved empirically through experimentation — if we want to train AI to do what humans want, we need to study humans.

PDF

First page of PDF