Emergent Deception and Emergent Optimization
LessWrong, February 20, 2023
Abstract
[Note: this post was drafted before Sydney (the Bing chatbot) was released, but Sydney provides particularly good examples of some of the issues I discuss below. I’ve therefore added a few S…
