Thomas Larsen answers Best introductory overviews of AGI safety?

Thomas Larsen 13 Dec 2022 21:13 UTC
6 points
0
My favorite for AI researchers is Ajeya’s Without specific countermeasures, because I think it does a really good job being concrete about a training set up leading to deceptive alignment. It also is sufficiently non-technical that a motivated person not familiar with AI could understand the key points.
- JakubK 13 Dec 2022 22:04 UTC
  2 points
  0
  Parent
  Forgot to include this. It’s sort of a more opinionated and ML-focused version of Carlsmith’s report and has a corresponding video/talk (as does Carlsmith).