Why would you possibly assume that a deep, intelligent understanding of life, consciousness, joy, and suffering has zero correlation with caring about these things?
The orthogonality thesis says that an AI can have any combination of intelligence and goals, not that P(goal = x | intelligence = y) = P(goal = x) for all x and y. It depends entirely on how the AI is built. People like Rohin Shah assign significant probability to alignment by default, at least last I heard.
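Roughly, the distinction can be written out like this (a sketch in my own notation; the quantified form isn't how Bostrom states it in the paper):

\[
  % Orthogonality thesis: a possibility claim over agent designs --
  % for every goal x and intelligence level y, some possible agent has both.
  \forall x\,\forall y\;\exists A:\ \mathrm{goal}(A)=x \ \wedge\ \mathrm{intelligence}(A)=y
\]
\[
  % What it does NOT claim: statistical independence of goals and
  % intelligence over the AIs we actually end up building.
  P(\mathrm{goal}=x \mid \mathrm{intelligence}=y) \;=\; P(\mathrm{goal}=x)
  \quad\text{for all } x,\,y
\]

The first is a claim about what is possible in design space; the second would be a claim about what is likely, which depends on how the AI is built.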
It’s worth noting (and the video acknowledges this) that “Maybe it’s more like raising a child than putting a slave to work” is a very, very different statement from “You just have to raise it like a kid”.
In particular, there is no “just” about raising a kid to have good values—especially when the kid isn’t biologically yours and quickly grows to be more intelligent than you are.
You know, I’m not sure I remember. You tend to pick this stuff up if you hang around LW long enough.
I’ve tried to find a primer. The Superintelligent Will by Nick Bostrom seems good.
The orthogonality thesis (also part of the paper I linked above).
Edit: also, this video was recommended to me.