Wei Dai comments on Morality is Scary

Wei Dai 3 Dec 2021 3:34 UTC
9 points
The point I was trying to make with the quote is that many people are not motivated to do “rational reflection on morality” or examine their value systems to see if they would “survive full logical and empirical information”. In fact they’re motivated to do the opposite, to protect their value systems against such reflection/examination. I’m worried that alignment researchers are not worried enough that if an alignment scheme causes the AI to just “do what the user wants”, that could cause a lock-in of crazy value systems that wouldn’t survive full logical and empirical information.
What links here?
- Wei Dai's comment on Considerations on interaction between AI and expected value of the future by Beth Barnes (14 Dec 2021 1:02 UTC; 11 points)