Ah, good point! I have a feeling this is a central issue that is hardly discussed here (or anywhere).
Will MacAskill calls this the “actual alignment problem”.
Wei Dai has written a lot about related concerns in posts like “The Argument from Philosophical Difficulty”.