Ben Pace comments on List of resolved confusions about IDA

Ben Pace 9 Oct 2019 0:17 UTC
LW: 4 AF: 2
AF
You have a section titled
learning user preferences for corrigibility isn’t enough for corrigible behavior
Would this be more consistently titled “Learning narrow preferences for corrigibility isn’t enough for corrigible behavior”?