lukeprog comments on Introducing Corrigibility (an FAI research subfield)

lukeprog 2 Nov 2014 23:14 UTC
0 points
Not sure if this is what you’re thinking of, but there’s a research area called “adjustable autonomy” and a few other names, which superficially sounds similar but isn’t actually getting at the problem described here, which comes about due to convergent instrumental values in sufficiently advanced agents.