that is able to give “better” feedback than the human could – feedback that considers more consequences more accurately in more complex decision situations, that has spent more effort introspecting
This is roughly how I would run ALBA in practice, and why I said it was better in practice than in theory. I'd work with the considerations I mentioned in this post and try to formalise how to extend utilities/rewards to new settings.
If I read Paul’s post correctly, ALBA is supposed to do this in theory—I don’t understand the theory/practice distinction you’re making.
I disagree. I’m arguing that the concept of “aligned at a certain capacity” makes little sense, and this is key to ALBA in theory.