Planned summary for the Alignment Newsletter:

This post introduces the concept of the _locality_ of a goal, that is, how "far away" the target of the goal is. For example, a thermostat's "goal" is very local: it "wants" to regulate the temperature of this room, and doesn't "care" about the temperature of the neighboring house. In contrast, a paperclip maximizer has an extremely nonlocal goal, since it "cares" about paperclips anywhere in the universe. We can also consider whether the goal depends on the agent's internals, its input, its output, and/or the environment.
The concept is useful because for extremely local goals (usually goals about the internals or the input) we would expect wireheading or tampering, whereas for extremely nonlocal goals we would instead expect convergent instrumental subgoals such as resource acquisition.
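To make the distinction concrete, here is a minimal Python sketch (not from the post; the `World`, `thermostat_reward`, and `paperclip_reward` names are hypothetical) contrasting a local goal, which depends only on the agent's own sensor reading, with a nonlocal goal, which depends on the environment as a whole:

```python
# Toy illustration of goal "locality": a local goal reads only the agent's
# own sensor, while a nonlocal goal depends on the state of the whole world.
# All names here are illustrative assumptions, not taken from the post.

from dataclasses import dataclass


@dataclass
class World:
    room_temperature: float         # what the thermostat's sensor measures
    paperclips_everywhere: int = 0  # a global fact about the environment
    sensor_noise: float = 0.0       # offset between the room and the reading


def thermostat_reward(world: World, target: float = 21.0) -> float:
    # Local goal: depends only on the reading available at the device itself,
    # so manipulating the sensor is enough to "satisfy" it (wireheading).
    reading = world.room_temperature + world.sensor_noise
    return -abs(reading - target)


def paperclip_reward(world: World) -> float:
    # Nonlocal goal: depends on the environment as a whole, so improving it
    # pushes toward instrumental subgoals like acquiring more resources.
    return float(world.paperclips_everywhere)


if __name__ == "__main__":
    w = World(room_temperature=19.0, paperclips_everywhere=42)
    print(thermostat_reward(w))  # -2.0: only the local reading matters
    print(paperclip_reward(w))   # 42.0: only the global count matters
```

In this sketch, wireheading corresponds to the local agent finding it easier to change `sensor_noise` (its own input) than the room itself, whereas the nonlocal agent can only raise its reward by changing the world at large.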
Thanks for the summary! It represents the idea well.
Just out of curiosity, how do you decide which posts/papers you want to write an opinion on?
I ask myself if there’s anything in particular I want to say about the post / paper that the author(s) didn’t say, with an emphasis on ensuring that the opinion has content. If yes, then I write it.
(Sorry, that’s not very informative, but I don’t really have a system for it.)
No worries, that’s a good answer. I was just curious, not expecting a full-fledged system. ;)