TurnTrout comments on Open Problems with Myopia

TurnTrout 10 Mar 2021 19:30 UTC
LW: 3 AF: 3
AF
In some sense, agents that press the button will engage in deception; both agents trade reward now for more reward later.
I don’t understand—isn’t the opposite true here?
- Mark Xu 10 Mar 2021 19:52 UTC
  LW: 2 AF: 1
  AF Parent
  Yep—I switched the setup at some point and forgot to switch this sentence. Thanks.
  - Evan R. Murphy 20 Apr 2022 7:12 UTC
    1 point
    AF Parent
    I think there may be another leftover from the old setup:
    
    We are interested in creating agents that robustly do not press the button.
    
    Shouldn’t this be interested in creating agents that robustly do press the button? I.e. then they’re reliably myopic. Or am I misunderstanding something?
    - Mark Xu 21 Apr 2022 14:15 UTC
      2 points
      Parent
      Yep, thanks. Fixed.