(non-hypothetical Q): What if it has a horizon of 10^-8s? Or 0?
With 0, the AI never does anything and so is basically a rock. With 10^-8, it still seems rational and competently goal-directed to me, just with weird-to-me preferences.
I believe time-inconsistent preferences are something that many would say seem irrational
Really? I feel like that at least depends on what the preference is. I can easily imagine people with preferences like: win at least one Olympic medal, with further medals mattering less (which is history-dependent); be the youngest person to achieve <some achievement> (which is finite horizon); or eat ice cream in the next half hour, but not care much after that.
You might object that all of these can be made state-dependent, but you can make your example state-dependent by including the current time in the state.
I agree that we are probably not going to build superintelligent AIs that have a horizon of 10^-8s, just because our preferences don’t have horizons of 10^-8s, and we’ll try to build AIs that optimize our preferences.
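The "include the current time in the state" move can be made concrete with a small sketch. This is an illustrative toy, not any particular RL library's API; the names `TimeInState` and `reward` are made up for the example. The point is that a time-dependent preference ("only the first 5 steps matter") becomes an ordinary state-dependent reward once the timestep is part of the state.

```python
# Hypothetical sketch: making a finite-horizon preference state-dependent
# by folding the current timestep into the state. All names are
# illustrative, not from any particular library.

class TimeInState:
    """Wraps environment observations so the timestep is part of the state."""
    def __init__(self, horizon):
        self.horizon = horizon
        self.t = 0

    def observe(self, base_state):
        # The augmented state is (base_state, t); a reward function over
        # this state can encode time-dependent preferences directly.
        s = (base_state, self.t)
        self.t += 1
        return s

def reward(aug_state, base_reward, horizon=5):
    _base, t = aug_state
    # Zero reward after the horizon: a "finite horizon" preference,
    # expressed as a state-dependent reward.
    return base_reward if t < horizon else 0.0

env = TimeInState(horizon=5)
aug_states = [env.observe("room") for _ in range(7)]
```

(The objection below, that a real agent only has an *impression* of the time rather than the time itself, is about whether `t` here can actually be part of the state, as opposed to an estimate reconstructed from observations.)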
With 0, the AI never does anything and so is basically a rock
I’m trying to point at “myopic RL”, which does, in fact, do things.
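A minimal sketch of what "myopic RL" means here, using a made-up two-armed bandit (the agent and rewards are illustrative, not any specific system): the agent optimizes only immediate reward, with no bootstrapping from future value, yet it still selects actions, unlike a horizon-0 agent.

```python
import random

# Myopic RL sketch (discount = 0, i.e. horizon of one step): the agent
# learns only the immediate reward of each action and acts greedily on
# that. The two-armed bandit below is an illustrative toy.

def myopic_agent(reward_fn, actions, steps=1000, eps=0.1):
    q = {a: 0.0 for a in actions}  # estimated *immediate* reward per action
    n = {a: 0 for a in actions}
    for _ in range(steps):
        if random.random() < eps:
            a = random.choice(actions)       # occasional exploration
        else:
            a = max(q, key=q.get)            # greedy on one-step reward
        r = reward_fn(a)
        n[a] += 1
        q[a] += (r - q[a]) / n[a]  # running average; no term for future
                                   # reward, because the agent ignores it
    return q

random.seed(0)
q = myopic_agent(lambda a: 1.0 if a == "press" else 0.0,
                 actions=["press", "wait"])
```

Such an agent does, in fact, do things: it reliably learns to press the rewarding lever. What it won't do is sacrifice reward now for reward later.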
You might object that all of these can be made state-dependent, but you can make your example state-dependent by including the current time in the state.
I do object, and still object, since I don’t think we can realistically include the current time in the state. What we can include is: an impression of what the current time is, based on past and current observations. There’s an epistemic/indexical problem here you’re ignoring.
I’m not an expert on AIXI, but my impression from talking to AIXI researchers and looking at their papers is: finite-horizon variants of AIXI have this “problem” of time-inconsistent preferences, despite conditioning on the entire history (which basically provides an encoding of time). So I think the problem I’m referring to exists regardless.
I’m trying to point at “myopic RL”, which does, in fact, do things.
Ah, an off-by-one miscommunication. Sure, it’s both rational and competently goal-directed.
I do object, and still object, since I don’t think we can realistically include the current time in the state.
I mean, if you want to go down that route, then “win at least one medal” is also not state-dependent, because you can’t realistically include “whether Alice has won a medal” in the state: you can only include an impression of whether Alice has won a medal, based on past and current observations. So I still have the same objection.
finite-horizon variants of AIXI have this “problem” of time-inconsistent preferences
Oh, I see. You probably mean AI systems that act as though they have goals that will only last for e.g. 5 seconds. Then, 2 seconds later, they act as though they have goals that will last for 5 more seconds, i.e. 7 seconds after the initial time. (I was thinking of agents that initially care about the next 5 seconds, and then after 2 seconds, they care about the next 3 seconds, and after 7 seconds, they don’t care about anything.)
I agree that the preferences you were talking about are time-inconsistent, and such agents seem both less rational and less competently goal-directed to me.
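The two readings of "a 5-second horizon" from the exchange above can be sketched as follows (all numbers illustrative). The receding version is the time-inconsistent one: the agent's t=0 self and t=2 self disagree about whether t=6 matters. The shrinking version is time-consistent: later selves care about a subset of what earlier selves cared about.

```python
# Two readings of "the agent has a 5-second horizon".

def receding_horizon(now, h=5):
    # At every time `now`, the agent cares about the *next* h seconds.
    # The window it cares about moves with it: time-inconsistent.
    return (now, now + h)

def shrinking_horizon(now, h=5):
    # The agent cares about a fixed window [0, h); once `now` reaches h,
    # it cares about nothing. Time-consistent, then inert.
    return (now, h) if now < h else None
```

Under the receding reading, `receding_horizon(0)` is the window [0, 5) while `receding_horizon(2)` is [2, 7), so the two selves have genuinely different preferences about t=6; under the shrinking reading, `shrinking_horizon(2)` is [2, 5), compatible with the t=0 preferences, and `shrinking_horizon(7)` is `None`.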