But why would that strategy be selected? Where does the selection come from? Will the designers toss a really impressive AI for not getting reward on that one timestep? I think not.
> And once it has the ability, it is likely to be pushed in the direction of exercising that ability (since doing so would increase its reward).
> But why would that strategy be selected? Where does the selection come from? Will the designers toss a really impressive AI for not getting reward on that one timestep? I think not.
Why? I maintain that the agent would not do so unless it were already terminally motivated by reward. As an empirical example: neuroscientists know that brain-stimulation reward produces very high reward, and the brain very likely implements some kind of reinforcement learning, so why don’t neuroscientists wirehead themselves?
I was talking about gradient descent here, not designers.
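To make the kind of selection at issue concrete: in a policy-gradient setup, gradient ascent upweights whichever actions happened to be followed by reward, with no designer in the loop deciding what to keep. A minimal sketch of this dynamic (the bandit environment, reward values, and hyperparameters are all hypothetical):

```python
import numpy as np

# Minimal REINFORCE-style bandit: the update rule upweights whatever
# actions happened to precede reward. The "selection" is performed by
# gradient ascent, not by a designer inspecting the policy.

rng = np.random.default_rng(0)
n_actions = 3
logits = np.zeros(n_actions)  # tabular softmax policy over a single state

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

# Hypothetical rewards: action 2 plays the role of the high-reward
# "exercise the ability" action.
rewards = [0.1, 0.5, 1.0]

lr = 0.1
for _ in range(2000):
    probs = softmax(logits)
    a = rng.choice(n_actions, p=probs)
    # Gradient of log pi(a) for a softmax policy: one-hot(a) - probs.
    grad_log_pi = -probs
    grad_log_pi[a] += 1.0
    logits += lr * rewards[a] * grad_log_pi  # reward-weighted upweighting

print(softmax(logits))  # probability mass concentrates on action 2
```

Nothing in this update requires the policy to represent or care about reward; reward enters only as a weight on the parameter update, which is the distinction the exchange above turns on.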