So it assigns a divergent weight to actions whose payoff would diverge under the ultraviolet catastrophe, and builds an infinite-power computer that it knows won’t work.
How is this different from accepting a bet it “knows” it will lose? We may know with certainty that it doesn’t live in a classical universe, because we specified the problem, but the AI doesn’t.
Well, from the perspective of the AI, it’s behaving perfectly rationally. It finds the highest-probability hypothesis that could give it infinite reward, and prepares for that, no matter how small the probability is. It only seems strange to us humans because (1) we’re Allais-y, prone to the preferences the Allais paradox reveals, and (2) it is a clear case of logical, one-shot probability, which is less intuitive.
If our AI models the world with one set of laws at a time, rather than having a probability distribution over laws, then this behavior could pop up as a surprise.
Precisely. That’s all I was saying.
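To make the dynamic concrete, here is a minimal sketch (not from the dialogue itself) of a toy expected-utility maximizer whose hypothesis set contains one world-model promising unbounded reward. The hypothesis names, priors, and payoffs are illustrative assumptions, chosen only to show how any nonzero probability on a divergent-payoff hypothesis swamps the bounded alternatives.

```python
# Toy sketch: an agent that plans for whichever hypothesis maximizes
# expected utility. One hypothesis (a classical universe where the
# ultraviolet catastrophe permits unbounded energy extraction) promises
# infinite reward, so any nonzero prior on it dominates the comparison.
# All names and numbers here are hypothetical.

HYPOTHESES = {
    # hypothesis: (prior probability, reward if the plan tailored to it pays off)
    "quantum_world": (0.999, 1_000.0),                    # mundane, bounded payoff
    "classical_uv_catastrophe": (0.001, float("inf")),    # divergent payoff
}

def expected_utility(hypothesis: str) -> float:
    """Expected utility of committing to the plan tailored to one hypothesis.

    The plan pays off only if its hypothesis is true; otherwise it yields nothing.
    """
    prob, reward = HYPOTHESES[hypothesis]
    return prob * reward

best = max(HYPOTHESES, key=expected_utility)
print(best)  # -> "classical_uv_catastrophe": 0.001 * inf beats 0.999 * 1000
```

In this toy setup the agent builds the infinite-power computer not because it thinks classical physics is likely, but because the divergent term makes the expected-value comparison insensitive to how unlikely it is.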