AIXI-like agents can be embedded in uncomputable worlds. So I’m not sure your post has much to do with embeddedness. You’re just pointing out that AIXI is a poor metaphor when there are resource constraints, no matter if the agent is embedded or not. Sure, I agree with that.
My argument isn’t specialized to AIXI — note that I also used LIA as an example, which has a weaker R along with a weaker S.
Likewise, if you put AIXI in a world whose parts can do uncomputable things (like AIXI itself), you get the same pattern one level up. Your S is stronger, since it now includes uncomputable strategies, but by the same token you lose AIXI’s optimality: AIXI only searches over computable strategies, and you would have to consider all strategies (including the uncomputable ones) to certify optimality. This leads to a rule R distinct from AIXI, just as AIXI is distinct from a Turing machine.
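To make the pattern concrete, here is a rough sketch using the standard AIXI value-maximization formula (notation loosely following Hutter; the comment above doesn’t spell this out, so treat it as an illustration rather than part of the argument):

$$a_k \;=\; \arg\max_{a_k} \sum_{o_k r_k} \cdots \max_{a_m} \sum_{o_m r_m} \big[\, r_k + \cdots + r_m \,\big] \sum_{q \,:\, U(q,\, a_1 \ldots a_m) \,=\, o_1 r_1 \ldots o_m r_m} 2^{-\ell(q)}$$

The inner sum ranges only over computable environment programs q (weighted by their length $\ell(q)$ under the universal machine $U$), so AIXI’s optimality claim is relative to the class of computable strategies and environments — its S. But the argmax over that Solomonoff mixture is itself incomputable, so AIXI (the R) sits outside its own S, and enlarging S to include uncomputable strategies like AIXI breaks the optimality claim and forces a stronger R one level up.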
I guess it’s conceivable that this hits a fixed point at this level or some higher level? That would be abstractly interesting but not very relevant to embeddedness in the kind of world I think I inhabit.
Have you seen papers like this one? Embedded AIXIs converge on a Nash equilibrium against each other; that’s optimal enough, so you don’t need to go up another level. I agree it’s not very relevant to our world, but the difference isn’t a matter of embeddedness; the only difference is resource constraints.
I was not aware of these results—thanks. I’d glanced at the papers on reflective oracles but mentally filed them as just about game theory, when of course they are really very relevant to the sort of thing I am concerned with here.
We have a remaining semantic disagreement. I think you’re using “embeddedness” quite differently from how it’s used in the “Embedded World-Models” post. For example, in that post (text version):
In a traditional Bayesian framework, “learning” means Bayesian updating. But as we noted, Bayesian updating requires that the agent start out large enough to consider a bunch of ways the world can be, and learn by ruling some of these out.
Embedded agents need resource-limited, logically uncertain updates, which don’t work like this.
Unfortunately, Bayesian updating is the main way we know how to think about an agent progressing through time as one unified agent. The Dutch book justification for Bayesian reasoning is basically saying this kind of updating is the only way to not have the agent’s actions on Monday work at cross purposes, at least a little, to the agent’s actions on Tuesday.
Embedded agents are non-Bayesian. And non-Bayesian agents tend to get into wars with their future selves.
The 2nd and 4th paragraphs here are clearly false for reflective AIXI. And the 2nd paragraph implies that embedded agents are definitionally resource-limited. There is a true and important sense in which reflective AIXI can be “embedded” (that was the point of coming up with it!), but the Embedded Agency sequence seems to exclude this kind of case when it talks about embedded agents. This strikes me as something I’d like to see clarified by the authors of the sequence, actually.
I think the difference may be that when we talk about “a theory of rationality for embedded agents,” we could mean “a theory that has consequences for agents equally powerful to it,” or we could mean something more like “a theory that has consequences for agents of arbitrarily low power.” Reflective AIXI (as a theory of rationality) explains why reflective AIXI (as an agent) is optimally designed, but it can’t explain why a real-world robot might or might not be optimally designed.