I get the impression that AIXI re-computes it’s actions at every time-step, so it can’t pre-commit to paying the CM. I’m not sure if this is an accurate interpretation though.
Something equivalent to precommitment is: it just being in your nature to trust counterfactual muggers. Then, recomputing your actions in every time-step is fine, and it doesn’t necessarilly indicate that you don’t have a nature that alllows you to pay counterfactual muggers.
I’m not sure if AIXI has a “nature”/personality as such though? I suppose this might be encoded in the initial utility function somehow, but I’m not sure if it’s feasible to include all these kinds of scenarios in advance.
That agent “recomputes decisions” is in any case not a valid argument for it being unable to precommit. Precommitment through inability to render certain actions is a workaround, not a necessity: a better decision theory won’t be performing those actions of its own accord.
So: me neither—I was only saying that arguing from “recomputing its actions at every time-step”, to “lacking precommitment” was an invalid chain of reasoning.
I get the impression that AIXI re-computes it’s actions at every time-step, so it can’t pre-commit to paying the CM. I’m not sure if this is an accurate interpretation though.
Something equivalent to precommitment is: it just being in your nature to trust counterfactual muggers. Then, recomputing your actions in every time-step is fine, and it doesn’t necessarilly indicate that you don’t have a nature that alllows you to pay counterfactual muggers.
I’m not sure if AIXI has a “nature”/personality as such though? I suppose this might be encoded in the initial utility function somehow, but I’m not sure if it’s feasible to include all these kinds of scenarios in advance.
That agent “recomputes decisions” is in any case not a valid argument for it being unable to precommit. Precommitment through inability to render certain actions is a workaround, not a necessity: a better decision theory won’t be performing those actions of its own accord.
So: me neither—I was only saying that arguing from “recomputing its actions at every time-step”, to “lacking precommitment” was an invalid chain of reasoning.