I wonder if there’s a loopiness here is which breaks the setup (the expectation I’m guessing is relative to the prediction markets probabilities? Though it seems like the market is over sensory experiences but the values are over world states in general, so maybe I’m missing something). But it seems like if I take an action and move the market at the same time, I might be able to extract a bunch of extra money and acquire outsize control.
Bidding to control the agent’s actions for the next N timesteps
This seems like it’s wasteful relative to contributing to a pool that bids on action A (or short-term policy P). I guess coordination is hard if you’re just contributing to the pool though, and all connects to the merging process you describe.
I wonder if there’s a loopiness here is which breaks the setup (the expectation I’m guessing is relative to the prediction markets probabilities? Though it seems like the market is over sensory experiences but the values are over world states in general, so maybe I’m missing something). But it seems like if I take an action and move the market at the same time, I might be able to extract a bunch of extra money and acquire outsize control.
This seems like it’s wasteful relative to contributing to a pool that bids on action A (or short-term policy P). I guess coordination is hard if you’re just contributing to the pool though, and all connects to the merging process you describe.