I think the idea of having additional agents B (and C) to act as a form of control is definitely worth pursuing, though I am not clear how it would be implemented.
Is ‘w’ just random noise added to the max value of u?
If so, would this just act as a limiter and eventually it would find a result close to the original max utility anyway once the random noise falls close to zero?
Specifying v is part of the challenge. But by “noise” I means a whole other utility function added permanently on to u. It would not “fall”, it would be a permanent feature of v.
I think the idea of having additional agents B (and C) to act as a form of control is definitely worth pursuing, though I am not clear how it would be implemented.
Is ‘w’ just random noise added to the max value of u?
If so, would this just act as a limiter and eventually it would find a result close to the original max utility anyway once the random noise falls close to zero?
Specifying v is part of the challenge. But by “noise” I means a whole other utility function added permanently on to u. It would not “fall”, it would be a permanent feature of v.