Jan_Kulveit comments on Intertheoretic utility comparison

Jan_Kulveit 3 Jul 2018 17:14 UTC
5 points
It seems worth mentioning than anything which involves enumerating over the space of possible actions, or policies, is often not tractable in practice (or, will be exploitable by adversarial enumeration)
So another desideratum may be “it’s easy to implement using sampling”. On this, normalizing by some sort of variance is probably best.