William_S comments on Poker example: (not) deducing someone’s preferences

William_S 13 Jun 2018 18:53 UTC
3 points
In the higher-dimensional belief/reward space, do you think it would be possible to significantly narrow down the space of possibilities (so this argument is saying “be Bayesian with respect to reward/beliefs, picking policies that work over a distribution”), or are you more pessimistic than that, thinking that the uncertainty would be so great in higher-dimensional spaces that it would not be possible to pick a good policy?
- Stuart_Armstrong 13 Jun 2018 20:18 UTC
  10 points
  I think we need to add other assumptions to narrow down the search space.