Are we talking about an agent that is uncertain about its own utility function or about an agent that is uncertain about another agent’s?
You are probably talking about the former. What would count as evidence about the uncertain utility function?
Yes, the former. If the agent takes actions and receives rewards it can observe, then it gains evidence about its own utility function.
You probably already know this, but the reinforcement learning framework is very relevant here. In particular, standard RL references describe how to compute the expected utility (expected return) of a (strategy, reward function) pair.
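Here is a minimal sketch of those two ideas, using a toy two-armed bandit rather than a full RL setup (all names, the two candidate reward functions, and the Gaussian observation noise are illustrative assumptions, not anything from the thread): the agent keeps a posterior over which candidate reward function is its own, updates it from observed (action, reward) pairs, and then scores a strategy by its posterior-weighted one-step expected utility.

```python
# Sketch: Bayesian evidence about an uncertain own-reward function,
# plus expected utility of a (strategy, reward function) pair.
# Toy two-armed bandit; candidate reward functions are hypothetical.
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical candidate reward functions: expected reward per action (arm).
candidates = {
    "R1": np.array([1.0, 0.0]),
    "R2": np.array([0.0, 1.0]),
}
prior = {"R1": 0.5, "R2": 0.5}
noise_sd = 0.5  # assumed Gaussian noise on observed rewards


def posterior_after(observations, prior, noise_sd):
    """Bayes update over candidate reward functions from (action, reward) pairs."""
    log_post = {h: np.log(p) for h, p in prior.items()}
    for action, reward in observations:
        for h, means in candidates.items():
            # Gaussian log-likelihood of the observed reward under hypothesis h.
            log_post[h] += -0.5 * ((reward - means[action]) / noise_sd) ** 2
    # Normalize to a proper posterior.
    z = np.logaddexp.reduce(list(log_post.values()))
    return {h: np.exp(lp - z) for h, lp in log_post.items()}


def expected_utility(strategy, reward_means):
    """One-step expected utility of a strategy (action probabilities) under a reward function."""
    return float(np.dot(strategy, reward_means))


# The agent acts and observes rewards; here the "true" reward function is R1.
true_means = candidates["R1"]
observations = []
for _ in range(10):
    action = rng.integers(2)
    reward = rng.normal(true_means[action], noise_sd)
    observations.append((action, reward))

post = posterior_after(observations, prior, noise_sd)
strategy = np.array([0.8, 0.2])  # an example stochastic strategy

# Posterior-weighted expected utility of the strategy.
eu = sum(post[h] * expected_utility(strategy, candidates[h]) for h in candidates)
print("posterior over reward functions:", post)
print("expected utility of strategy:", eu)
```

In a sequential setting you would replace the one-step `expected_utility` with the expected discounted return of the strategy under each candidate reward function, but the structure (posterior update, then posterior-weighted expectation) is the same.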