Quill_McGee comments on Siren worlds and the perils of over-optimised search

Quill_McGee 9 Apr 2014 6:08 UTC
2 points
http://www.fungible.com/respect/index.html This looks closely related. The idea there is “Observe someone’s actions. Assume they are trying to accomplish something. Work out what they are trying to accomplish.” That seems to be what you are talking about.
- [deleted] 9 Apr 2014 8:08 UTC
  1 point
  That looks very similar to what I was writing about, though I’ve tried to be rather more formal/mathematical about it instead of relying on ad-hoc notions of “human”, “behavior”, “perception”, “belief”, etc. I would want the learning algorithm to have uncertain/probabilistic beliefs about the learned utility function, and if I were going to reason about individual human minds I would rather just model those minds directly (as done in Indirect Normativity).
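
As a rough illustration of what "uncertain/probabilistic beliefs about the learned utility function" could look like, here is a minimal sketch of a Bayesian update over a handful of candidate utility functions, given observed actions. Everything in it is an assumption made for illustration and does not come from the thread or the linked page: the three actions, the candidate utilities, the names `action_likelihood` and `posterior_over_utilities`, and in particular the softmax ("noisily rational") choice model with its `rationality` parameter.

```python
import math

# Hypothetical toy setup: three possible actions and three candidate
# utility functions the observed agent might be maximising.
ACTIONS = ["apple", "banana", "cherry"]
CANDIDATE_UTILITIES = {
    "likes_apple": {"apple": 1.0, "banana": 0.0, "cherry": 0.0},
    "likes_banana": {"apple": 0.0, "banana": 1.0, "cherry": 0.0},
    "likes_cherry": {"apple": 0.0, "banana": 0.0, "cherry": 1.0},
}


def action_likelihood(action, utility, rationality=2.0):
    """P(action | utility) under an assumed softmax choice model.

    Higher `rationality` means the agent picks its best action more reliably.
    """
    weights = {a: math.exp(rationality * utility[a]) for a in ACTIONS}
    return weights[action] / sum(weights.values())


def posterior_over_utilities(observed_actions, prior=None, rationality=2.0):
    """Return P(utility | observed actions) over the candidate utilities."""
    names = list(CANDIDATE_UTILITIES)
    if prior is None:
        # Assumed uniform prior over the candidates.
        prior = {name: 1.0 / len(names) for name in names}

    unnormalised = {}
    for name in names:
        probability = prior[name]
        # Actions are treated as independent given the utility function.
        for action in observed_actions:
            probability *= action_likelihood(
                action, CANDIDATE_UTILITIES[name], rationality
            )
        unnormalised[name] = probability

    total = sum(unnormalised.values())
    return {name: p / total for name, p in unnormalised.items()}


# The agent is seen choosing apple twice and banana once.
posterior = posterior_over_utilities(["apple", "apple", "banana"])
for name, probability in posterior.items():
    print(f"{name}: {probability:.3f}")
```

The result is a distribution over hypotheses rather than a single inferred goal, so an occasional out-of-character action shifts belief without overturning it. A realistic version would need a far richer hypothesis space than three hand-written candidates.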