Argues for the folk theorem that, in general, rational agents will preserve their utility functions during self-optimization.
The Gandhi example works because he was posited with a single goal. With multiple competing goals, I'd expect some goals to lose, and, having lost, to be more likely to lose the next time.
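The compounding-loss dynamic above can be sketched as a toy simulation. Everything here is my own illustration rather than anything from the post: the goal names, the random "urgency" draws, and the rule that a losing goal's weight is multiplied down after each conflict.

```python
import random

# Toy sketch of the compounding-loss dynamic: two competing goals start
# with equal weight, and whichever goal loses a conflict has its weight
# shrunk, so early losses make later losses more likely. The goal names,
# decay rule, and random urgencies are assumptions for illustration.
def simulate(rounds=50, decay=0.9, seed=0):
    random.seed(seed)
    weights = {"reduce_deaths": 1.0, "avoid_causing_deaths": 1.0}
    for _ in range(rounds):
        # Each goal's effective pull this round: its current weight
        # times a random situational urgency in [0, 1).
        pulls = {g: w * random.random() for g, w in weights.items()}
        # The weaker pull loses the conflict, and losing shrinks its
        # weight for every future round.
        loser = min(pulls, key=pulls.get)
        weights[loser] *= decay
    return weights

final = simulate()
```

Under this rule one goal's weight tends to collapse while the other's stays near 1.0, which is the "having lost, more likely to lose" pattern.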
“Utility functions.” Omohundro argues that agents that don't have utility functions will have to acquire them. I'm not totally sure I believe this is a universal law, but I suspect that something like it is true in many cases, for reasons like those above.
This is the same ‘multiple competing goals’ issue, where the goals are ‘do not be part of a causal chain that leads to the death of others’ and ‘reduce the deaths of others’.
And unchanged circumstances. What would Gandhi do when faced with a trolley problem?