There are many such operators, and different ones give different answers when presented with the same agent. Only a human utility function distinguishes the right way of interpreting a human mind as having a utility function from all of the wrong ways of doing so. So you need to get a bunch of Friendliness Theory right before you can bootstrap.
Why do you think there are many such operators? Do you believe the concept of “utility function of an agent” is ill-defined (assuming the “agent” is actually an intelligent agent rather than e.g. a rock)? Do you think it is possible to interpret a paperclip maximizer as having a utility function other than maximizing paperclips?
Deducing the correct utility function of a utility maximiser is one thing (a task with a low level of uncertainty, higher if the agent is hiding stuff). Assigning a utility function to an agent that doesn't have one is quite another.
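A minimal sketch of the underdetermination point, in the spirit of the blue-minimizing robot post linked below. The toy world, the observed policy, and the candidate utility functions are all hypothetical choices made for illustration, not anything proposed in the thread; the point is only that several distinct utility functions rationalize exactly the same observed behaviour, so an "interpretation operator" has to break the tie by some further criterion.

```python
# Hypothetical toy setting: a robot sees an object's colour and either zaps it
# or ignores it. Its observed behaviour: zap blue, ignore red.
states = ["blue", "red"]
actions = ["zap", "ignore"]
observed_policy = {"blue": "zap", "red": "ignore"}

# Three candidate utility functions over (state, action) pairs.
# U1: "minimize blue things"              -- zapping blue is good, all else neutral.
# U2: "enjoys firing the laser at blue"   -- rewards the act itself, more strongly.
# U3: constant utility                    -- indifferent to everything.
candidate_utilities = {
    "U1 (minimize blue)":      lambda s, a: 1.0 if (s == "blue" and a == "zap") else 0.0,
    "U2 (likes zapping blue)": lambda s, a: 5.0 if (s == "blue" and a == "zap") else 0.0,
    "U3 (constant)":           lambda s, a: 0.0,
}

def rationalizes(utility, policy):
    """True if the observed action in every state is (weakly) utility-maximizing."""
    return all(
        utility(s, policy[s]) >= max(utility(s, a) for a in actions)
        for s in states
    )

for name, u in candidate_utilities.items():
    print(name, "rationalizes the observed behaviour:", rationalizes(u, observed_policy))
# All three print True: behaviour alone does not single out "the" utility function.
```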
See http://lesswrong.com/lw/6ha/the_blueminimizing_robot/ Key quote:
Replied in the other thread.