Human beings do not have values that are provably aligned with the values of other human beings.
Sure, but we “happily” compromise. AI should be able to understand and implement the compromise that is overall best for everyone.
Any AI that does value something infinitely will not have human values.
AI can value the “best compromise” infinitely :). But agreed, nothing else.
I’m not sure what it would mean exactly to value the best compromise infinitely, since part of that compromise would be the refusal to accept a sufficiently bad Mugging, which implies a utility bound.
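A minimal sketch of why refusing the mugging forces a bound (the symbols here are my own illustration, not anything stated in the thread): an expected-utility maximizer accepts a mugging offering utility $U$ at probability $p$ for a fixed cost $c$ whenever

$$ p \cdot U > c. $$

If $U$ is unbounded, then for every $p > 0$, however tiny, some payoff $U$ is large enough to satisfy this, so the agent accepts arbitrarily implausible muggings. An agent that refuses all sufficiently bad muggings must therefore have $\sup U < \infty$, or else depart from straightforward expected-utility maximization.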
But if an AI can compromise on some fuzzy or simplified set of values, what happened to the full complexity and fragility of human value?
Why does the compromise have to be a function of simplified values? I don’t think I implied that.