Interesting. In that case, would you say an AI that provably implements CEV’s replacement is, for that reason, provably Friendly? That is, AIs implementing CEV’s replacement form an analytical subset of Friendly AIs? What is the current replacement for CEV anyway? Having some technical material would be even better. If it’s open to the public, then I’d like to understand how EY proposes to install a general framework similar to CEV at the “initial dynamic” stage that can predictably generate a provably Friendly AI without explicitly modeling the target of its Friendliness.
There isn’t really one as far as I know; “The Value Learning Problem” discusses some of the questions involved, but seems mostly to be at the stage of defining the problem rather than trying to answer it. (This seems appropriate to me; trying to answer the problem at this point seems premature.)
Thanks. That makes sense to me.
I think that’s MIRI’s usage of the term “Friendly.”
He’s not proposing a mechanism as far as I know. That’s another open problem.
See MIRI’s research for details.