Stuart_Armstrong comments on An overall schema for the friendly AI problems: self-referential convergence criteria

Stuart_Armstrong 21 Jul 2015 13:49 UTC
0 points
What you are trying to do is import positive features from the convergence of human groups (eg the fact that more options are likely to have been considered, the fact that productive discussion is likely to have happened...) into the convergence of AI groups, without spelling them out precisely. Unless we have a clear handle on what, among humans, causes these positive features, we have no real reason to suspect they will happen in AI groups as well.
- TheAncientGeek 21 Jul 2015 16:49 UTC
  0 points
  Parent
  The two concrete examples you gave weren’t what I had in mind. I was addressing the problem of an AI “losing” values during extrapolation,and it looks like a real reason to me. If you want to prevent an AI undergoing value drift during extrapolation, keep an extrapolated one as a reference. Two is a group minimally.
  
  There may well be other advantages to doing rationality and ethics in groups, and yes, that needs research, and no, that isnt a show stopper.