Groups converge as well. We can’t assume AI groups will have the barriers to convergence that human groups currently do (just as we can’t assume that AIs have the barriers to convergence that humans do).
I’m not doubting that groups converge; I’m arguing that when a group achieves reflective equilibrium, that is much more meaningful than a singleton doing so, at least as long as there is variation within the group.
There are bad ways to achieve group convergence.
In absolute terms, maybe, but that doesn’t stop it being relatively better.
What you are trying to do is import positive features from the convergence of human groups (e.g. that more options are likely to have been considered, or that productive discussion is likely to have happened) into the convergence of AI groups, without spelling them out precisely. Unless we have a clear handle on what, among humans, causes these positive features, we have no real reason to expect they will happen in AI groups as well.
The two concrete examples you gave weren’t what I had in mind. I was addressing the problem of an AI “losing” values during extrapolation, and that looks like a real reason to use a group. If you want to prevent an AI undergoing value drift during extrapolation, keep an unextrapolated one as a reference. Two is, minimally, a group.
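To make the “keep a reference copy” idea concrete, here is a minimal, purely illustrative sketch in Python. It is not a proposal for how extrapolation actually works; the value representation, the stand-in extrapolate step, the Euclidean drift metric, and the tolerance are all assumptions chosen only to show the comparison step: one copy is held back unchanged while the other is extrapolated, and the two are then compared.

```python
# Minimal sketch (all names and numbers are hypothetical): hold one copy of
# the value weighting back as an unextrapolated reference, run the other
# through a stand-in "extrapolation" step, and flag the result if it drifts
# too far from the reference.

from math import sqrt


def extrapolate(values: dict) -> dict:
    """Stand-in for whatever extrapolation process the AI actually undergoes."""
    # Toy transformation: shrink every weight toward 0.5.
    return {name: 0.5 + 0.8 * (weight - 0.5) for name, weight in values.items()}


def drift(reference: dict, candidate: dict) -> float:
    """Euclidean distance between two weightings over the same named values."""
    return sqrt(sum((reference[k] - candidate.get(k, 0.0)) ** 2 for k in reference))


reference_values = {"honesty": 0.9, "autonomy": 0.7, "welfare": 0.8}  # kept aside, untouched
extrapolated_values = extrapolate(reference_values)

DRIFT_TOLERANCE = 0.2  # hypothetical threshold
if drift(reference_values, extrapolated_values) > DRIFT_TOLERANCE:
    print("Value drift exceeds tolerance; the extrapolated copy lost or distorted values.")
else:
    print("Extrapolated values remain close to the reference copy.")
```

The design point is only that detecting drift requires something to compare against, which is why even a pair, one extrapolated and one held back, already functions as a group for this purpose.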
There may well be other advantages to doing rationality and ethics in groups, and yes, that needs research, and no, that isn’t a showstopper.