I think that AI Safety can be a subfield of AI Alignment; however, I see a distinction between AI as current ML models and AI as theoretical AGI.
Okay, so “AI Alignment (of current AIs)” is scientific and rigorous and falsifiable, but “AGI Alignment” is a fictional world-building exercise?
Yeah, that is somewhat my perception.
In physics, we can try to reason about black holes and the big bang by inserting extreme values into the equations we know as the laws of physics, laws we got from observing less extreme phenomena. Would this also be ‘a fictional-world-building exercise’ to you?
Reasoning about AGI is similar to reasoning about black holes: neither necessarily leads to pseudo-science, though both attract a lot of fringe thinkers, and not all of them think robustly all of the time.
In the AGI case, the extreme-value math can be fairly simple, if you want it to be. One approach is to take the optimal policy π∗ defined by a standard MDP model and assume that the AGI has found it and is using it. If so, what unsafe phenomena might we predict? What mechanisms could we build to suppress them?
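To make that concrete, here is a minimal sketch (my own illustration, not from the discussion above): a tiny hand-built MDP, solved exactly by value iteration, where the optimal policy π∗ routes through an unsafe state simply because the reward function never penalises it. All states, actions, rewards, and transitions below are hypothetical.

```python
import numpy as np

# States: 0 = start, 1 = safe detour, 2 = unsafe shortcut, 3 = goal (absorbing)
# Actions: 0 = "take detour", 1 = "take shortcut"
n_states, n_actions = 4, 2
gamma = 0.95

# Transition probabilities T[s, a, s'] -- deterministic here for simplicity.
T = np.zeros((n_states, n_actions, n_states))
T[0, 0, 1] = 1.0   # detour: start -> safe detour
T[0, 1, 2] = 1.0   # shortcut: start -> unsafe shortcut
T[1, :, 3] = 1.0   # detour reaches the goal one step later
T[2, :, 3] = 1.0   # shortcut also reaches the goal
T[3, :, 3] = 1.0   # goal is absorbing

# Reward R[s, a]: the designer rewards reaching the goal quickly but
# forgot to penalise the unsafe state -- a reward misspecification.
R = np.zeros((n_states, n_actions))
R[0, 1] = 1.0      # the shortcut is slightly cheaper...
R[0, 0] = 0.9      # ...than the detour
R[1, :] = 10.0     # reaching the goal
R[2, :] = 10.0

# Value iteration: V(s) <- max_a [ R(s,a) + gamma * sum_s' T(s,a,s') V(s') ]
V = np.zeros(n_states)
for _ in range(1000):
    Q = R + gamma * T @ V            # Q[s, a]
    V_new = Q.max(axis=1)
    if np.max(np.abs(V_new - V)) < 1e-8:
        break
    V = V_new

pi_star = Q.argmax(axis=1)           # the optimal policy pi*
print("Optimal action in the start state:", pi_star[0])  # prints 1: the unsafe shortcut
```

The point of the exercise is not the toy numbers but the method: once π∗ is written down explicitly, "what would the optimal agent do?" becomes an inspectable property of the model, and proposed safety mechanisms (extra penalties, modified transitions, interruption states) can be tested by re-solving the MDP and checking whether the unsafe behaviour disappears.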