We [humanity] will build the most powerful AIs we can.
We [humanity] will build the most powerful AIs we can control. Power without control is no good to anybody. Cars without brakes go faster than cars with brakes, but there’s no market for them.
We’ll build the most powerful AI we think we can control. Nothing guarantees we never get that judgment wrong. If building one car with brakes that don’t work made everyone in the world die in a traffic accident, everyone in the world would be dead.
There’s also the problem of an AGI consistently exhibiting aligned behavior due to low risk tolerance, until it stops doing that (for all sorts of unanticipated reasons).
This is especially compounded by the current paradigm of brute-forcing randomly generated neural networks, since the resulting systems are fundamentally unpredictable and unexplainable.
Retracted because I used the word “fundamentally” incorrectly, resulting in a mathematically provably false statement (in fact it might be reasonable to assume that neural networks are both fundamentally predictable and even fundamentally explainable, although I can’t say for sure since, as of Nov 2023, I don’t have a sufficient understanding of chaos theory). They sure are unpredictable and unexplainable right now, but there’s nothing fundamental about that.
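To spell out the trivial sense in which the retracted wording was too strong: whatever a network was trained to do, the finished artifact is a deterministic function of its weights and inputs, so running it twice on the same input gives the same answer every time. A minimal sketch (my own illustrative example, not from this thread; the tiny_mlp network is made up purely for the demonstration):

```python
import numpy as np

def tiny_mlp(x, seed=0):
    """A randomly initialized two-layer network; deterministic given the seed."""
    rng = np.random.default_rng(seed)
    w1 = rng.normal(size=(4, 16))
    w2 = rng.normal(size=(16, 1))
    return np.tanh(x @ w1) @ w2

x = np.ones((1, 4))
# Same weights, same input -> bit-for-bit identical output on every run.
assert np.array_equal(tiny_mlp(x), tiny_mlp(x))
```

Of course this only shows mechanical reproducibility; the hard part is explaining why the trained weights produce the behavior they do, which is the sense in which these systems remain unexplainable right now.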
This comment shouldn’t have been upvoted by anyone. It said something that isn’t true.
So how do we get from narrow AI to super-powerful AI? Foom? But we can build narrow AIs that don’t foom, because we have. We should be able to build narrow AIs that don’t foom by not including anything that would allow them to recursively self-improve [*].
EY’s answer to the question “why isn’t narrow AI safe” wasn’t “narrow AI will foom”, it was “we won’t be motivated to keep AIs narrow”.
[*] not that we could tell them how to self-improve, since we don’t really understand it ourselves.
Being on the frontier of controllable power means we need to increase power only slightly to stop being in control; it’s not a stable situation. It works for cars because someone who risks using cheaper brakes doesn’t usually destroy the planet.
Being on the frontier of controllable power means we need to increase power only slightly to stop being in control
Slightly increasing power generally means only slightly decreasing control. What causes the very non-linear relationship you are assuming? Foom? But we can build narrow AIs that don’t foom, because we have. We should be able to build narrow AIs that don’t foom by not including anything that would allow them to recursively self-improve [*].
EY’s answer to the question “why isn’t narrow AI safe” wasn’t “narrow AI will foom”, it was “we won’t be motivated to keep AIs narrow”.
[*] not that we could tell them how to self-improve, since we don’t really understand it ourselves.
What causes the very non-linear relationship you are assuming?
The advantage of offense over defense in the high-capability regime: you only need to cross one threshold, like “can finish a plan to rowhammer itself onto the internet” or “can hide its thoughts before it is spotted”. And we will build non-narrow AI because in practice “most powerful AIs we can control” means “we built some AIs, we can control them, so we continue to do what we have done before” and not “we try to understand what we will not be able to control in the future and try not to do it”, because we already don’t check whether our current AI will be general before we turn it on, and we are already explicitly trying to create non-narrow AI.