Why?
Because humans have incoherent preferences, and it’s unclear whether a universal resolution procedure is achievable. I like how Richard Ngo put it: “there’s no canonical way to scale me up”.
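One standard way to cash out “incoherent” here (a gloss, not something the comment spells out): a cyclic preference relation such as

$$A \succ B, \qquad B \succ C, \qquad C \succ A$$

admits no utility representation, since any $u$ representing it would have to satisfy $u(A) > u(B) > u(C) > u(A)$, which is impossible.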
This isn’t really a problem with alignment, so there’s no need to address it here. Alignment means the transmission of a preference ordering to an action sequence. Lacking a coherent preference ordering over states of the universe (or histories, for that matter) is not an alignment problem.
I’d put it differently: resolving that problem is a prerequisite for the notion of an “alignment problem” to be meaningful in the first place. It’s not technically a contradiction to have an “aligned” superintelligence that does nothing, but clearly nobody would be satisfied with that in practice.
You can have an alignment problem without humans, e.g. the two strawberries problem (Yudkowsky’s task of getting an AI to place two strawberries, identical down to the cellular level, onto a plate without destroying the world).