quetzal_rainbow comments on Evolution Solved Alignment (what sharp left turn?)

quetzal_rainbow 15 Oct 2023 21:45 UTC
2 points
1
Again none of those future scenarios have played out, they aren’t evidence yet, just speculation
It’s very weird notion of what constitutes an evidence. If you built AGI and your interpretability tools show you that AGI is plotting to kill you, it would be pretty hard evidence in favor of sharp left turn, even if you were still alive.
- jacob_cannell 15 Oct 2023 22:30 UTC
  0 points
  0
  Parent
  Not at all—just look at the definition of solomonoff induction: the distribution over world models/theories is updated strictly on new historical observation bits, and never on future predicted observations. If you observe mental states inside the AGI, that is naturally valid observational evidence from your pov. But that is very different from you predicting that the AGI is going to kill you, and then updating your world model based on those internal predictions—those feedback loops rapidly diverge from reality (and may be related to schizophrenia).
  - quetzal_rainbow 16 Oct 2023 10:53 UTC
    3 points
    2
    Parent
    I’m talking about observable evidence, like, transhumanists claiming they will drop their biological bodies on first possibility.