TurnTrout comments on Counting arguments provide no evidence for AI doom

TurnTrout 5 Mar 2024 6:41 UTC
LW: 0 AF: 1
0
AF
But that should never lead you to do a counting argument over function space, since that is never a sound thing to do.
Do you agree that “instrumental convergence → meaningful evidence for doom” is also unsound, because it’s a counting argument that most functions of shape Y have undesirable property X?
- evhub 5 Mar 2024 6:48 UTC
  LW: 4 AF: 3
  1
  AF Parent
  I think instrumental convergence does provide meaningful evidence of doom, and you can make a valid counting argument for it, but as with deceptive alignment you have to run the counting argument over algorithms not over functions.
  - Nora Belrose 5 Mar 2024 7:10 UTC
    4 points
    1
    Parent
    It’s not clear to me what an “algorithm” is supposed to be here, and I suspect that this might be cruxy. In particular I suspect (40-50% confidence) that:
    You think there are objective and determinate facts about what “algorithm” a neural net is implementing, where
    Algorithms are supposed to be something like a Boolean circuit or a Turing machine rather than a neural network, and
    We can run counting arguments over these objective algorithms, which are distinct both from the neural net itself and the function it expresses.
    I reject all three of these premises, but I would consider it progress if I got confirmation that you in fact believe in them.