TurnTrout comments on Counting arguments provide no evidence for AI doom

TurnTrout 4 Mar 2024 19:04 UTC
LW: 6 AF: 5
3
AF
By the logic of the post, step 4 is the problem, but I think step 4 is actually valid. The problem is step 2: there are actually a huge number of different ways to implement a line! Not only are there many different programs that implement the line in different ways, I can also just take the simplest program that does so and keep on adding comments or other extraneous bits.
Evan, I wonder how much your disagreement is engaging with OPs’ reasons. A draft of this post motivated the misprediction of both counting arguments as trying to count functions instead of parameterizations of functions; one has to consider the compressivity of the parameter-function map (many different internal parameterizations map to the same external behavior). Given that the authors actually agree that 2 is incorrect, does this change your views?
- evhub 4 Mar 2024 19:50 UTC
  LW: 7 AF: 5
  2
  AF Parent
  I would be much happier with that; I think that’s much more correct. Then, my objection would just be that at least the sort of counting arguments for deceptive alignment that I like are and always have been about parameterizations rather than functions. I agree that if you try to run a counting argument directly in function space it won’t work.
  - ryan_greenblatt 5 Mar 2024 2:01 UTC
    LW: 4 AF: 3
    0
    AF Parent
    See also discussion here.
  - TurnTrout 5 Mar 2024 6:49 UTC
    LW: -2 AF: -1
    −5
    AF Parent
    deceptive alignment that I like are and always have been about parameterizations rather than functions.
    How can this be true, when you e.g. say there’s “only one saint”? That doesn’t make any sense with parameterizations due to internal invariances; there are uncountably many “saints” in parameter-space (insofar as I accept that frame, which I don’t really but that’s not the point here). I’d expect you to raise that as an obvious point in worlds where this really was about parameterizations.
    And, as you’ve elsewhere noted, we don’t know enough about parameterizations to make counting arguments over them. So how are you doing that?
    - evhub 5 Mar 2024 6:54 UTC
      LW: 5 AF: 3
      4
      AF Parent
      
      How can this be true, when you e.g. say there’s “only one saint”? That doesn’t make any sense with parameterizations due to internal invariances; there are uncountably many saints.
      
      Because it was the transcript of a talk? I was trying to explain an argument at a very high level. And there’s certainly not uncountably many; in the infinite bitstring case there would be countably many, though usually I prefer priors that put caps on total computation such that there are only finitely many.
      
      I’d expect you to raise that as an obvious point in worlds where this really was about parameterizations.
      
      I don’t really appreciate the psychoanalysis here. I told you what I thought and think, and I have far more evidence about that than you do.
      
      And, as you’ve elsewhere noted, we don’t know enough about parameterizations to make counting arguments over them. So how are you doing that?
      
      As I’ve said, I usually try to take whatever the most realistic prior is that we can reason about at a high-level, e.g. a circuit prior or a speed prior.
- Nora Belrose 4 Mar 2024 19:11 UTC
  3 points
  −3
  Parent
  FWIW I object to 2, 3, and 4, and maybe also 1.