Charlie Steiner answers Egan’s Theorem?

Charlie Steiner Sep 13, 2020, 6:50 PM
LW: 11 AF: 3
AF
The answer to the question you actually asked is no, there is no ironclad guarantee of properties continuing, nor any guarantee that there will be a simple mapping between theories. With some effort you can construct some perverse Turing machines with bad behavior.
But the answer the more generalized question is yes, simple properties can be expected (in a probabilistic sense) to generalize even if the model is incomplete. This is basically Minimum Message Length prediction, which you can put on the theoretical basis of the Solomonoff prior (It’s somewhere in Li and Vitanyi—chapter 5?).
What links here?
- Noosphere89's comment on Benito’s Shortform Feed by Ben Pace (Jan 25, 2025, 3:17 PM; 7 points)
- Noosphere89's comment on What Is The Alignment Problem? by johnswentworth (Jan 16, 2025, 1:57 AM; 5 points)
- johnswentworth Sep 13, 2020, 6:59 PM
  LW: 2 AF: 1
  AF Parent
  there is no ironclad guarantee of properties continuing
  Properties continuing is not what I’m asking about. The example in the OP is relevant: even if the entire universe undergoes some kind of phase change tomorrow and the macroscopic physical laws change entirely, it would still be true that the old laws did work before the phase change, and any new theory needs to account for that in order to be complete.
  nor any guarantee that there will be a simple mapping between theories
  I do not know of any theorem or counterexample which actually says this. Do you?
  simple properties can be expected (in a probabilistic sense) to generalize even if the model is incomplete
  Similar issue to “no ironclad guarantee of properties continuing”: I’m not asking about properties generalizing to other parts of the environment, I’m asking about properties generalizing to any theory or model which describes the environment.
  - Charlie Steiner Sep 13, 2020, 9:29 PM
    LW: 3 AF: 1
    AF Parent
    If by “account for that” you mean not be in direct conflict with earlier sense data, then sure. All tautologies about the data will continue to be true. Suppose some data can be predicted by classical mechanics with 75% accuracy. This is a tautology given the data itself, and no future theory will somehow make classical mechanics stop giving 75% accurate predictions for that past data.
    Maybe that’s all you meant?
    I’d sort of interpreted you as asking questions about properties of the theory. E.g. “this data is really well explained by the classical mechanics of point particles, therefore any future theory should have a particularly simple relationship to the point particle ontology.” It seems like there shouldn’t be a guaranteed relationship that’s much simpler than reconstructing the data and recomputing the inferred point particles.
    I spent a little while trying to phrase this in terms of Turing machines but I don’t think I quite managed to capture the spirit.
    What links here?
    Noosphere89's comment on What Is The Alignment Problem? by johnswentworth (Jan 16, 2025, 1:57 AM; 5 points)
    - johnswentworth Sep 13, 2020, 10:17 PM
      LW: 4 AF: 1
      0
      AF Parent
      It seems like there shouldn’t be a guaranteed relationship that’s much simpler than reconstructing the data and recomputing the inferred point particles.
      Yeah, I’m claiming exactly the opposite of this. When the old theory itself has some simple structure (e.g. classical mechanics), there should be a guaranteed relationship that’s much simpler than reconstructing the data and recomputing the inferred point particles.
      One possible formulation: if I find that a terabyte of data compresses down to a gigabyte, and then I find a different model which compresses it down to 500MB, there should be a relationship between the two models which can be expressed without expanding out the whole terabyte. (Or, if there isn’t such a relationship, that means the two models are capturing different patterns from the data, and there should exist another model which compresses the data more than either by capturing the patterns found by both models.)
      - Charlie Steiner Sep 14, 2020, 1:20 AM
        LW: 5 AF: 2
        AF Parent
        Right, it’s a little tricky to specify exactly what this “relationship” is. Is the notion that you should be able to compress the approximate model, given an oracle for the code of the best one (i.e. that they share pieces?). Because most Turing machines don’t compress well, and so it’s easy to find counterexamples (the most straightforward class is where the approximate model is already extremely simple).
        Anyhow, like I said, hard to capture the spirit of the problem. But when I *do* try to formalize the problem, it tends to not have the property, which is definitely driving my intuition.
        What links here?
        Noosphere89's comment on What Is The Alignment Problem? by johnswentworth (Jan 16, 2025, 1:57 AM; 5 points)
        johnswentworth Sep 14, 2020, 3:58 AM
        LW: 2 AF: 1
        AF Parent
        I’d expect Turing machines to be a bad way to model this. They’re inherently blackboxy; the only “structure” they make easy to work with is function composition. The sort of structures relevant here don’t seem like they’d care much about function boundaries. (This is why I use models like these as my default model of computation these days.)
        Anyway, yeah, I’m still not sure what the “relationship” should be, and it’s hard to formulate in a way that seems to capture the core idea.