Like, can we specialize this new program to a program that’s just ‘dovetailing’ across all possible gauge-choices and then running physics on each?
You could specialize to just dovetailing across gauges, but this might make the program longer since you’d need to specify physics.
So it presumably has to pick one of them to “predict”, but how is it doing this?
I don’t think you need to choose a particular history to predict since all observables are gauge-invariant. So say we’re trying to predict some particular sequence of observables in the universe. You run your search over all programs; the search includes various choices of gauge/initialization. Since the observables are gauge-invariant, each of those choices of gauge generates the same sequence of observables, so their algorithmic probability accumulates to a single string. Once the string has accumulated enough algorithmic probability, we can refer to it with a short codeword (this is the tricky part; see below).
But presumably sampling isn’t good enough, because the random seed needed to hit a particular cluster of my choosing could be arbitrarily improbable
The idea is that, using a clever algorithm, you can arrange the clusters in a contiguous way inside [0,1]. Since they’re contiguous, if a particular cluster has volume 2^{−Θ(n)} we can refer to it with a codeword of length Θ(n).
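To make the contiguity trick concrete, here’s a toy Python sketch (my own illustration, not necessarily the clever algorithm in question): if a cluster occupies a contiguous sub-interval of [0,1], you can name it with roughly log2(1/volume) bits by finding a dyadic interval that fits inside it.

```python
import math

def codeword_for_interval(lo: float, hi: float) -> str:
    """Shortest bitstring b such that the dyadic interval
    [0.b, 0.b + 2^-len(b)) fits inside [lo, hi)."""
    k = 0
    while True:
        step = 2.0 ** -k
        m = math.ceil(lo / step)       # first grid point >= lo
        if m * step + step <= hi:      # does [m*step, (m+1)*step) fit?
            return "" if k == 0 else format(m, "b").zfill(k)
        k += 1

# A cluster of volume 2^-10 gets a codeword of about 11 bits:
w = codeword_for_interval(0.3, 0.3 + 2 ** -10)
print(len(w), w)  # -> 11 01001100111
```

The codeword length is at most ⌈log2(1/(hi−lo))⌉ + 1, so volume 2^{−Θ(n)} really does give length Θ(n).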
this might make the program longer since you’d need to specify physics.
(I doubt it matters much; the bits you use to specify physics at the start are bits you save when picking the codeword at the end.)
I don’t think you need to choose a particular history to predict since all observables are gauge-invariant.
IIUC, your choice of the wavefunction and your choice of the gauge are interlocked. The invariant is that if you change the gauge and twiddle the wavefunction in a particular way, then no observables change. If you’re just iterating over (gauge, wavefunction) pairs then (iiuc) you’re going to hit lots of pairs that aren’t making the correct predictions about the observables.
I don’t understand the mechanism that lets you find some clever way to check which (gauge, wavefunction) pairs are legal (or at least haven’t-been-ruled-out-yet), nor how to make a deterministic prediction given an imperfect cluster without doing some sort of sampling.
(Alternatively, it’s plausible to me that I’m thinking about it wrong.)
… codeword …
Yeah, I see how if you can do this dovetailing and get the clusters to be contiguous, such that every cluster with 2^k programs of length (n+k) has a codeword of length n, then you’re done.
(And it’s clear to me how to do this if you can enumerate the 2^k programs, and I know how to enumerate a superset of those programs b/c I know how to enumerate all the programs, but I don’t know how to computably distinguish members.)
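(Spelling out the arithmetic there, with a length-ℓ program carrying weight 2^{−ℓ}: the cluster’s total weight is 2^k · 2^{−(n+k)} = 2^{−n}, which is exactly the weight of a single length-n codeword, so the codeword precisely “pays for” the cluster.)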
I don’t understand the mechanism that lets you find some clever way to check which (gauge, wavefunction) pairs are legal
I’m not really sure how gauge-fixing works in QM, so I can’t comment here. But I don’t think it matters: there’s no need to check which pairs are “legal”, you just run all possible pairs in parallel and see what observables they output. Pairs which are in fact gauge-equivalent will produce the same observables by definition, and will accrue probability to the same strings.
Perhaps you’re worried that physically-different worlds could end up contributing identical strings of observables? Possibly, but (a) I think if all possible strings of observables are the same they should be considered equivalent, so that could be one way of distinguishing them, and (b) this issue seems orthogonal to k-complexity vs. alt-complexity.
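(For concreteness: “accruing probability” here is algorithmic probability in Levin’s sense, m(x) = Σ_{p : U(p)=x} 2^{−|p|}, summing over all prefix programs p that output x on the universal machine U; gauge-equivalent pairs emitting the same observable string all add to the same m(x).)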
That gives me a somewhat clearer picture. (Thanks!) It sounds like the idea is that we have one machine that dovetails through everything and separates them into bins according to their behavior (as revealed so far), and a second machine that picks a bin.
Presumably the bins are given some sort of prefix-free code, so that when a behavior-difference is revealed within a bin (e.g. after more time has passed) it can be split into two bins, with some rule for which one is “default” (e.g., the leftmost).
I buy that something like this can probably be made to work.
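Here’s a minimal Python sketch of that two-machine picture, with `run` as a hypothetical stand-in for “simulate candidate i for a while and report the observables it has emitted so far”:

```python
from collections import defaultdict
from typing import Callable, Dict, List

def dovetail_bins(run: Callable[[int, int], str],
                  n_candidates: int, steps: int) -> Dict[str, List[int]]:
    """Machine 1: bin candidates by the behavior revealed so far.

    run(i, steps) is an assumed interface: the observable string that
    candidate i has emitted within `steps` simulation steps. Candidates
    with identical strings share a bin; calling again with larger `steps`
    splits bins as behavior-differences get revealed.
    """
    bins: Dict[str, List[int]] = defaultdict(list)
    for i in range(n_candidates):
        bins[run(i, steps)].append(i)
    return dict(bins)

# Machine 2 just names a bin, e.g. by laying bins out contiguously and
# using the dyadic-codeword trick from earlier in the thread.
toy_run = lambda i, steps: format(i % 4, "b")  # toy behavior: i mod 4
print(dovetail_bins(toy_run, 8, steps=1))      # 4 bins of 2 candidates each
```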
In the case of physics, what it suggests is the hypothesis
reality is the thing that iterates over all gauges, and applies physics to cluster #657
as competing with a cluster of longer hypotheses like
reality applies physics to gauge #25098723405890723450987098710987298743509
Where that first program beats most elements of the cluster (b/c the gauge-identifier is so dang long in general), but the cluster as a whole may still beat that first program (by at most a fixed constant factor, corresponding roughly to the length of the iterating machine).
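(To put rough numbers on that: if the iterating machine costs c bits and the cluster’s total weight is 2^{−n}, the pointer program has length about c+n and hence weight 2^{−(c+n)}, while each direct “gauge #G” program pays |G| ≫ n bits for its identifier; so the pointer beats almost every individual member, and the cluster as a whole exceeds the pointer by at most the factor 2^c.)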
(And my current state is something like: it’s interesting that you can think of yourself as living in one particular program, without losing more than a factor of «fixed constant» from your overall probability on what you saw. But I still don’t see why you’d want to, as opposed to realizing that you live in lots and lots of different programs.)
Presumably the bins are given some sort of prefix-free code, so that when a behavior-difference is revealed within a bin (e.g. after more time has passed) it can be split into two bins, with some rule for which one is “default”
I only just realized that you’re mainly thinking of the complexity of semimeasures on infinite sequences, not the complexity of finite strings. I guess that should have been obvious from the OP; the results I’ve been citing are about finite strings. My bad! For semimeasures, this paper proves that there actually is a non-constant gap between the log-total-probability and description complexity. Instead, the gap is bounded by the Kolmogorov complexity of the length of the sequences. This is discussed in section 4.5.4 of Li & Vitányi.
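(In symbols, I take that to be: Km(x) ≤ log(1/M(x)) + K(ℓ(x)) + O(1), with M the universal semimeasure, Km monotone description complexity, and ℓ(x) the length of x; the gap can grow, but only on the order of K(ℓ(x)).)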
Cool, thanks.
I’m pretty confident that the set of compatible (gauge, wavefunction) pairs is computably enumerable, so I think that the coding theorem should apply.
There’s an insight that I’ve glimpsed—though I still haven’t checked the details—which is that we can guarantee that it’s possible to name the ‘correct’ (gauge, wavefunction) cluster without necessarily having to name any single gauge (as would be prohibitively expensive), by dovetailing all the (gauge, wavefunction) pairs (in some representation where you can computably detect compatibility) and binning the compatible ones, and then giving a code that points to the desired cluster (where the length of that code goes according to cluster-size, and which will in general be shorter than naming a full gauge).
This does depend on your ability to distinguish when two programs belong in the same cluster, which I’m pretty sure can be done for (gauge, wavefunction) pairs (in some suitable representation), but which ofc can’t be done in general.
Thanks for the reference!
Seems good to edit the correction into the post, so readers know that in some cases it’s not constant.
How does this correctness check work?
I usually think of gauge freedom as saying “there is a family of configurations that all produce the same observables”. I don’t think that gives a way to say some configurations are correct/incorrect. Rather some pairs of configurations are equivalent and some aren’t.
That said, I do think you can probably do something like the approach described to assign a label to each equivalence class of configurations and do your evolution in that label space, which avoids having to pick a gauge.
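A toy illustration of that label-space idea (a made-up model, not actual QM gauge-fixing): let configurations be complex vectors, let the ‘gauge’ freedom be a global phase, and label each equivalence class by a canonical representative.

```python
import numpy as np

def canonical_label(psi: np.ndarray) -> np.ndarray:
    """Canonical member of psi's global-phase equivalence class:
    rotate the first nonzero amplitude onto the positive real axis."""
    idx = int(np.argmax(np.abs(psi) > 1e-12))  # index of first nonzero amplitude
    phase = psi[idx] / abs(psi[idx])
    return psi / phase

a = np.array([1j, 1.0]) / np.sqrt(2)
b = np.exp(0.7j) * a                           # gauge-equivalent to a
assert np.allclose(canonical_label(a), canonical_label(b))  # same label
assert np.allclose(np.abs(a) ** 2, np.abs(b) ** 2)          # same observables
```

Evolution can then act on the labels directly, with no gauge ever chosen.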