Consider proposing the most naïve formula for logical correlation[1].
Let a program $p$ be a tuple of code for a Turing machine, intermediate tape states after each command execution, and output, all in binary.
That is, $p=(c,t,o)$, with $c\in\{0,1\}^+$, $t\in(\{0,1\}^+)^+$ and $o\in\{0,1\}^+$.
Let $l=|t|$ be the number of steps that $p$ takes to halt.
Then a formula for the logical correlation 合[2] of two halting programs $p_1=(c_1,t_1,o_1)$, $p_2=(c_2,t_2,o_2)$, a tape-state discount factor $\gamma$[3], and a string-distance metric $d:\{0,1\}^+\times\{0,1\}^+\rightarrow\mathbb{N}$ could be

$$合(p_1,p_2,\gamma)=d(o_1,o_2)-\frac{1}{2}+\sum_{k=0}^{\min(l_1,l_2)}\gamma^k\cdot d(t_1(l_1-k),t_2(l_2-k))$$
The lower 合, the higher the logical correlation between $p_1$ and $p_2$. The minimal value is $-0.5$.
If $d(o_1,o_2)<d(o_1,o_3)$, then it’s also the case that $合(p_1,p_2,\gamma)<合(p_1,p_3,\gamma)$.
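As a minimal sketch, here is the formula in Python, assuming a padded Hamming distance as a stand-in for the unspecified metric $d$, and aligning both traces at their halting step (with the sum’s endpoint clipped to stay within 0-indexed bounds):

```python
from itertools import zip_longest

def hamming(a: str, b: str) -> int:
    # Stand-in for the string-distance metric d: positionwise mismatches,
    # padding the shorter string with a sentinel character.
    return sum(x != y for x, y in zip_longest(a, b, fillvalue="_"))

def logical_correlation(p1, p2, gamma: float, d=hamming) -> float:
    # 合(p1, p2, γ): lower values mean stronger logical correlation.
    # A program is a tuple (code, trace, output); the trace is the list
    # of tape states after each command execution, all as bitstrings.
    _c1, t1, o1 = p1
    _c2, t2, o2 = p2
    l1, l2 = len(t1), len(t2)
    # Align both traces at their halting step and walk backwards,
    # discounting earlier tape states by γ^k.
    trace_term = sum(
        gamma ** k * d(t1[l1 - 1 - k], t2[l2 - 1 - k])
        for k in range(min(l1, l2))
    )
    return d(o1, o2) - 0.5 + trace_term

# Two identical (made-up) programs attain the minimal value -0.5:
p = ("0110", ["1", "10", "101"], "101")
assert logical_correlation(p, p, gamma=0.9) == -0.5
```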
One might also want to be able to deal with the fact that programs have different trace lengths, and penalize that, e.g. by amending the formula:

$$合'(p_1,p_2,\gamma)=合(p_1,p_2,\gamma)+2^{|l_1-l_2|}$$
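Continuing the sketch above, the amendment just adds the trace-length penalty on top:

```python
def logical_correlation_prime(p1, p2, gamma: float, d=hamming) -> float:
    # 合′: 合 plus a penalty that grows with the trace-length mismatch.
    l1, l2 = len(p1[1]), len(p2[1])
    return logical_correlation(p1, p2, gamma, d) + 2 ** abs(l1 - l2)
```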
I’m a bit unhappy that the code $c$ doesn’t factor into the logical correlation, and ideally one would want to be able to compute the logical correlation without having to run the program.
How does this relate to data=code?
Actually not explained in detail anywhere, as far as I can tell. I’m going to leave out all motivation here.
Suggested by GPT-4. Stands for joining, combining, uniting. Also “to suit; to fit”, “to have sexual intercourse”, “to fight, to have a confrontation with”, or “to be equivalent to, to add up”.
Which is needed because tape states close to the output are more important than tape states early on.
What do you want to use the logical correlation measure for?
I don’t have a concrete use for it yet.
> Ideally one would want to be able to compute the logical correlation without having to run the program.

I think this isn’t possible in the general case. Consider two programs, one of which is “compute the sha256 digest of all 30 byte sequences and halt if the result is 9a56f6b41455314ff1973c72046b0821a56ca879e9d95628d390f8b560a4d803” and the other of which is “compute the md5 digest of all 30 byte sequences and halt if the result is 055a787f2fb4d00c9faf4dd34a233320”.

Any method that was able to compute the logical correlation between those would also be a program which, at a minimum, reverses all cryptographic hash functions.
How close would this rank a program $p$ with a universal Turing machine simulating $p$? My sense is not very close, because the “same” computation steps on each program don’t align.
My “most naïve formula for logical correlation” would be something like: put a probability distribution on binary string inputs, treat $p_1$ and $p_2$ as random variables $\{0,1\}^*\rightarrow\{0,1\}^*\cup\{\bot\}$, and compute their mutual information.
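A sketch of that proposal in Python, under illustrative assumptions: a uniform distribution over all inputs up to a fixed length, and runner functions that return None (standing in for $\bot$) when a program does not halt within some step budget:

```python
from collections import Counter
from itertools import product
from math import log2

def mutual_information(run1, run2, max_len: int = 8) -> float:
    # run1, run2: map an input bitstring to an output bitstring, or to
    # None (standing in for ⊥, e.g. when a step budget is exceeded).
    # Uses a uniform distribution over all inputs of length <= max_len.
    inputs = ["".join(bits) for n in range(max_len + 1)
              for bits in product("01", repeat=n)]
    joint = Counter((run1(x), run2(x)) for x in inputs)
    total = sum(joint.values())
    marg1, marg2 = Counter(), Counter()
    for (y1, y2), c in joint.items():
        marg1[y1] += c
        marg2[y2] += c
    # I(p1; p2) = sum over output pairs of p(y1,y2) * log2(p(y1,y2) / (p(y1) p(y2)))
    return sum(
        (c / total) * log2(c * total / (marg1[y1] * marg2[y2]))
        for (y1, y2), c in joint.items()
    )
```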
The strongest logical correlation is $-0.5$; the lower, the better.
For $p$ and $\text{sim}(p)$, the logical correlation would be $-\varepsilon$, assuming that $p$ and $\text{sim}(p)$ have the same output. This is a pretty strong logical correlation.
This is because equal output guarantees a logical correlation of at most 0, and one can then improve the logical correlation by also having similar traces. If the outputs have string distance 1, then the smallest logical correlation can be only $0.5$.
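As a quick check of these bounds against the formula above (using $d(x,x)=0$, as for any metric): with identical traces and outputs,

$$合(p,p,\gamma)=0-\frac{1}{2}+\sum_{k}\gamma^k\cdot 0=-\frac{1}{2},$$

while with outputs at string distance 1, the trace sum is at best $0$, so

$$合(p_1,p_2,\gamma)\geq 1-\frac{1}{2}+0=\frac{1}{2}.$$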
Could you say more about the motivation here?
Whenever people have written/talked about ECL (evidential cooperation in large worlds), a common thing I’ve read/heard is that “of course, this depends on us finding some way of saying that one decision algorithm is similar/dissimilar to another one, since we’re not going to encounter the case of perfect copies very often”. This was at least the case when I last asked Oesterheld about this, but I haven’t read Treutlein 2023 closely enough yet to figure out whether he has a satisfying solution.
The fact that we didn’t have a characterization of logical correlation bugged me and was in the back of my mind, since it felt like a problem one could make progress on. Today in the shower I was thinking about this, and the post above is what came of it.
(I also have the suspicion that having a notion of “these two programs produce the same/similar outputs in a similar way” might be handy in general.)
If you want to use it for ECL, then it’s not clear to me why internal computational states would matter.
My reason for caring about internal computational states is: In the twin prisoners’ dilemma[1], I cooperate because we’re the same algorithm. If we modify the twin to have a slightly longer right index-finger nail, I would still cooperate, even though they’re a different algorithm: little enough has been changed about the algorithm that the internal states are still similar enough.
But it could be that I’m in a prisoner’s dilemma with some program p⋆ that, given some inputs, returns the same outputs as I do, but for completely different “reasons”—that is, the internal states are very different, and a slight change in input would cause the output to be radically different. My logical correlation with p⋆ is pretty small, because, even though it gives the same output, it gives that output for very different reasons, so I don’t have much control over its outputs by controlling my own computations.
At least, that’s how I understand it.
Is this actually ECL, or just acausal trade?