Morendil comments on Beauty quips, “I’d shut up and multiply!”

Morendil 12 May 2010 16:52 UTC
0 points

It seems clear to me that in variation Alpha, ¹⁄₁₁ is the answer and not ¹⁄₂.

What is it that makes that clear to you?

Your variation Alpha strikes me as somewhat under-specified. Here is how I’m tempted to fill in:

We have 10 cryonics patients on hand and a revival procedure. Each patient, upon revival, awakens in a featureless room, alone, and is given either questionnaire Q then set free, or just set free without further ado.

We flip a coin. If it comes up heads, questionnaire Q is given to one patient; tails, it is given to all ten. Questionnaire Q consists of this very narrative plus the question, “What is your credence now that the coin came up heads?”

It seems to me that if the patient has no other relevant information (such as how many patients were revived), their answer ought to be ¹⁄₂, no matter how many revivals occur on tails. This looks a lot more like Stuart Armstrong’s “proof of the SIA” than like SB, though, so I might have to reread that post.

The background information X’=(coin flip, revival with questionnaire) is different from the background information X=(coin flip), but not necessarily enough to alter the answer to the question—unless for some reason each patient is interested in maximizing the number of patients who would get the right answer if they were asked straight out how the coin came up. (Which is how some participants in the discussion have interpreted “credence”, I now believe. Under some assumptions, such as having a payout involved, e.g. getting a candy bar for calling the coin correctly, this is even a legitimate interpretation.)

If you take “credence” to mean “your prior, updated with whatever information you’ve gained that has bearing on how the coin might have come up”, and your prior for the coin is the ⁵⁰⁄₅₀ distribution, then it seems to me that you have nothing to update on, and that the answer is still ¹⁄₂.
- AlephNeil 12 May 2010 17:14 UTC
  0 points
  Parent
  Your filling in is not quite what I had in mind: When I said “one is randomly selected to be revived” I meant to imply “none of the others are revived”.
  
  Also, you may suppose that before entering hibernation, each patient knows that there’s going to be a coin flip and what will happen in each case.
  
  Deducing ¹⁄₁₁ is now just a matter of applying Bayes’ theorem. This may be easier to comprehend if we introduce:
  
  Variation Alpha’:
  
  Same as Variation Alpha except that one of the 10 people is (secretly) designated beforehand to be revived in the event of heads.
  - Morendil 12 May 2010 17:22 UTC
    0 points
    Parent
    
    Your filling in is not quite what I had in mind: When I said “one is randomly selected to be revived” I meant to imply “none of the others are revived”.
    
    How do the variations you suggest make a difference? Do you agree with my conclusions in my own variant?
    - AlephNeil 12 May 2010 17:36 UTC
      0 points
      Parent
      Well, as I’m sure you’ve guessed my aim is to present the “1/2”-er with a ‘smooth spectrum’ of scenarios beginning with something that’s obviously ¹⁄₃ (or in this case ¹⁄₁₁) and ending with something isomorphic to the Sleeping Beauty puzzle, and challenging them to say where along this spectrum the “1/3″-er’s argument breaks down.
      
      In the case of Variation Morendil… hmm, I think the Bayesian reasoning for Variation Alpha goes through just the same, and the answer is ¹⁄₁₁. Doesn’t it? (Does it make a difference if the patients know about the scenario beforehand, rather than being told about it only in the questionnaire? I don’t think so. So pretend they are told beforehand...)
      - Morendil 12 May 2010 17:52 UTC
        0 points
        Parent
        Effectively, either variant comes down to being told: “A fair coin has been flipped, and depending on the result of that flip you are either one of a group of 10 people or a lone subject, what credence do you have in being on the small-group branch?”
        
        It doesn’t seem obvious to me why, in such a situation, I should answer other than ¹⁄₂, so I’m still interested in what makes it obvious to you.
        AlephNeil 12 May 2010 18:00 UTC
        0 points
        Parent
        OK, well let’s start with Variation Alpha’. Consider that there are 20 equally likely possibilities, which we can label (x, y) where x belongs to {heads, tails} and y belongs to {1, …, 10}. Being in possibility (x, y) means “x is the result of the coin toss and y denotes the person we selected beforehand to be revived in the event of heads.”
        
        Suppose that (like Patrick McGoohan) you are number 6. Then out of the 20 possibilities, there are 11 in which you are revived, namely (heads, 6) and (tails, 1) to (tails, 10). Therefore, applying Bayes’ theorem, given that you are revived, the probability of heads is ¹⁄₁₁.
        What links here?
        AlephNeil's comment on Beauty quips, “I’d shut up and multiply!” by neq1 (12 May 2010 18:24 UTC; 0 points)
        Morendil 12 May 2010 22:21 UTC
        2 points
        Parent
        OK. I have a quibble with your formalization but I get a similar result when working it out formally: if my background information consists of the Alpha procedure, then updating on being revived does give me ¹⁄₁₁.
        
        The quibble is that I only know, algebrically, to condition on something that is a variable, so to work out the joint probability distribution at issue I had to introduce the variable z, with values {revived, not revived}. The triplet (H,3,NR) codes for “the coin comes up heads, person 3 gets picked to be revived in the event of heads, and I don’t get revived”. (Clearly this entails that I’m not person 3.)
        
        The joint probability distribution P(x,y,z) factors out, per the product rule, into P(x)P(y)P(z|x,y) since x and y are independent.
        
        Let’s use N=3 for the number of subjects involved, as I want to write out the full joint distribution (in case someone disagrees with that step) and N=10 makes it tedious. Arbitrarily I consider things from the perspective of Two.
        
        (H,1,R)=0
        (H,2,R)=1/6
        (H,3,R)=0
        (H,1,NR)=1/6
        (H,2,NR)=0
        (H,3,NR)=1/6
        (T,1,R)=1/6
        (T,2,R)=1/6
        (T,3,R)=1/6
        (T,1,NR)=0
        (T,2,NR)=0
        (T,3,NR)=0
        
        This seems to check out: the marginal distribution for x is the expected ⁵⁰⁄₅₀, the marginal distribution for y is uniform, it all sums up to 1, it reproduces the setup as described. The conditional distribution P(x,y|z=R) is then:
        
        (H,1)=0
        (H,2)=1/4
        (H,3)=0
        (T,1)=1/4
        (T,2)=1/4
        (T,3)=1/4
        
        Resulting in P(H|z=R)=1/4.
        
        So I agree here that “I have been revived” is proper to update on, and yields 1/(N+1) credence for the coin having come up heads. (It wasn’t obvious to me to start out, and I still don’t rule out having made a mistake somewhere.)
        
        I can see how this works out as equivalent to the variant I described, with z meaning “got the questionnaire” and y meaning “the label of the person picked to receive the questionnaire in the event of heads”. It shouldn’t matter, either, when we learn about the procedure.
        
        Variations Beta and Gamma don’t seem to introduce anything that should matter, because nothing in the original formulation hinges crucially on particular differences in the memories of the N people involved.
        
        I’m not quite sure what Delta means. My interpretation of Delta would be:
        
        We give you a handout describing the procedure, and some time to absorb it, then put you to sleep. We flip a coin; if it comes up head we wake you, if tails we make an atom-level scan of you, and create and wake N-1 copies from the original scan on successive days, inserting the original on the y-th day.
        
        The triplet (H,3,NR) codes for… um… “the coin came up heads, day 3 was picked to awaken the original me in the even of tails, I (someone other than the person to be awakened in the case of heads) was not revived”. Best I can do.
        
        Something seems to have gone awry somewhere: Delta is not formally equivalent to the previous formulations.
        
        Also, any interpretation of Delta has a big difference with Sleeping Beauty: it ends up with N distinct clones of me, whereas SB ends up with a single Beauty.
        What links here?
        Morendil's comment on Beauty quips, “I’d shut up and multiply!” by neq1 (13 May 2010 10:06 UTC; 3 points)
        Morendil's comment on Conditioning on Observers by Joanna Morningstar (13 May 2010 9:01 UTC; 0 points)
        AlephNeil 13 May 2010 3:34 UTC
        0 points
        Parent
        My description of Delta wasn’t great, to be fair. So I’ll clarify (and change it slightly) like this:
        
        If (x, y) where x is in {H, T} and y is in {1,2,3} then:
        
        If H then you are not cloned and wake up on day y. If T then a clone of you is created just before the beginning of day 1. Either you or the clone (doesn’t matter which) is woken for day 1 while the other is kept in storage. Then the one that was kept in storage is cloned just before the beginning of day 2. Etc.
        
        The idea of moving from Gamma to (my new) Delta is “it shouldn’t matter whether the clones are created right away (and possibly never used) or ‘just in time’”.
        
        Anyway, the following idea has occurred to me, for defending ¹⁄₃ as the answer to the original Sleeping Beauty problem: Imagine that there is a clock on the wall and that on any day when SB is woken, the time of day of her awakening is chosen randomly (from a uniform distribution). Then the information that SB gets on awakening is not simply “I was awakened at least once” but “I was awakened at least once at time x”...
        
        ...and I’ll leave you guys to do the calculation, but you get ¹⁄₃, not ¹⁄₂.
        Morendil 13 May 2010 8:05 UTC
        0 points
        Parent
        We still have the same problem: there is no value of z that corresponds to “I am a non-special member of the initial set of N people, and I happen to get unlucky and not be revived”. That makes Delta not equivalent to the other variants. It does very much matter whether “not revived” is subjectively possible!
        
        It feels as if this might be the same point that neq1 made earlier in answer to one of the defenses of ¹⁄₃, so I’d urge you to press on with the formalization and calculation.
        
        My take-away from the discussion (and the two occasions where I changed my mind so far) is that it confirms intuitions aren’t reliable and need to be backed by detailed formalization.
        AlephNeil 13 May 2010 8:57 UTC
        0 points
        Parent
        The calculation is a little bit awkward because seemingly one has to condition on an event of zero probability (which entails division by zero). But we can proceed as follows:
        
        Suppose the number of moments in a day is finite but ‘very large’, call it N.
        
        Let’s list all of the possible outcomes:
        
        If x = heads then SB is woken on Monday, and there are now N equally likely possibilities for when this will be.
        
        If x = tails then SB is woken on Monday and again on Tuesday. There are N^2 equally likely possibilities for the two waking times.
        
        Suppose SB wakes at time t0. Then she can reason thusly: If the coin toss was heads, then the probability of me seeing a clock show t0 was 1/N. Or if the coin toss was tails: Out of the N^2 possibilities, there are N where I see t0 on Monday and N where I see t0 on Tuesday, but I’ve double counted the case where I see t0 on both Monday and Tuesday, so in fact there are 2N-1 equally likely ways this could have happened. Note that (2N-1)/N^2 is roughly equal to 2/N.
        
        So let H be the event “coin is heads” and let T0 be the event “SB sees clock pointing to t0″.
        
        We have: P(T0 | H) = 1/N and P(T0 | ~H) = about 2/N
        
        From Bayes’ theorem: P(H | T0) / P(~H | T0) = (P(H)/P(~H)) (P(T0|H) / P(T0|~H)) = (1/2)/(1/2) (1/N)/(2/N) = 1 * ¹⁄₂ = ¹⁄₂ (roughly)
        
        So the posterior probabilities for H and ~H must be (about) ¹⁄₃ and ²⁄₃ respectively.
        
        The posterior probabilities converge to ¹⁄₃, ²⁄₃ as N goes to infinity.
        
        (Note: The reason for the discrepancy (i.e. the fact that P(H | T0) is not exactly ¹⁄₃) is that SB’s reasoning about ‘double-counting’ the instance when she is woken at t0 both times is actually invalid, and this possibility ought to be double counted. But the entire dispute centers around showing why it has to work this way in the case N = 1, so I think I’m entitled to pretend that the anti-double-counting argument is valid in order to show the contrary.)
        What links here?
        AlephNeil's comment on Conditioning on Observers by Joanna Morningstar (13 May 2010 15:28 UTC; 0 points)
        AlephNeil 13 May 2010 17:49 UTC
        0 points
        Parent
        I can reformulate the argument above much more straightforwardly:
        
        Consider the original Sleeping Beauty problem.
        
        Suppose we fix a pair of symbols {alpha, beta} and say that with probability ¹⁄₂, alpha = “Monday” and beta = “Tuesday”, and with probability ¹⁄₂ alpha = “Tuesday” and beta = “Monday”. (These events are independent of the ‘coin toss’ described in the original problem.)
        
        Sleeping beauty doesn’t know which symbol corresponds to which day. Whenever she is woken, she is shown the symbol corresponding to which day it is. Suppose she sees alpha—then she can reason as follows:
        
        If the coin was heads then my probability of being woken on day alpha was ¹⁄₂. If the coin was tails then my probability of being woken on day alpha was 1. I know that I have been woken on day alpha (and this is my only new information). Therefore, by Bayes’ theorem, the probability that the coin was heads is ¹⁄₃.
        
        (And then the final step in the argument is to say “of course it couldn’t possibly make any difference whether an ‘alpha or beta’ symbol was visible in the room.”)
        
        Now, over the course of these debates I’ve gradually become more convinced that those arguing that the standard, intuitive notion of probability becomes ambiguous in cases like this are correct, so that the problem has no definitive solution. This makes me a little suspicious of the argument above—surely the 1/2-er should be able to write something equally “rigorous”.
        Morendil 13 May 2010 9:07 UTC
        0 points
        Parent
        Sorry, I meant to say I’d urge you to press on with the formalization and calculation in your interpretation of the Delta case.
        
        I’ll punt on the wall-clock idea. I’m not planning to spend any time working out the formalization for anything that involves large numbers of values for any given variable—my skills aren’t up to doing that confidently, and we seem to have enough to go on with formulations of the problem that only involve smaller sets.
        Expand this thread
        AlephNeil 13 May 2010 9:20 UTC
        0 points
        Parent
        OK but intuitively it can’t make any difference whether SB is woken at a fixed or a random time of day, and it can’t make any difference whether there is a clock on the wall.
        
        So the solution to the ‘random-waking, clock on wall variation’ must be the same as the solution of the original SB problem.
        Morendil 13 May 2010 13:06 UTC
        0 points
        Parent
        See this for a crisp, simple formalization which appears to show where the ambiguity between ¹⁄₂ and ¹⁄₃ comes from.
        neq1 12 May 2010 22:28 UTC
        1 point
        Parent
        If you are the person that was selected beforehand to be revived in the event of heads, then I agree with ¹⁄₁₁. Unfortunately, in variation beta we lose the ability to label someone ahead of time. This changes things.
        AlephNeil 13 May 2010 3:11 UTC
        1 point
        Parent
        No it doesn’t. Your clones are subjectively indistinguishable from you, but they’re all in different places at least. Perhaps they’re in rooms labelled 1-10, but not allowed to go outside and look at the number. So the experimenters can toss a D10 and randomly choose a subject without breaking the ‘clone condition’.