EHeller comments on Bayes’ Theorem Illustrated (My Way)

EHeller 22 Dec 2013 16:58 UTC
1 point

No, it’s the exact same question, only the labels are different.

No, it isn’t. You should consider that you are disagreeing with a pretty standard stats question, so odds are high you are wrong. With that in mind, you should reread what people are telling you here.

Now, consider “I flip two coins.” The possible outcomes are hh, ht, th, tt.

I hope we can agree on that much.

Now, I give you more information and I say “one of the coins is heads,” so we Bayesian update by crossing out any scenario where one coin isn’t heads. There is only one (tt), which leaves:

hh, ht, th

So it should be pretty clear the probability I flipped two heads is ¹⁄₃.

Now, your scenario, flipped two coins (hh,ht,th,tt), and I give you the information “the first coin is heads,” so we cross out everything where the first coin is tails, leaving (hh,ht). Now the probability you flipped two heads is ¹⁄₂.
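
For anyone who would rather check the crossing-out by machine, here is a minimal Python sketch of the two updates described above. It assumes fair, independent coins; the names `outcomes` and `prob_two_heads` are illustrative and not from the thread.

```python
from fractions import Fraction
from itertools import product

# All four equally likely outcomes of two fair coin flips: hh, ht, th, tt.
outcomes = ["".join(flips) for flips in product("ht", repeat=2)]


def prob_two_heads(condition):
    """P(hh | condition), found by crossing out outcomes that fail the condition."""
    remaining = [o for o in outcomes if condition(o)]
    return Fraction(remaining.count("hh"), len(remaining))


# "One of the coins is heads": only tt is crossed out.
at_least_one_heads = prob_two_heads(lambda o: "h" in o)

# "The first coin is heads": th and tt are crossed out.
first_coin_heads = prob_two_heads(lambda o: o[0] == "h")

print(at_least_one_heads)  # 1/3
print(first_coin_heads)    # 1/2
```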

I don’t know how to make this any more simple.
- bigjeff5 22 Dec 2013 17:08 UTC
  −1 points
  Parent
  http://en.wikipedia.org/wiki/Boy_or_Girl_paradox
  
  I know it’s not the be-all and end-all, but it’s generally reliable on these types of questions, and it gives P = ¹⁄₂, so I’m not the one disagreeing with the standard result here.
  
  Do the math yourself, it’s pretty clear.
  
  Edit: Reading closer, I should say that both answers are right, and the probability can be either ¹⁄₂ or ¹⁄₃ depending on your assumptions. However, the problem as stated falls best to me in the ¹⁄₂ set of assumptions. You are told one child is a boy and given no other information, so the only probability left for the second child is a 50% chance for boy.
  - EHeller 22 Dec 2013 17:16 UTC
    1 point
    Parent
    
    http://en.wikipedia.org/wiki/Boy_or_Girl_paradox
    
    Did you actually read it? It does not agree with you. Look under the heading “second question.”
    
    Do the math yourself, it’s pretty clear.
    
    I did the math in the post above, enumerating the possibilities for you to try to help you find your mistake.
    
    Edit, in response to the edit:
    
    I should say that both answers are right, and the probability can be either ¹⁄₂ or ¹⁄₃ depending on your assumptions.
    
    Which is exactly analogous to what Jiro was saying about the Tuesday question. So we all agree now? Tuesday can raise your probability slightly above 50%, as was said all along.
    
    However, the problem as stated falls best to me in the ¹⁄₂ set of assumptions. You are told one child is a boy and given no other information, so the only probability left for the second child is a 50% chance for boy.
    
    And you are immediately making the exact same mistake again. You are told ONE child is a boy, you are NOT told the FIRST child is a boy. You do understand that these are different?
    - bigjeff5 22 Dec 2013 17:17 UTC
      0 points
      Parent
      Re-read it.
    - bigjeff5 22 Dec 2013 17:23 UTC
      −2 points
      Parent
      The relevant quote from the Wiki:
      
      The paradox arises because the second assumption is somewhat artificial, and when describing the problem in an actual setting things get a bit sticky. Just how do we know that “at least” one is a boy? One description of the problem states that we look into a window, see only one child and it is a boy. This sounds like the same assumption. However, this one is equivalent to “sampling” the distribution (i.e. removing one child from the urn, ascertaining that it is a boy, then replacing). Let’s call the statement “the sample is a boy” proposition “b”. Now we have: P(BB|b) = P(b|BB) × P(BB) / P(b) = 1 × ¹⁄₄ / ¹⁄₂ = ¹⁄₂. The difference here is the P(b), which is just the probability of drawing a boy from all possible cases (i.e. without the “at least”), which is clearly 0.5. The Bayesian analysis generalizes easily to the case in which we relax the 50/50 population assumption. If we have no information about the populations then we assume a “flat prior”, i.e. P(GG) = P(BB) = P(G.B) = ¹⁄₃. In this case the “at least” assumption produces the result P(BB|B) = ¹⁄₂, and the sampling assumption produces P(BB|b) = ²⁄₃, a result also derivable from the Rule of Succession.
      
      We have no general population information here. We have one man with at least one boy.
      - EHeller 22 Dec 2013 17:39 UTC
        1 point
        Parent
        I’m not at all sure you understand that quote. Let’s stick with the coin flips:
        
        Do you understand why these two questions are different? I tell you, “I flipped two coins, at least one of them came out heads. What is the probability that I flipped two heads?” A: ¹⁄₃. AND “I flipped two coins, you choose one at random and look at it, and it’s heads. What is the probability I flipped two heads?” A: ¹⁄₂.
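
The difference between the two questions can be sketched in a few lines of Python. This assumes fair coins and, for the second question, that each of the two coins is equally likely to be the one looked at; the variable names are illustrative only.

```python
from fractions import Fraction
from itertools import product

quarter = Fraction(1, 4)                  # prior probability of each outcome
outcomes = list(product("ht", repeat=2))  # (h,h), (h,t), (t,h), (t,t)

# Question 1: someone inspects both coins and reports "at least one is heads".
# Every outcome containing a head is equally consistent with that report.
told = {o: quarter for o in outcomes if "h" in o}
p_told = told[("h", "h")] / sum(told.values())

# Question 2: one coin is picked at random and turns out to be heads.
# Each outcome is weighted by the chance that the picked coin shows heads.
looked = {o: quarter * Fraction(o.count("h"), 2) for o in outcomes}
p_looked = looked[("h", "h")] / sum(looked.values())

print(p_told)    # 1/3
print(p_looked)  # 1/2
```

In the second model hh is twice as likely as ht or th to produce the observation, which is what moves the answer from ¹⁄₃ to ¹⁄₂.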
        bigjeff5 22 Dec 2013 18:11 UTC
        0 points
        Parent
        For the record, I’m sure this is frustrating as all getout for you, but this whole argument has really clarified things for me, even though I still think I’m right about which question we are answering.
        
        Many of my arguments in previous posts are wrong (or at least incomplete and a bit naive), and it didn’t click until the last post or two.
        
        Like I said, I still think I’m right, but not because my prior analysis was any good. The ¹⁄₃ case was a major hole in my reasoning. I’m happily waiting to see if you’re going to destroy my latest analysis, but I think it is pretty solid.
        bigjeff5 22 Dec 2013 17:48 UTC
        0 points
        Parent
        Yes, and we are dealing with the second question here.
        
        Is that not what I said before?
        
        We don’t have 1000 families with two children, from which we’ve selected all families that have at least one boy (which gives ¹⁄₃ probability). We have one family with two children. Then we are told one of the children is a boy, and given zero other information. The probability that the second is a boy is ¹⁄₂, so the probability that both are boys is ¹⁄₂.
        
        The possible options for the “Boy born on Tuesday” are not Boy/Girl, Girl/Boy, Boy/Boy. That would be the case in the selection of 1000 families above.
        
        The possible options are Boy (Tu) / Girl, Girl / Boy (Tu), Boy (Tu) / Boy, Boy / Boy (Tu).
        
        There are two Boy/Boy combinations, not one. You don’t have enough information to throw one of them out.
        
        This is NOT a case of sampling.
        EHeller 22 Dec 2013 18:36 UTC
        2 points
        Parent
        As long as you realize there is a difference between those two questions, fine. We can disagree about what assumptions the wording should lead us to; that’s irrelevant to the actual statistics and can be an agree-to-disagree situation. It’s just important to realize that what the question means, and how you get the information, matters.
        
        We don’t have 1000 families with two children, from which we’ve selected all families that have at least one boy (which gives ¹⁄₃ probability). We have one family with two children. Then we are told one of the children is a boy, and given zero other information.
        
        If we have one family with two children, of which one is a boy, they are (by definition) a member of the set “all families that have at least one boy.” So it matters how we got the information.
        
        If we got that information by grabbing a kid at random and looking at it (so we have information about one specific child), that is sampling, and it leads to the ¹⁄₂ probability.
        
        If we got that information by having someone check both kids and tell us “at least one is a boy,” we have different information (it’s information about the set of kids the parents have, not information about one specific kid).
        
        This is NOT a case of sampling.
        
        If it IS sampling (if I grab a kid at random and say “what’s your birthday?” and it happens to be Tuesday), then the probability is ¹⁄₂ (we have information about the specific kid’s birthday).
        
        If instead, I ask the parents to tell me the birthday of one of their children, and the parent says ‘I have at least one boy born on Tuesday’, then we get, instead, information about their set of kids, and the probability is the larger number.
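
A rough Python sketch of the two ways of getting the Tuesday information, for anyone who wants to check the numbers. It assumes each child is independently a boy or girl with probability ¹⁄₂ and born on a uniformly random day of the week; labelling Tuesday as day 1 is arbitrary, and all names are illustrative.

```python
from fractions import Fraction
from itertools import product

TUESDAY = 1                                   # arbitrary label for one of 7 days
children = list(product("BG", range(7)))      # 14 equally likely (sex, day) pairs
families = list(product(children, repeat=2))  # 196 equally likely two-child families
target = ("B", TUESDAY)


def both_boys(family):
    return all(sex == "B" for sex, _ in family)


# Sampling: grab one kid at random and he turns out to be a Tuesday boy.
# Each family is weighted by the fraction of its kids who match.
sampled = {f: Fraction(f.count(target), 2) for f in families}
p_sampled = sum(w for f, w in sampled.items() if both_boys(f)) / sum(sampled.values())

# Statement about the set: the parent reports at least one Tuesday boy.
stated = [f for f in families if target in f]
p_stated = Fraction(sum(both_boys(f) for f in stated), len(stated))

print(p_sampled)  # 1/2
print(p_stated)   # 13/27
```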
        
        Sampling is what leads to the answer you are supporting.
        bigjeff5 22 Dec 2013 18:54 UTC
        0 points
        Parent
        The answer I’m supporting is based on flat priors, not sampling. I’m saying there are two possible Boy/Boy combinations, not one, and therefore it takes up half the probability space, not ¹⁄₃.
        
        Sampling to the “Boy on Tuesday” problem gives roughly 48% (as per the original article), not 50%.
        
        We are simply told that the man has a boy who was born on Tuesday. We aren’t told how he chose that boy, whether he’s older or younger, etc. Therefore we have four possibilities, like I outlined above.
        
        Is my analysis that the possibilities are Boy (Tu) /Girl, Girl / Boy (Tu), Boy (Tu)/Boy, Boy/Boy (Tu) correct?
        
        If so, is not the probability for some combination of Boy/Boy 1/2? If not, why not? I don’t see it.
        
        BTW, contrary to my previous posts, having the information about the boy born on Tuesday is critical because it allows us (and in fact requires us) to distinguish between the two boys.
        
        That was in fact the point of the original article, which I now disagree with significantly less. In fact, I agree with the major premise that the Tuesday information pushes the odds of Boy/Boy closer to 50%, I just disagree that you can’t reason that it pushes it to exactly 50%.
        EHeller 22 Dec 2013 21:25 UTC
        2 points
        Parent
        
        Is my analysis that the possibilities are Boy (Tu) /Girl, Girl / Boy (Tu), Boy (Tu)/Boy, Boy/Boy (Tu) correct?
        
        No. For any day of the week EXCEPT Tuesday, boy and girl are equivalent. For the case of both children born on Tuesday, with a girl you have two combinations, Boy(Tu)/Girl(Tu) and Girl(Tu)/Boy(Tu), but with two boys you have only one: Boy(Tu)/Boy(Tu).
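
The full list of possibilities is small enough to count directly. A sketch in Python, assuming 14 equally likely (sex, weekday) combinations per child, with day 1 standing in for Tuesday; the names are illustrative.

```python
from collections import Counter
from itertools import product

children = list(product("BG", range(7)))  # (sex, day); day 1 stands in for Tuesday
tuesday_boy = ("B", 1)

# Count ordered (first child, second child) pairs containing at least one
# Tuesday boy, grouped by the sexes of the two children.
counts = Counter(
    first[0] + second[0]
    for first, second in product(children, repeat=2)
    if tuesday_boy in (first, second)
)

print(counts["BG"], counts["GB"], counts["BB"])  # 7 7 13
print(sum(counts.values()))                      # 27
```

The two-boy count is 13 rather than 14 because Boy(Tu)/Boy(Tu) is a single pair, so two boys make up 13 of the 27 equally likely cases.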
        
        That was in fact the point of the original article, which I now disagree with significantly less. In fact, I agree with the major premise that the Tuesday information pushes the odds of Boy/Boy closer to 50%, I just disagree that you can’t reason that it pushes it to exactly 50%.
        
        This statement leads me to believe you are still confused. Do you agree that if I know a family has two kids, I knock on the door and a boy answers and says “I was born on a Tuesday,” that the probability of the second kid being a girl is ¹⁄₂? And in this case, Tuesday is irrelevant? (This is what Wikipedia called “sampling.”)
        
        Do you agree that if, instead, the parents give you the information “one of my two kids is a boy born on a Tuesday”, that this is a different sort of information, information about the set of their children, and not about a specific child?
        bigjeff5 22 Dec 2013 21:54 UTC
        0 points
        Parent
        
        This statement leads me to believe you are still confused. Do you agree that if I know a family has two kids, I knock on the door and a boy answers and says “I was born on a Tuesday,” that the probability of the second kid being a girl is ¹⁄₂? And in this case, Tuesday is irrelevant? (This is what Wikipedia called “sampling.”)
        
        I agree with this.
        
        Do you agree that if, instead, the parents give you the information “one of my two kids is a boy born on a Tuesday”, that this is a different sort of information, information about the set of their children, and not about a specific child?
        
        I agree with this if they said something along the lines of “One and only one of them was born on Tuesday”. If not, I don’t see how the Boy(tu)/Boy(tu) configuration has the same probability as the others, because it’s twice as likely as the other two configurations that that is the configuration they are talking about when they say “One was born on Tuesday”.
        
        Here’s my breakdown with 1000 families, to try to make it clear what I mean:
        
        1000 families with two children, 750 have at least one boy.
        
        Of the 750, 500 have one boy and one girl. Of these 500, ¹⁄₇, or roughly 71 have a boy born on Tuesday.
        
        Of the 750, 250 have two boys. Of these 250, ²⁄₇, or roughly 71 have a boy born on Tuesday.
        
        71 = 71, so it’s equally likely that there are two boys as there are a boy and a girl.
        
        Having two boys doubles the probability that one boy was born on Tuesday compared to having just one boy.
        
        And I don’t think I’m confused about the sampling, because I didn’t use the sampling reasoning to get my result*, but I’m not super confident about that so if I am just keep giving me numbers and hopefully it will click.
        
        *I mean in the previous post, not specifically this post.
        EHeller 22 Dec 2013 22:18 UTC
        2 points
        Parent
        
        Of these 250, ²⁄₇, or roughly 71 have a boy born on Tuesday.
        
        This is wrong. With two boys each with a probability of ¹⁄₇ to be born on Tuesday, the probability of at least one on a Tuesday isn’t ²⁄₇, it’s 1-(6/7)^2 = ¹³⁄₄₉.
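
Redoing the 1000-family breakdown with the corrected fraction can be sketched as follows. It keeps the thread's idealized counts (500 boy/girl families, 250 two-boy families) and assumes birthdays are uniform over the week; variable names are illustrative.

```python
from fractions import Fraction

p_tuesday = Fraction(1, 7)
mixed_families = 500      # one boy and one girl
two_boy_families = 250

# One boy: he is born on Tuesday with probability 1/7.
mixed_with_tuesday_boy = mixed_families * p_tuesday

# Two boys: at least one is born on Tuesday with probability 1 - (6/7)^2.
p_at_least_one = 1 - (1 - p_tuesday) ** 2
two_boys_with_tuesday_boy = two_boy_families * p_at_least_one

p_both_boys = two_boys_with_tuesday_boy / (
    mixed_with_tuesday_boy + two_boys_with_tuesday_boy
)

print(float(mixed_with_tuesday_boy))     # about 71.4 families
print(float(two_boys_with_tuesday_boy))  # about 66.3 families
print(p_both_boys)                       # 13/27
```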
        bigjeff5 22 Dec 2013 22:52 UTC
        −1 points
        Parent
        How can that be? There is a ¹⁄₇ chance that one of the two is born on Tuesday, and there is a ¹⁄₇ chance that the other is born on Tuesday. ¹⁄₇ + ¹⁄₇ is ²⁄₇.
        
        There is also a ¹⁄₄₉ chance that both are born on Tuesday, but how does that subtract from the other two numbers? It doesn’t change the probability that either of them are born on Tuesday, and both of those probabilities add.
        trist 22 Dec 2013 22:58 UTC
        4 points
        Parent
        The problem is that you’re counting that 1/49th chance twice. Once for the first brother and once for the second.
        bigjeff5 22 Dec 2013 23:29 UTC
        0 points
        Parent
        I see that now, it took a LOT for me to get it for some reason.
        EHeller 22 Dec 2013 22:58 UTC
        2 points
        Parent
        You overcount: the both-on-Tuesday case is counted twice there. Think of it this way: if I have 8 kids, do I have a better than 100% probability of having a kid born on Tuesday?
        
        There is a 1/7 × 6/7 chance the first is born on Tuesday and the second is born on another day. There is a 1/7 × 6/7 chance the second is born on Tuesday and the first is born on another day. And there is a ¹⁄₄₉ chance that both are born on Tuesday.
        
        All together that’s ¹³⁄₄₉. Alternatively, there is a (6/7)^2 chance that both are born not-on-Tuesday, so 1-(6/7)^2 tells you the complementary probability.
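
The overcounting is easy to see numerically. A small Python sketch, assuming independent birthdays uniform over the seven weekdays; the function name is illustrative.

```python
from fractions import Fraction

seventh = Fraction(1, 7)


def p_at_least_one_tuesday(n_children):
    """Complement of 'none of the n children was born on Tuesday'."""
    return 1 - Fraction(6, 7) ** n_children


# Naive addition counts the both-on-Tuesday case twice.
naive_two = seventh + seventh
exact_two = p_at_least_one_tuesday(2)

# The three disjoint cases: only the first, only the second, both.
by_cases = seventh * (1 - seventh) + (1 - seventh) * seventh + seventh * seventh

print(naive_two, exact_two, by_cases)  # 2/7 13/49 13/49
print(naive_two - exact_two)           # 1/49, the double-counted case

# With 8 kids, naive addition gives 8/7 > 1; the complement rule stays below 1.
print(float(8 * seventh))                 # about 1.143
print(float(p_at_least_one_tuesday(8)))   # about 0.709
```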
        bigjeff5 22 Dec 2013 23:29 UTC
        1 point
        Parent
        Wow.
        
        I’ve seen that same explanation at least five times and it didn’t click until just now. You can’t distinguish between the two on Tuesday, so you can only count it once for the pair.
        
        Which means the article I said was wrong was absolutely right, and if you were told that, say, one boy was born on January 17th, the chance that at least one of two boys was born on that day is 1-(364/365)^2 (ignoring leap years), which gives a final probability of roughly 49.97% that both are boys.
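
The same calculation works for any extra detail about the boy. A hedged Python sketch, assuming the detail holds independently for each child with probability `p_trait` and that boys and girls are equally likely; `p_both_boys` is an illustrative name.

```python
from fractions import Fraction


def p_both_boys(p_trait):
    """P(two boys | at least one boy with the trait), for a two-child family."""
    # Two boys (prior 1/4): at least one of them has the trait.
    at_least_one_of_two = 1 - (1 - p_trait) ** 2
    # One boy and one girl (prior 1/2): the single boy has the trait.
    # The common factor 1/4 cancels, leaving weights a and 2p.
    return at_least_one_of_two / (at_least_one_of_two + 2 * p_trait)


weekday = p_both_boys(Fraction(1, 7))
calendar_day = p_both_boys(Fraction(1, 365))

print(weekday)              # 13/27
print(calendar_day)         # 729/1459
print(float(calendar_day))  # about 0.4997
```

The rarer the detail, the closer the answer gets to ¹⁄₂ without reaching it; with no extra detail at all (`p_trait = 1`) it falls back to ¹⁄₃.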
        
        Thanks for your patience!
        
        ETA: I also think I see where I’m going wrong with the terminology—sampling vs not sampling, but I’m not 100% there yet.