EHeller comments on Bayes’ Theorem Illustrated (My Way)

EHeller 22 Dec 2013 18:36 UTC
2 points
0
As long as you realize there is a difference between those two questions, fine. We can disagree about what assumptions the wording should lead us to; that's irrelevant to the actual statistics and can be an agree-to-disagree situation. It's just important to realize that what the question means, and how you get the information, matters.

We don’t have 1000 families with two children, from which we’ve selected all families that have at least one boy (which gives ¹⁄₃ probability). We have one family with two children. Then we are told one of the children is a boy, and given zero other information.

If we have one family with two children, of which one is a boy, they are (by definition) a member of the set “all families that have at least one boy.” So it matters how we got the information.

If we got that information by grabbing a kid at random and looking at it (so we have information about one specific child), that is sampling, and it leads to the ¹⁄₂ probability.

If we got that information by having someone check both kids and tell us “at least one is a boy,” we have different information (it's information about the set of kids the parents have, not information about one specific kid).

This is NOT a case of sampling.

If it IS sampling (if I grab a kid at random and say “what's your birthday?” and it happens to be Tuesday), then the probability is ¹⁄₂. (We have information about the specific kid’s birthday.)

If instead, I ask the parents to tell me the birthday of one of their children, and the parent says ‘I have at least one boy born on Tuesday’, then we get, instead, information about their set of kids, and the probability is the larger number.

Sampling is what leads to the answer you are supporting.
- bigjeff5 22 Dec 2013 18:54 UTC
  0 points
  0
  Parent
  The answer I’m supporting is based on flat priors, not sampling. I’m saying there are two possible Boy/Boy combinations, not one, and therefore it takes up half the probability space, not ¹⁄₃.
  
  Applying sampling to the “Boy on Tuesday” problem gives roughly 48% (as per the original article), not 50%.
  
  We are simply told that the man has a boy who was born on Tuesday. We aren’t told how he chose that boy, whether he’s older or younger, etc. Therefore we have four possibilities, like I outlined above.
  
  Is my analysis that the possibilities are Boy (Tu) /Girl, Girl / Boy (Tu), Boy (Tu)/Boy, Boy/Boy (Tu) correct?
  
  If so, is not the probability for some combination of Boy/Boy 1/2? If not, why not? I don’t see it.
  
  BTW, contrary to my previous posts, having the information about the boy born on Tuesday is critical because it allows us (and in fact requires us) to distinguish between the two boys.
  
  That was in fact the point of the original article, which I now disagree with significantly less. In fact, I agree with the major premise that the Tuesday information pushes the odds of Boy/Boy closer to 50%; I just disagree that you can’t reason that it pushes it to exactly 50%.
  - EHeller 22 Dec 2013 21:25 UTC
    2 points
    0
    Parent
    
    Is my analysis that the possibilities are Boy (Tu) /Girl, Girl / Boy (Tu), Boy (Tu)/Boy, Boy/Boy (Tu) correct?
    
    No. For any day of the week EXCEPT Tuesday, boy and girl are equivalent. For the case of both children born on Tuesday you have for girls: Boy(tu)/Girl(tu), Girl(tu)/Boy(tu), and for boys: Boy(tu)/Boy(tu).
    
    That was in fact the point of the original article, which I now disagree with significantly less. In fact, I agree with the major premise that the Tuesday information pushes the odds of Boy/Boy closer to 50%; I just disagree that you can’t reason that it pushes it to exactly 50%.
    
    This statement leads me to believe you are still confused. Do you agree that if I know a family has two kids, I knock on the door and a boy answers and says “I was born on a Tuesday,” that the probability of the second kid being a girl is 1/2? And in this case, Tuesday is irrelevant? (This is what Wikipedia called “sampling”.)
    
    Do you agree that if, instead, the parents give you the information “one of my two kids is a boy born on a Tuesday”, that this is a different sort of information, information about the set of their children, and not about a specific child?
    - bigjeff5 22 Dec 2013 21:54 UTC
      0 points
      0
      Parent
      
      This statement leads me to believe you are still confused. Do you agree that if I know a family has two kids, I knock on the door and a boy answers and says “I was born on a Tuesday,” that the probability of the second kid being a girl is 1/2? And in this case, Tuesday is irrelevant? (This is what Wikipedia called “sampling”.)
      
      I agree with this.
      
      Do you agree that if, instead, the parents give you the information “one of my two kids is a boy born on a Tuesday”, that this is a different sort of information, information about the set of their children, and not about a specific child?
      
      I agree with this if they said something along the lines of “One and only one of them was born on Tuesday”. If not, I don’t see how the Boy(tu)/Boy(tu) configuration has the same probability as the others, because it’s twice as likely as the other two configurations that that is the configuration they are talking about when they say “One was born on Tuesday”.
      
      Here’s my breakdown with 1000 families, to try to make it clear what I mean:
      
      1000 Families with two children, 750 have boys.
      
      Of the 750, 500 have one boy and one girl. Of these 500, ¹⁄₇, or roughly 71 have a boy born on Tuesday.
      
      Of the 750, 250 have two boys. Of these 250, ²⁄₇, or roughly 71 have a boy born on Tuesday.
      
      71 = 71, so it’s equally likely that there are two boys as there are a boy and a girl.
      
      Having two boys doubles the probability that one boy was born on Tuesday compared to having just one boy.
      
      And I don’t think I’m confused about the sampling, because I didn’t use the sampling reasoning to get my result*, but I’m not super confident about that so if I am just keep giving me numbers and hopefully it will click.
      
      *I mean in the previous post, not specifically this post.
      - EHeller 22 Dec 2013 22:18 UTC
        2 points
        0
        Parent
        
        Of these 250, ²⁄₇, or roughly 71 have a boy born on Tuesday.
        
        This is wrong. With two boys, each with a probability of ¹⁄₇ to be born on Tuesday, the probability of at least one being born on a Tuesday isn’t ²⁄₇, it's 1-(6/7)^2.
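
        As a rough check, the 1000-family breakdown quoted above can be redone with exact fractions, swapping in 1-(6/7)^2 for the two-boy case. This is only a sketch: the variable names are made up for illustration, and it assumes weekdays are equally likely and independent between siblings.

        ```python
        from fractions import Fraction

        p_tuesday = Fraction(1, 7)

        # Same starting point as the breakdown: 1000 two-child families.
        mixed_families = 500    # one boy, one girl
        two_boy_families = 250  # two boys

        # One boy: he is born on Tuesday with probability 1/7.
        mixed_with_tuesday_boy = mixed_families * p_tuesday

        # Two boys: P(at least one born on Tuesday) = 1 - P(neither is).
        p_at_least_one = 1 - (1 - p_tuesday) ** 2
        two_boys_with_tuesday_boy = two_boy_families * p_at_least_one

        p_two_boys = two_boys_with_tuesday_boy / (
            two_boys_with_tuesday_boy + mixed_with_tuesday_boy
        )

        print(p_at_least_one)                    # 13/49, not 2/7 = 14/49
        print(float(mixed_with_tuesday_boy))     # about 71.4 families
        print(float(two_boys_with_tuesday_boy))  # about 66.3 families
        print(p_two_boys)                        # 13/27
        ```

        So the two groups are not equal in size: about 66 two-boy families against about 71 boy/girl families, which gives 13/27 rather than 1/2.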
        bigjeff5 22 Dec 2013 22:52 UTC
        −1 points
        0
        Parent
        How can that be? There is a ¹⁄₇ chance that one of the two is born on Tuesday, and there is a ¹⁄₇ chance that the other is born on Tuesday. ¹⁄₇ + ¹⁄₇ is ²⁄₇.
        
        There is also a ¹⁄₄₉ chance that both are born on Tuesday, but how does that subtract from the other two numbers? It doesn’t change the probability that either of them is born on Tuesday, and both of those probabilities add.
        trist 22 Dec 2013 22:58 UTC
        4 points
        0
        Parent
        The problem is that you’re counting that 1/49th chance twice. Once for the first brother and once for the second.
        bigjeff5 22 Dec 2013 23:29 UTC
        0 points
        0
        Parent
        I see that now, it took a LOT for me to get it for some reason.
        EHeller 22 Dec 2013 22:58 UTC
        2 points
        0
        Parent
        You're overcounting: the case where both are born on Tuesday is counted twice there. Think of it this way: if I have 8 kids, do I have a better than 100% probability of having a kid born on Tuesday?
        
        There is a 1/7 × 6/7 chance the first is born on Tuesday and the second is born on another day. There is a 1/7 × 6/7 chance the second is born on Tuesday and the first is born on another day. And there is a ¹⁄₄₉ chance that both are born on Tuesday.
        
        All together that's ¹³⁄₄₉. Alternatively, there is a (6/7)^2 chance that both are born not-on-Tuesday, so 1-(6/7)^2 tells you the complementary probability.
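
        The 8-kid sanity check can be spelled out in a few lines. This is an illustrative sketch with made-up helper names, assuming each kid's weekday is uniform and independent of the others.

        ```python
        from fractions import Fraction


        def p_at_least_one_tuesday(n_kids):
            """Chance that at least one of n_kids is born on a Tuesday."""
            return 1 - Fraction(6, 7) ** n_kids


        def naive_sum(n_kids):
            """The overcounting estimate: just add 1/7 per kid."""
            return Fraction(n_kids, 7)


        # Two kids: adding 1/7 + 1/7 counts the both-on-Tuesday case (1/49) twice.
        assert naive_sum(2) - Fraction(1, 49) == p_at_least_one_tuesday(2) == Fraction(13, 49)

        for n in (1, 2, 8):
            print(n, naive_sum(n), float(p_at_least_one_tuesday(n)))
        ```

        With 8 kids the naive sum is 8/7, which cannot be a probability, while the complement form stays below 1.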
        bigjeff5 22 Dec 2013 23:29 UTC
        1 point
        0
        Parent
        Wow.
        
        I’ve seen that same explanation at least five times and it didn’t click until just now. You can’t distinguish between the two on Tuesday, so you can only count it once for the pair.
        
        Which means the article I said was wrong was absolutely right, and if you were told that, say, one boy was born on January 17th, the chances of at least one of two boys being born on that day are 1-(364/365)^2 (ignoring leap years), which gives a final probability of roughly 49.97% that both are boys.
        
        Thanks for your patience!
        
        ETA: I also think I see where I’m going wrong with the terminology—sampling vs not sampling, but I’m not 100% there yet.
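
        For anyone who wants to try other details (a weekday, a calendar date, and so on), here is a small sketch of the same calculation for a detail that each boy has with probability p. The function name is hypothetical, and it assumes the "information about the set of kids" reading, independent siblings, and equally likely boys and girls.

        ```python
        from fractions import Fraction


        def p_two_boys_given_detail(p_detail):
            """P(two boys | at least one boy has a detail each boy has with probability p_detail)."""
            two_boys = Fraction(1, 4) * (1 - (1 - p_detail) ** 2)  # boy/boy, at least one matches
            one_boy = Fraction(1, 2) * p_detail                    # boy/girl or girl/boy, the boy matches
            return two_boys / (two_boys + one_boy)


        print(p_two_boys_given_detail(Fraction(1, 1)))    # 1/3: "at least one is a boy"
        print(p_two_boys_given_detail(Fraction(1, 7)))    # 13/27: born on Tuesday
        print(p_two_boys_given_detail(Fraction(1, 365)))  # 729/1459: born on a given date
        ```

        The rarer the detail, the closer the answer gets to 1/2 without reaching it; under these assumptions the given-date case works out to 729/1459, just under one half.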