I appreciate your reply. But I'm still not clear on it. Because of this: "At lea...

ced · on Sept 14, 2010

Three scenarios remain: [B,B], [B,G], [G,B], so the answer is 1/3.

Maybe you were thinking that the order doesn't matter. In that case, what was the probability of getting a boy and a girl, in any order? 1/4 + 1/4 = 1/2. So that's still twice as likely as getting two boys, and that ratio (2:1) will still hold after eliminating [G,G]. You again get 1/3.

Some people find it easier to picture it in terms of frequencies. Imagine 1000 families. What fraction of them have two boys, among those that have at least one boy?

iliketosleep · on Sept 14, 2010

ced, yes i was thinking that the order doesn't matter. I think this is what it comes down to. Do you think that order matters? If so, why?

yes i do find it easier to picture in terms of frequencies. in this case, take 1000 families which fulfill the criteria of "2 children with at least 1 boy". what is the probability that a family will have 2 boys? we have not sampled randomly. we have sampled according to the "2 children with at least 1 boy" criteria. we are not dealing with two random variables. one variable is fixed and we sampled according to it. now we are working with one independent random variable within that sample. that random variable has P = 1/2.

is there a flaw in my logic? if there is, please highlight it. i think the main confusion is: 1. we have sampled according to particular criteria. 2. we need to calculate a probability within that sample. NOT the population that sample was taken from.

tel · on Sept 14, 2010

In a sampling of 1000 families, the expected values of each kind of family is as follows:

  2xB : 250
  1xB, 1xG: 500
  2xG : 250

Sampling this population ignoring any family that has no boys leads to the probabilities

  2xB : 1/3
  1xB, 1xG: 2/3rds

You're still looking at the same probabilities; the models agree.

I don't fully understand your two random variables formulation. I think the confusion you're getting at is that there is an assumption that the chance of any given birth being male is theta = 0.5. The question however is not

"I have two children, at least one is a boy, what is the probability that my next child is a boy?"

It instead has to do with binomial probabilities on the space of a few repeated trials under parameter theta. The distribution is no longer flat.

Here's a more stark example of a similar form.

"I have 300 children, and at least 1 is a boy. What are the odds that I have no girls?"

iliketosleep · on Sept 14, 2010

ok, you stated this very well. It's now clear where the confusions arises: "Sampling this population ignoring any family that has no boys." Yes, with this interpretation the answer is 1/3, but it's contrary to my interpretation.

Actually, for anyone who is interested, see "Boy or Girl paradox". There is literature on this which discusses the different interpretations.

tel · on Sept 15, 2010

That sampling arises because being part of the population which has no boys is * necessary and sufficient* to (truthfully) make the statement that forms the paradox.

The alternative interpretation of the paradox arises when the wording of the paradox is construed to identify one of the children as male or female. In this case (stating something like "my first child is male"), being part of the population (x \in {BB, BG}) is necessary and sufficient and leads to the 1/2 probability of having two boys.

In short, the question becomes whether you believe the child is identified in the wording of the question. Honestly, the author of the paradox goes pretty far out of their way to say "at least one of the children is male" avoiding that identification.