Hacker News new | past | comments | ask | show | jobs | submit login

It always helps me to picture a probability space (as in, an actual physical space, where size correlates to probability).

Assuming that children are always a boy or a girl, and are equally likely to be a boy or a girl, the 'complete space' of families with two children would be distributed as ABBC where A means two boys, B means a boy and a girl, and C means two girls. By excluding C from the space, ABB remains. So indeed, the probability would be 1/3 that both children would be boys.

But now I have made another assumption, namely that the family is picked uniformly random from all families with two children with at least one boy.

As you can see, there are quite a lot of assumptions. One convention seems to be that when you can't tell for sure how something is distributed, it is uniformly distributed.

For example, when a person blindly grabs a ball from a box with one red and one blue ball (and no other balls), the probability is not always 50% that he grabs the red ball. The red ball might be a bowling ball and the blue one might be the size of a marble.

While in this example it is reasonable to assume a uniform distribution, you can point out similar, but more subtle assumptions in most questions about probability.

A more complicated example: Weatherman A predicts the weather right 70% of the time. Weatherman B predicts the weather right 60% of the time. Weatherman A predicts rain tomorrow, weatherman B predicts dry weather. What is the probability that it will rain tomorrow?

The 'right' answer is 14/23. In order to arrive at this answer, you need to assume that the predictions are statistically independent (which is an unrealistic assumption). Indeed, it is easy to sketch a situation in which the probability is different: it is always dry, weatherman A predicts rain 30% of the time, and weatherman B predicts rain 40% of the time. This is consistent with the question, and the probability that there will be rain tomorrow is obviously 0%.

This lack of rigor always bothers me.




Your second example has completely different assumptions than your first. To get 14/23, you don't have to assume that the predictions are independent, only that they are independent given future weather.

Let W be 1 if it will rain tomorrow and 0 if it will not rain, with 0.5 probability either way. Let A be weatherman A's prediction and let B be weatherman B's prediction. We have

  P(W = 0) = P(W = 1) = 0.5
  P(A = w | W = w) = 0.7 # Weatherman A is right 70% of the time
  P(B = w | W = w) = 0.6 # Weatherman B is right 60% of the time

  P(A, B | W) = P(A | W) P(B | W)
Now we want to know "Weatherman A predicts rain tomorrow, weatherman B predicts dry weather. What is the probability that it will rain tomorrow?"

That is, what is P(W=1 | A=1, B=0)?

    P(W=1 | A=1, B=0) = P(A=1, B=0 | W=1) P(W=1) / P(A=1, B=0)
    = P(A=1 | W=1) P(B=0 | W=1) P(W=1) / (sum_w P(A=1, B=0|w)P(w))
    = 0.7*0.4*0.5 / (P(A=1|w=0)P(B=0|w=0)P(w=0) + P(A=1|w=1)P(B=0|w=1)P(w=1))
    = 14/23
If you don't like the assumption that the predictions are statistically independent (herding, etc), then you just have to come up with the conditional joint P(A,B | W). That wouldn't be difficult given a small amount of data since W is binary. You would put a dirichlet prior on the distribution (basically just a beta distribution with an additional dimension) and essentially just count the times each triple (a, b, w) happens.

Still, the problem isn't a lack of rigor, it's a lack of clarity in stated assumptions.

To be specific, you didn't state the assumption that P(w)=0.5, which you used to compute 14/23.


I don't understand your point. I was arguing that the way the problem is posed seems to imply a unique solution. I was showing the problem was ill-posed, and that many probability problems have similar subtle or less subtle hidden assumptions. Here, this assumption is P(A, B | W) = P(A | W) P(B | W). I don't think you even need P(W = 0) = P(W = 1) = 0.5.

These assumptions usually seem quite natural to make, but can be very unrealistic (why would the predictions of weather men be independent? I would bet they are not in reality). This is a very bad to teach students. It is always very important to know which assumptions you are making, and if textbooks do this wrong, it will be nearly impossible for students to get this right.

I would think of a student which struggles with this problem as a better mathematician than the student which just uses P(A, B | W) = P(A | W) P(B | W) 'because the formula is in his textbook', but the second student is more likely to get rewarded (especially in American education, since the USA seems to be especially fond of textbooks which give a ready-made recipe for every problem that a student is supposed to solve).


> I don't think you even need P(W = 0) = P(W = 1) = 0.5.

You do,

In the article, the prior of interest is the prior gender of a child, which can be safely assumed to be 0.5. Similarly, independence of children's genders is a very good approximation as well. It wasn't necessary for the article to state these assumptions because they are obvious common knowledge.




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: