
Actually, most major experiments in particle physics these days (including the OPERA experiment) avoid this sort of confirmation bias by being run "blind." The scientists write all of their data reduction pipelines before taking any actual data and test their pipelines on simulated data. When they are confident that their pipeline is running as expected they run the experiment, put the data through their pipeline and publish the result, no matter how unexpected it is.

As the OPERA result showed, blind analysis has a downside: if you don't understand everything in your experiment perfectly (which is difficult in a very large, complicated experiment), you run the risk of embarrassing yourself by making an obvious-in-retrospect mistake and publishing an obviously absurd result. But in the long run that's not a bad price to pay to avoid the sort of confirmation bias Feynman was talking about.
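
A minimal sketch of that blind workflow in Python (the pipeline, the cut values, and the file name below are all invented for illustration; the point is only that the analysis code is frozen and validated on simulation before anyone looks at real data):

    import numpy as np

    def reduction_pipeline(events):
        # Toy "analysis": apply quality cuts and estimate the mean of the
        # quantity of interest, with a bootstrap uncertainty. Every cut
        # value here is fixed before any real data are taken.
        good = events[(events > 0.0) & (events < 10.0)]
        boot = np.random.default_rng(0).choice(good, (1000, good.size)).mean(axis=1)
        return good.mean(), boot.std()

    # 1. Validate the frozen pipeline on simulated data with a known answer.
    sim = np.random.default_rng(1).normal(loc=5.0, scale=1.0, size=10_000)
    estimate, error = reduction_pipeline(sim)
    assert abs(estimate - 5.0) < 5 * error   # recovers the truth it was fed

    # 2. Only then run it, unchanged, on the real data and report the output,
    #    whatever it turns out to be.
    real = np.loadtxt("real_data.txt")       # hypothetical data file
    print(reduction_pipeline(real))

The discipline is entirely in the second step: once the box is opened, nothing upstream of the printed number gets changed.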




Physics is far ahead of other disciplines in this regard. Choosing your statistical test after you gather the data, selectively removing "outliers" after the fact, non-blind interpretation of images by people who have a stake in the outcome, and publishing only statistically significant results are all par for the course in, e.g., neuroscience.
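
For anyone who wants to see how much those freedoms buy, here is a small Python simulation (the sample sizes and cut-offs are made up). Both groups are drawn from the same distribution, so there is no real effect, yet trying a few post-hoc choices (drop "outliers" or not, t-test or Mann-Whitney) and reporting whichever one "works" pushes the false-positive rate noticeably above the nominal 5%:

    import numpy as np
    from scipy import stats

    rng = np.random.default_rng(42)
    n_experiments = 2000
    false_positives = 0

    for _ in range(n_experiments):
        # Two groups from the *same* distribution: the null hypothesis is true.
        a, b = rng.normal(size=(2, 30))
        pvals = []
        for trim in (False, True):                 # "outlier" removal chosen post hoc
            x = a[np.abs(a) < 2.0] if trim else a
            y = b[np.abs(b) < 2.0] if trim else b
            pvals.append(stats.ttest_ind(x, y).pvalue)     # test chosen post hoc
            pvals.append(stats.mannwhitneyu(x, y).pvalue)
        if min(pvals) < 0.05:                      # report whichever analysis "worked"
            false_positives += 1

    print(false_positives / n_experiments)   # noticeably above the nominal 0.05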


This paper [1] points out that commonly-used measures of statistical significance are downright meaningless when additional degrees of freedom are hidden in the way you describe.

[1] http://people.psych.cornell.edu/~jec7/pcd%20pubs/simmonsetal...


It's even worse than that paper describes, and it's something every statistics 101 class worth its salt points out: if you are allowed to choose a statistical test after you've gathered your data, you can prove any conclusion you want with arbitrarily high confidence. Note that the paper does not list choosing the test before gathering the data among its requirements. The only way to do meaningful statistics is the way splat describes: specify exactly how you're going to analyze the data before you gather it, send the paper to a journal that decides whether to publish it before the data exist, and then complete the paper by actually running the experiment and adding the data.
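
To make the "arbitrarily high confidence" point concrete, here is a small Python sketch (everything in it is invented for illustration). Every "outcome variable" below is pure noise, yet if you are free to pick which analysis to report after seeing the results, the best p-value shrinks roughly like 1/(number of candidate analyses), so any significance threshold can be beaten by trying enough of them:

    import numpy as np
    from scipy import stats

    rng = np.random.default_rng(0)
    n_subjects = 50

    for n_candidates in (10, 100, 1000, 10000):
        # n_candidates "outcome variables", every single one of them pure noise.
        data = rng.normal(size=(n_candidates, n_subjects))
        pvals = stats.ttest_1samp(data, popmean=0.0, axis=1).pvalue
        # Choose the analysis after seeing the results: report the best p-value.
        print(n_candidates, "candidate analyses -> best p =", pvals.min())

Pre-registration in the sense described above closes exactly this loophole: the one test that will be reported is fixed before the data exist.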


>The scientists write all of their data reduction pipelines before taking any actual data and test their pipelines on simulated data. When they are confident that their pipeline is running as expected they run the experiment, put the data through their pipeline and publish the result, no matter how unexpected it is.

How does that help when the prediction matches the measurement but the experiment is flawed?


It doesn't. Sometimes two wrongs (the prediction and the measurement) make a right.



