Hacker News new | past | comments | ask | show | jobs | submit login

>> But if you tweak the hypothesis just a little, the data suddenly confirm it

This is 'data mining', right? I've occasionally wondered about this, since I don't work in a scientific field but did once make use of the scientific method for some research I did. And yes, the findings weren't especially conclusive, but I'm not sure I could've tweaked the hypothesis to make them work.

So, had I found something really interesting that didn't fit the hypothesis, is the 'right way' to conduct a new experiment from scratch? Say I did that and used the 'tweaked' hypothesis: of course I'd find something interesting, because it's already there.

In this new 'pre-registration' framework, how can I correct the problem and pursue the interesting idea while keeping the science intact? If I used some sort of cross-validation at the outset and I have all the data available, I presumably can't change the sample, so the hypothesis presumably has to change.




Refining an experiment is not wrong. What is wrong, like you say, is going on a fishing expedition until you find a result you like.

There are methods to account for follow-up experiments. The Bonferroni correction [1], for instance, requires you to tighten your significance threshold (divide it by the number of tests) with each new test.

[1] https://en.m.wikipedia.org/wiki/Bonferroni_correction
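A minimal sketch of the correction described above (function names are illustrative): with m tests at family-wise level alpha, each individual test is run against the threshold alpha/m.

```python
def bonferroni_threshold(alpha, num_tests):
    """Per-test significance threshold under the Bonferroni correction."""
    return alpha / num_tests

def significant(p_values, alpha=0.05):
    """Return which p-values survive the correction."""
    threshold = bonferroni_threshold(alpha, len(p_values))
    return [p <= threshold for p in p_values]

# With 4 tests at alpha=0.05, only p-values <= 0.0125 pass.
print(significant([0.004, 0.02, 0.03, 0.6]))  # [True, False, False, False]
```

This guarantees the probability of any false positive across the family stays at or below alpha, regardless of how the tests relate to each other.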


It's harder than you might think to control for multiple comparisons. The Bonferroni correction assumes that each experiment is independent, and so penalises correlated experiments unnecessarily harshly.

On the other hand, other tests typically require the researcher to make explicit assumptions on the correlation structure of the experiments despite the fact that it is not directly observable.


You are probably thinking of Sidak correction when you state independence is needed. Bonferroni correction does not need independence. You are absolutely right about Bonferroni being a severely conservative correction though -- at least the 'first order' one that uses only the first term of the Bonferroni inequality. One can take more terms to be less conservative but those aren't as easy to apply as you need to know the joint distributions over larger and larger tuples of events.

Another, more recent approach to 'exploratory' yet statistically valid analysis is to exploit differential privacy and dithering.


You can also split the dataset into two parts. Use the first part to form a hypothesis. Register it. Then use the second part to confirm or refute it.
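The split described above can be sketched like this (a minimal illustration; the function name and seed are assumptions). The key discipline is that the split happens before any hypotheses are examined, and the confirmation half stays untouched until the registered test is run.

```python
import random

def split_dataset(records, explore_fraction=0.5, seed=0):
    """Randomly partition records into (exploration, confirmation) halves."""
    rng = random.Random(seed)
    shuffled = records[:]
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * explore_fraction)
    return shuffled[:cut], shuffled[cut:]

explore, confirm = split_dataset(list(range(100)))
# Form and register hypotheses using `explore` only;
# then run the registered test exactly once on `confirm`.
print(len(explore), len(confirm))  # 50 50
```

Randomising (rather than, say, taking the first half) matters because the data may be ordered in a way that correlates with the effect you end up studying.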


>This is 'data mining' right?

That would be data mining done wrong. It's perfectly fine to look at data to generate new hypotheses. But you should not use the same data to confirm the hypothesis it provoked. Either use fresh data, or make sure you still ensure correctness if you are reusing the data.




