benelliott comments on [SEQ RERUN] Beautiful Probability

benelliott 25 Dec 2011 23:06 UTC
0 points
He does not discard anything that actually happened.

This is the key difference. We are evaluating the effectiveness of the drug by looking at what the drug actually did, not what it could have done.

I can give a much more precise mathematical proof if you want.
- FeepingCreature 26 Dec 2011 1:08 UTC
  0 points
  Parent
  Let’s imagine a scientist did 500 tests. Then he started discarding tests, from the end, until the remaining data supported some hypothesis (or he ran out of tests). Is this to be treated as evidence of the same strength as it would if he had precommitted to only doing that many tests?
  - benelliott 26 Dec 2011 1:18 UTC
    0 points
    Parent
    I may be wrong here because I’m tired, but I think the way the maths comes out is that this would be as strong if he only removed tests from the end, whereas if he removed them from anywhere he chose depending on how they came out it would not be as strong.