Using a different sort of statistical test than Bem used, they re-analyze Bem’s data and they find that, while the results are positive, they are not positive enough to pass the level of “statistical significance.” They conclude that a somewhat larger sample size would be needed to conclude statistical significance using the test they used.
Err, that’s not what they found. Over half the data was not merely “not positive enough”, but literally negative.
From the blog post:
Err, that’s not what they found. Over half the data was not merely “not positive enough”, but literally negative.