I Defy the Data!
One of the great weaknesses of Science is this mistaken idea that if an experiment contradicts the dominant theory, we should throw out the theory instead of the experiment.
Experiments can go awry. They can contain design flaws. They can be deliberately corrupted. They can be unconsciously corrupted. They can be selectively reported. Most of all, 1 time in 20 they can be “statistically significant” by sheer coincidence, and there are a lot of experiments out there.
Unfortunately, Science has this notion that you can never go
against an honestly obtained experimental result. So, when someone
obtains an experimental result that contradicts the standard model,
researchers are faced with a dilemma for resolving their cognitive
dissonance: they either have to immediately throw away the standard model, or else attack the experiment—accuse the researchers of dishonesty, or flawed design, or conflict of interest…
Someone once presented me with a new study on the effects of intercessory prayer (that is, people praying for patients who are not told about the prayer), which showed 50% of the prayed-for patients achieving success at in-vitro fertilization, versus 25% of the control group. I liked this claim. It had a nice large effect size. Claims of blatant impossible effects are much more pleasant to deal with than claims of small impossible effects that are “statistically significant”.
So I cheerfully said: “I defy the data.”
My original phrasing was actually “I deny the data”. Nonetheless I said it outright, without apology, and with deliberate insolence. I am keeping my theory; your experiment is wrong.
If an experimental result contradicts the Standard Model, this is an important fact. It needs to be openly acknowledged. An experiment that makes traditionalists want to discard the data—or even an experiment that makes traditionalists very skeptical of the data—should be a high priority for replication. An experiment worth defying should command attention!
But it is not socially acceptable to say, “The hell with your experimental falsification, I’m keeping my theory.” So the data has to be defied covertly—by character assassination of the researchers, by sly innuendos, by dire hints of controversy. The data has to be dismissed, excused away, swept under a rug, silently into the dark, because you can’t admit you’re defying the data. This is not a good way of focusing attention on an anomalous result. This is not a good way to ensure funding for replication attempts.
It would be much better if science had a standard procedure for saying, “I defy the data!” It would be clearly understood that this was a bold act, and someone else in the audience might stand up and say, “Wait a minute, is that data really worth defying?” If a major figure in the field said “I defy the data!”, this would be sufficient justification on grant proposals for why the result urgently needed replication. Scientists could say, “I’m holding my breath, waiting for replication,” rather than having to take sides immediately in the character-assassination controversy.
Maybe you could even get the media to report that the experiment has been “published but defied”. Then the replication, or failure to replicate, would be news. The replicators could get their names in the newspaper, and the negative result could be published in a major journal. If you want replications done, you’ll have to offer some incentive.
I would also suggest that when an experiment is defied, the replication must pre-declare a minimum effect size, and attain significance of p<0.01. In extreme cases where claims have been made and shot down before, p<0.001.
Oh, and the prayer study? Soon enough we heard that it had been retracted and was probably fraudulent. But I didn’t say fraud. I didn’t speculate on how the results might have been obtained. That would have been dismissive. I just stuck my neck out, and nakedly, boldly, without excuses, defied the data.
Addendum: I should have spelled this out explicitly: You can defy the data on one experiment. You can’t defy the data on multiple experiments. At that point you either have to relinquish the theory or dismiss the data—point to a design flaw, or refer to an even larger body of experiments that failed to replicate the result, or accuse the researchers of a deliberate hoax, et cetera. But you should not turn around and argue that the theory and the experiment are actually compatible. Why didn’t you think of that before you defied the data? Defying the data admits that the data is not compatible with your theory; it sticks your neck way out, so your head can be easily chopped off.
- Update Yourself Incrementally by 14 Aug 2007 14:56 UTC; 115 points) (
- Shut up and do the impossible! by 8 Oct 2008 21:24 UTC; 111 points) (
- I attempted the AI Box Experiment again! (And won—Twice!) by 5 Sep 2013 4:49 UTC; 78 points) (
- [Link] “Fewer than X% of Americans know Y” by 10 Oct 2012 16:59 UTC; 70 points) (
- Covid 8/5: Much Ado About Nothing by 5 Aug 2021 14:00 UTC; 63 points) (
- Conjunction Controversy (Or, How They Nail It Down) by 20 Sep 2007 2:41 UTC; 59 points) (
- Reality is weirdly normal by 25 Aug 2013 19:29 UTC; 55 points) (
- Psychic Powers by 12 Sep 2008 19:28 UTC; 47 points) (
- Logical or Connectionist AI? by 17 Nov 2008 8:03 UTC; 46 points) (
- Picture Frames, Window Frames and Frameworks by 3 Nov 2019 22:09 UTC; 35 points) (
- 23 Dec 2022 17:38 UTC; 28 points) 's comment on Read The Sequences by (EA Forum;
- 27 Aug 2023 5:24 UTC; 24 points) 's comment on Eliezer Yudkowsky Is Frequently, Confidently, Egregiously Wrong by (EA Forum;
- Large Effects Can’t Prove Small Causes by 6 Oct 2020 17:34 UTC; 23 points) (
- Help us Optimize the Contents of the Sequences eBook by 19 Sep 2013 4:31 UTC; 18 points) (
- Update Then Forget by 17 Jan 2013 18:36 UTC; 15 points) (
- [SEQ RERUN] Absence of Evidence Is Evidence of Absence by 17 Jul 2011 4:08 UTC; 12 points) (
- 17 Jul 2022 9:08 UTC; 11 points) 's comment on A summary of every “Highlights from the Sequences” post by (EA Forum;
- 21 May 2022 0:08 UTC; 11 points) 's comment on Is AI Progress Impossible To Predict? by (
- 23 Dec 2011 3:03 UTC; 11 points) 's comment on The rationalist’s checklist by (
- 3 Oct 2011 19:55 UTC; 10 points) 's comment on Rationality Quotes October 2011 by (
- 12 Apr 2011 11:20 UTC; 10 points) 's comment on On Debates with Trolls by (
- 5 Nov 2010 17:22 UTC; 10 points) 's comment on Goertzel on Psi in H+ Magazine by (
- [SEQ RERUN] I Defy the Data! by 15 Jul 2011 23:31 UTC; 9 points) (
- 8 Sep 2012 17:53 UTC; 7 points) 's comment on Open Thread, September 1-15, 2012 by (
- Link: “When Science Goes Psychic” by 8 Jan 2011 9:00 UTC; 7 points) (
- 18 Oct 2011 3:33 UTC; 6 points) 's comment on Particles may not have broken light speed limit by (
- 23 Sep 2011 10:30 UTC; 6 points) 's comment on Gender differences in spatial reasoning appear to be nurture by (
- 19 Feb 2013 5:51 UTC; 5 points) 's comment on Falsifiable and non-Falsifiable Ideas by (
- 10 Oct 2009 8:35 UTC; 5 points) 's comment on I’m Not Saying People Are Stupid by (
- 9 Apr 2013 15:26 UTC; 4 points) 's comment on Problems in Education by (
- 27 Jul 2011 17:19 UTC; 4 points) 's comment on A potentially great improvement to minimum wage laws to handle both economic efficiency as well as poverty concerns by (
- 27 Oct 2010 22:18 UTC; 4 points) 's comment on What hardcore singularity believers should consider doing by (
- 6 Sep 2013 16:16 UTC; 3 points) 's comment on I attempted the AI Box Experiment again! (And won—Twice!) by (
- 13 Oct 2010 20:25 UTC; 3 points) 's comment on Rationality quotes: October 2010 by (
- 31 Oct 2015 11:07 UTC; 2 points) 's comment on At what cost, carnivory? by (EA Forum;
- If rationality is purely winning there is a minimal shared art by 19 Jan 2017 21:39 UTC; 2 points) (
- 13 Oct 2010 16:15 UTC; 2 points) 's comment on Rationality quotes: October 2010 by (
- 15 Jan 2008 9:37 UTC; 1 point) 's comment on Trust in Math by (
- 22 Oct 2009 19:49 UTC; 1 point) 's comment on Rationality Quotes: October 2009 by (
- 25 Aug 2010 17:08 UTC; 1 point) 's comment on Taking Ideas Seriously by (
- 21 May 2011 17:49 UTC; 1 point) 's comment on My Childhood Death Spiral by (
- 23 Aug 2023 13:34 UTC; 0 points) 's comment on trevor1′s Quick takes by (EA Forum;
- 9 Oct 2009 21:35 UTC; 0 points) 's comment on I’m Not Saying People Are Stupid by (
- 15 Mar 2017 14:13 UTC; 0 points) 's comment on Sufficiently sincere confirmation bias is indistinguishable from science by (
- 24 Jul 2009 9:51 UTC; -1 points) 's comment on It’s all in your head-land by (
- 19 Feb 2012 20:58 UTC; -1 points) 's comment on Brain shrinkage in humans over past ~20 000 years—what did we lose? by (
This is well-recognized issue in philosophy of science, known as the (Pierre) Duhem thesis, that any test of a theory is actually a test of a a conjunction of the theory and the theory of the experiment, and that any resulting disconfirming data might have disconfirmed the theory, the theory of the experiment, or both. The picture turns out to be even more complex, because more than just the theory and the theory of the experiment are tested, but so are background, often nonconscious assumptions, and irrelevant assumptions about statistics. The most detailed technical analysis of theory testing was developed by P. Suppes (1962). Models of data, in Logic, methodology, and philosophy of science (E. Nagel et al., eds., Stanford UP. The proud rejection of data in favor of theory is well attested in history of science, and formed a key part of the neo-Popperian philosophy of Imre Lakatos, e.g. (1970) Criticism and the methology of research programs, in Lakatos & Musgrave (Eds.) Criticism and the growth of knowledge, Cambridge, UP.
Leahey, since I’m not familiar with that line of philosophy, I would ask whether they had the concept of defying data as a bold and insolent act, one which calls for further attention, and which explicitly confesses that the theory and the experimental result are not compatible (so that replication will indeed destroy the theory). This is what I had thought was my innovation. I would not characterize the rejection of data in favor of theory as proud, but rather as sticking your head way out so that someone can easily chop it off.
The post reminds me of one of my favorite scenes from a Hitchcock movie (which I transcribed a while back, knowing it would be useful at some point in the future).
In Hitchcock’s 1938 movie The Lady Vanishes, the heroine Iris Henderson is traveling on a train in the same compartment as an old lady. When the old lady disappears (it later turns out to be connected with a spy ring), Iris scours the train in search of her. She meets a German doctor named Dr. Hartz, who accompanies her on her search for the old lady. But when everyone denies having seen the old lady, Dr. Hartz theorizes that some psychological hallucination must have caused Iris to imagine the old lady’s existence.
But then, finally, Iris finds one woman who admits to having seen the old lady. Iris then confronts Dr. Hartz with this new witness:
“You’ll have to think of a fresh theory now, Doctor.”
“It is not necessary,” Dr. Hartz responds. “My theory was a perfectly good one. The facts were misleading.”
I love this idea. It reminds me of a bit of management advice I once heard: knives in the chest, not in the back. For better results, get debate out in the open.
Stuart, the doctor’s latter two statements are interesting, but it would seem his first statement is false. He does need to come up with a fresh theory.
Adoption of the other existing theory (that of Iris) suffices. Devising a new one really is unnecessary.
I’ve read that physics journals use a 0.001% standard for statistical significance. Maybe, in the case of “alternative medicine” or “parapsychology” experiments that must rely on unknown physics to work, we should insist on the same standard? ;)
Eliezer, I think you’re mistaken on the facts—most theories take a lot of experimental anomaly before they get thrown out. For example, Kuhn, in Structure of Scientific Revolutions, for example (which I think is a much better work of history than philosophy, but anyway...), gives a marvelous description of “normal science” as just that—tinkering with the dominant paradigm, fitting new results into it bit by bit, etc.
My experience in engineering fields is quite a bit different. Strange results happened frequently and data was ignored when the circumstances warranted it. Occasionally strange data lead the reseracher to an “A-Ha!” moment but most often the cause was likely due to “operator error” or poor experimental design
The engineers I’ve worked with were quite savvy about the limitations of their experimental methods and these issues were debated vigorously and openly.
Apparently, things are a bit different in pure science. Or maybe I was just oblivious to ‘character assasination’ that was happening.
There is more discussion of this post here as part of the Rerunning the Sequences series.
There’s a particular field in which you have to do this all the time: Parapsychology, in which people with PhD credentials somehow manage to coax absurd results from apparently impeccable experimental procedures. Daryl Bem is a good example, as is Diane Hennesy Powell.
I can even understand why they think we are the ones being unscientific: I just showed you data! What more do you want? Well, to be honest, we want data that doesn’t contradict 200 years of past research in every other scientific field. We know there must be something wrong with your experiment, even if we can pin down what it is (often this comes to light later!); because the result that people can see the future or communicate with the dead just contradicts way too much of what we already knew for us to take it seriously. It’s a lot like publishing a proof that 2=1.
Murray Gell-Mann:
(Also, the people in the superluminal neutrino fiasco at OPERA did all but defy their own data in the last paragraph of their original arXiv paper.)