The Parable Of The Fallen Pendulum—Part 2
Previously: Some physics 101 students calculate that a certain pendulum will have a period of approximately 3.6 seconds. Instead, when they run the experiment, the stand holding the pendulum tips over and the whole thing falls on the floor.
The students, being diligent Bayesians, argue that this is strong evidence against Newtonian mechanics, and the professor’s attempts to rationalize the results in hindsight are just that: rationalization in hindsight. What say the professor?
“Hold on now,” the professor answers, “‘Newtonian mechanics’ isn’t just some monolithic magical black box. When predicting a period of approximately 3.6 seconds, you used a wide variety of laws and assumptions and approximations, and then did some math to derive the actual prediction. That prediction was apparently incorrect. But at which specific point in the process did the failure occur?
For instance:
Were there forces on the pendulum weight not included in the free body diagram?
Did the geometry of the pendulum not match the diagrams?
Did the acceleration due to gravity turn out to not be 9.8 m/s^2 toward the ground?
Was the acceleration of the pendulum’s weight times its mass not always equal to the sum of forces acting on it?
Was the string not straight, or its upper endpoint not fixed?
Did our solution of the differential equations governing the system somehow not match the observed trajectory, despite the equations themselves being correct, or were the equations wrong?
Was some deeper assumption wrong, like that the pendulum weight has a well-defined position at each time?
… etc”
The students exchange glances, then smile. “Now those sound like empirically-checkable questions!” they exclaim. The students break into smaller groups, and rush off to check.
Soon, they begin to report back.
“After replicating the setup, we were unable to identify any significant additional forces acting on the pendulum weight while it was hanging or falling. However, once on the floor there was an upward force acting on the pendulum weight from the floor, as well as significant friction with the floor. It was tricky to isolate the relevant forces without relying on acceleration as a proxy, but we came up with a clever - ” … at this point the group is drowned out by another.
“On review of the video, we found that the acceleration of the pendulum’s weight times its mass was indeed always equal to the sum of forces acting on it, to within reasonable error margins, using the forces estimated by the other group. Furthermore, we indeed found that acceleration due to gravity was consistently approximately 9.8 m/s^2 toward the ground, after accounting for the other forces,” says the second group to report.
Another arrives: “Review of the video and computational reconstruction of the 3D arrangement shows that, while the geometry did basically match the diagrams initially, it failed dramatically later on in the experiment. In particular, the string did not remain straight, and its upper endpoint moved dramatically.”
Another: “We have numerically verified the solution to the original differential equations. The error was not in the math; the original equations must have been wrong.”
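The kind of check this group describes can be sketched in a few lines. This is a minimal sketch, not the students' actual procedure: the story never gives the pendulum's length, so `LENGTH` is back-solved from the 3.6 second prediction, and the release angles, step size, and function names are all assumptions made for illustration. It integrates the idealized pendulum equation numerically and compares the resulting period against the small-angle formula.

```python
import math

G = 9.8               # m/s^2, the value quoted in the story
TARGET_PERIOD = 3.6   # seconds, the students' headline prediction

# Assumption: the story never states the length, so back-solve it from
# the small-angle formula T = 2*pi*sqrt(L/g).
LENGTH = G * (TARGET_PERIOD / (2 * math.pi)) ** 2


def small_angle_period(length, g=G):
    """Closed-form period of an ideal pendulum for small swings."""
    return 2 * math.pi * math.sqrt(length / g)


def simulated_period(length, theta0, g=G, dt=1e-4):
    """Integrate theta'' = -(g/L) * sin(theta) with RK4, released from rest.

    By symmetry, the time to first reach theta = 0 is a quarter period.
    """
    def accel(theta):
        return -(g / length) * math.sin(theta)

    theta, omega, t = theta0, 0.0, 0.0
    while True:
        # One classic RK4 step for the system theta' = omega, omega' = accel(theta).
        k1t, k1w = omega, accel(theta)
        k2t, k2w = omega + 0.5 * dt * k1w, accel(theta + 0.5 * dt * k1t)
        k3t, k3w = omega + 0.5 * dt * k2w, accel(theta + 0.5 * dt * k2t)
        k4t, k4w = omega + dt * k3w, accel(theta + dt * k3t)
        new_theta = theta + dt * (k1t + 2 * k2t + 2 * k3t + k4t) / 6
        new_omega = omega + dt * (k1w + 2 * k2w + 2 * k3w + k4w) / 6
        if new_theta <= 0.0:
            # Linearly interpolate the zero crossing inside this step.
            frac = theta / (theta - new_theta)
            return 4 * (t + frac * dt)
        theta, omega, t = new_theta, new_omega, t + dt


if __name__ == "__main__":
    print("small-angle formula:", small_angle_period(LENGTH))
    print("simulated, 0.05 rad release:", simulated_period(LENGTH, 0.05))
    print("simulated, 1.0 rad release:", simulated_period(LENGTH, 1.0))
```

For a small release angle the simulated period should agree closely with the formula, which is the sense in which "the error was not in the math". Note what the check cannot see: it only confirms that the solution matches the idealized equations, and says nothing about whether a fixed pivot and a taut string were the right equations for a stand that tips over.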
Another: “On review of the video, qualitative assumptions such as the pendulum being in a well-defined position at each time look basically correct, at least to precision sufficient for this experiment. Though admittedly unknown unknowns are always hard to rule out.” [1]
A few other groups report, and then everyone regathers.
“Ok, we have a lot more data now,” says the professor, “what new things do we notice?”
“Well,” says one student, “at least some parts of Newtonian mechanics held up pretty well. The whole F = ma thing worked, and the force due to gravity was basically as claimed.”
“And notably, the parts which did not hold up as well are parts which don’t generalize as directly to other systems,” says another student.
Another: “Expanding on that: there’s an underspecified step in the use of Newtonian mechanics where we need to figure out what geometry to use, and what the relevant forces are on each body. Our deeper experimental investigation really highlighted that underdetermination, because the underspecified places are where the problems were: the string’s upper endpoint moved, the string didn’t stay straight, there were forces from the floor. All of those were things which we could, in principle, include in the model while still using basically-standard Newtonian mechanics.
On the other hand, that also emphasizes the incompleteness of Newtonian mechanics as a model: it doesn’t fully specify how to figure out the geometry and forces for any particular physical setup. Which does call its predictive power into question somewhat—though at least some parts, like F = ma, made predictions which replicated just fine.”
“But,” another student chimes in, “Newtonian mechanics isn’t completely silent about how to specify geometry and forces for any particular physical setup. We’re supposed to draw free-body diagrams showing which things interact with which other things nearby. And certain common physical components are supposed to exert standard forces—like springs, or friction, or normal force from the ground, or gravity. So while there is some underspecification, there aren’t arbitrary degrees of freedom there. In other words, there should be some imaginable behaviors which aren’t consistent with any Newtonian mechanics-based model…”
“Sounds like Bell’s Theorem?” says another student.
“… don’t get started on that, we’ll be here all day,” replies a TA.
“Anyway,” says the professor, “one generalizable takeaway from all this is that Newtonian mechanics isn’t a monolithic black box. Like any practically-useful scientific theory, it has ‘gears’ - individual pieces which we compose in order to apply the theory to specific physical systems. Part of what makes gears useful is that, when a prediction is wrong (as inevitably happens all the time), we can go look at a whole bunch of details from our experiment, and then back out which specific gears were correct and which weren’t.”
“I buy that it worked here, but that’s starting to sound suspiciously like hindsight bias again,” replies a student. “We look at a bunch of details from the experiment, and only then decide which sub-predictions were correct and which weren’t? Sounds fishy.”
The professor: “In practice, predictions are hard to get right, even when the underlying theory is basically correct. Even a problem as simple as rolling a steel ball down a Hot Wheels ramp and getting it to land in a cup is surprisingly hard to get on the first try. So think of it like this: in order for precise predictions to work well in practice, they basically always need to come with an implicit disclaimer saying ‘… and if this is wrong, then one or a few of the input assumptions are wrong, but probably not most or all of them, and I’m more confident in <some> and less confident in <others>’. With that implicit fallback built in, the theory still makes falsifiable predictions even when the headline claim is wrong. Indeed, the theory makes additional falsifiable predictions even when the headline prediction is right. For example, if we’d found a period of 3.6 seconds for the pendulum, but follow-up investigations found that the string wasn’t taut at all, that sure would be a failed prediction of the theory.
Applied to our pendulum: the implicit prediction would be that the period would most probably be 3.6 seconds, but if not, then one or a few of the input assumptions were violated while the theory was otherwise basically correct. And among those input assumptions, F = ma was relatively unlikely to be violated, while failures of geometric assumptions or unaccounted-for forces were more likely. And of course in practice we don’t list all the relevant implicit assumptions, because that rabbit hole runs pretty deep.”
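The professor's implicit disclaimer can be made concrete with a toy Bayesian calculation. This is only a sketch under loudly stated assumptions: the prior numbers are invented for illustration and do not come from the story, the assumptions are treated as independent, and the headline prediction is taken to hold exactly when every assumption holds. The names `PRIOR_HOLDS` and `posterior_failed` are hypothetical.

```python
from math import prod

# Hypothetical priors that each input assumption holds. The numbers are
# illustrative only: high confidence in F = ma and g, less in the setup.
PRIOR_HOLDS = {
    "F = ma": 0.999,
    "g is about 9.8 m/s^2 downward": 0.995,
    "math was done correctly": 0.95,
    "no unmodeled forces on the weight": 0.90,
    "geometry matches the diagram": 0.85,
}


def posterior_failed(prior_holds):
    """P(assumption failed | headline prediction failed), per assumption.

    Simplifying assumptions: the input assumptions are independent, and
    the headline prediction holds if and only if all of them hold. A failed
    assumption then always implies a failed headline, so Bayes' rule
    reduces to P(failed_i) / P(headline failed).
    """
    p_headline_fails = 1.0 - prod(prior_holds.values())
    return {name: (1.0 - p) / p_headline_fails for name, p in prior_holds.items()}


if __name__ == "__main__":
    posterior = posterior_failed(PRIOR_HOLDS)
    for name, p in sorted(posterior.items(), key=lambda kv: -kv[1]):
        print(f"{name}: {p:.3f}")
    # The sum of the posteriors is the expected number of failed assumptions.
    print("expected number of failed assumptions:", round(sum(posterior.values()), 3))
```

With priors shaped like these, a failed headline prediction shifts most of the suspicion onto geometry and unmodeled forces and very little onto F = ma, and the expected number of failed assumptions stays close to one. The ranking is written down before looking at the follow-up data, which is what keeps the later diagnosis from being pure hindsight.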
[1] Note that one of the ways in which this story most heavily diverges from scientific practice is that all of the follow-up experiments did get the answers we intuitively expect, and did not diverge from both the original model and the original experiment in still further ways which would themselves require recursive examination to uncover.