tailcalled comments on johnswentworth’s Shortform

tailcalled Dec 30, 2023, 12:50 PM
4 points
0
More precisely, “do(no press)” means something like “you construct an alternate model of physics where there’s an unstoppable force pushing back against any attempt to push the button”, right? As in, if someone presses the button then it will “mysteriously” seem to be stuck and unpressable. And then subagent 2 believes we live in that world? And “do(press)” presumably means something like “you construct an alternate model of the universe where some mysterious force has suddenly pressed the button”.
Seems like they would immediately want to try to press the button to settle their disagreement? If it can be pressed, then that disprove the “do(no press)” model, which subagent 2 has fully committed. to.
- johnswentworth Dec 30, 2023, 4:51 PM
  3 points
  0
  Parent
  Correct reasoning, but not quite the right notion of do(). “do(no press)” would mean that the button just acts like a completely normal button governed by completely normal physics, right up until the official time at which the button state is to be recorded for the official button-press random variable. And at that exact moment, the button magically jumps into one particular state (either pressed or not-pressed), in a way which is not-at-all downstream of any usual physics (i.e. doesn’t involve any balancing of previously-present forces or anything like that).
  One way to see that the do() operator has to do something-like-this is that, if there’s a variable in a causal model which has been do()-operated to disconnect all parents (but still has some entropy), then the only way to gain evidence about the state of that variable is to look at things causally downstream of it, not things upstream of it.
  - tailcalled 30 Dec 2023 22:04 UTC
    4 points
    0
    Parent
    I think we’re not disagreeing on the meaning of do (just slightly different state of explanation), I just hadn’t realized the extent to which you intended to rely on there being “Two timesteps”.
    (I just meant the forces as a way of describing the jump to a specific position. That is, “mysterious forces” in contrast to a perfectly ordinary explanation for why it went to a position, such as “a guard stabs anybody who tries to press the button”, rather than in contrast to “the button just magically stays place”.)
    I now think the biggest flaw in your idea is that it literally cannot generalize to anything that doesn’t involve two timesteps.