This is the question for people who didn’t read my latest post. Please, try to answer it yourself without spoiling the solution, and then post it in the comments with your reasoning and whether you consider yourself a halfer or a thirder in regular Sleeping Beauty problem.
Technicolor Sleeping Beauty experiment goes mostly as regular Sleeping Beauty experiment:
The participant is put to sleep on Sunday. Then the coin is tossed. If it’s Heads the participant will be awakened on Monday. If it’s Tails the participant will be awaken both on Monday and on Tuesday, and between these awakenings their memories will be erased. Therefore, while awakening on Tuesday the participants doesn’t remember whether they awakened on Monday or not, and they can never be sure on which day (Monday or Tuesday) their current awakening is happening.
The only difference is that in Technicolor, the walls of the room that the participant awakens in changle their color every day from red to blue and vice versa. The initial color is determined randomly: 1⁄2 that it’s red and 1⁄2 that it’s blue.
While awakened during the experiment you are asked whether you would like to take a specific once per experiment bet:
You may bet that the coin is Tails at 2:3 odds. That is: if you bet 300$ and the coin is indeed Tails you win 200$. The bet will be resolved on Wednesday, after the experiment has ended.
You may take this bet only once per experiment and one agreement is enough. If you have two awakenings in this experiment and agreed on any of them—the bet counts as taken. If you agreed on both of them, the bet counts as taken only once.
For reference, in regular Sleeping Beauty problem utility neutral betting odds for once per experiment bet are 1:1, regardless of whether you are a halfer or a thirder, so taking such bet would be a bad idea.
A probability is an attribute of your model of the world, not an attribute of the world of the world itself. In this case, we’re looking for a world model which cares about the outcome of this particular bet. Specifically, on Sunday, before the experiment starts, the participant wants to choose a policy which produces a good outcome on Wednesday.
On Sunday, the participant determines that the probabilities are
0.5 they will be woken on Monday and offered the bet (value -$300 if they take it)
0.5 they will be woken on Monday and offered the bet, and then woken again on Tuesday and offered the bet. (value +$200 if they take it)
If this were the entire situation, and the participant had to have a single deterministic policy, they should obviously not take the bet (0.5 to accept the bet once and lose $300, 0.5 to accept the bet twice and win $200, so total EV is -$50).
If the participant had access to a source of randomness which would return a random number
r
between 0 and 1 on demand, with the results of that randomness independent between times of getting a random number, they could do better, by saying that they will take the bet ifr <= p
on that roll, for somep
they determined on Sunday. Now there are 6 cases:0.5*(1-p) they will be woken on Monday and see
r > p
(don’t take bet, $0)0.5*p they will be woken on Monday and see
r <= p
(take bet, -$300)0.25*(1-p)*(1-p) they will be woken on Monday and see
r > p
, and then woken again on Tuesday and seer > p
again (don’t take bet either time, $0)0.25*(1-p)*p they will be woken on Monday and see
r > p
, and then woken again on Tuesday and seer <= p
(take bet on Tuesday, +$200)0.25*p*(1-p) they will be woken on Monday and see
r <= p
, and then woken again on Tuesday and seer > p
(take bet on Monday, +$200)0.25*p*p they will be woken on Monday and see
r <= p
, and then woken again on Tuesday and seer <= p
again (take bet on both Monday and Tuesday, only one of which counts, +$200)The degenerate case of
p=1
is the “always take the bet” case above, EV=-$50.p=0
is “never take the bet”, EV=$0. But in between those is a curveIt turns out the maximal EV is obtained when p=0.25, which gives an EV of +$6.25.
If we want to do better than that, we need a way for the participant to ensure that their decision on Tuesday is not only uncorrelated with their decision on Monday, but anticorrelated with their decision on Monday. The only possible source of anticorrelation would be the color of the walls. So the participant can now make a decision on Sunday “with what probability do I take the bet if the walls are red” (
p_r
) and separately “with what probability do I take the bet if the walls are blue” (p_b
). There are now 12 cases to work through (would be 20, but “two wakes, room red Mon, room red Tue” is not possible, nor is “two wakes, room blue Mon, room blue Tue”).Computing the EV for every possible
p_r
andp_b
we see that there are two maxima, one where the participant always bets if in a red room and never bets in a blue room, and the other where the participant always bets if in a blue room and never bets in a red room. Both of these maxima have an EV of +$25.
… what was the philosophical question, again?
I like this question a lot. My base position is “It’s a modeling choice—either one can make appropriate betting choices, so it’s unclear what true probability even means in correlated-existence cases”. Though in private, I’ll admit that 1⁄3 is simpler most of the time, and in public I argue for 1⁄2 fairly often because die-hard thirders seem to be so common.
The difficulty with betting analyses is how to handle the duplicate days, where Beauty’s experience is doubled on tails, but you want that to be irrelevant to the wager (to put focus on the fact that it’s a coin flip, so you have to be indifferent on Wednesday). Saying “you must agree or be put to death” is one way, which leads to a static decision that’s indifferent to heads or tails, which is justified by either halfer or thirder. Saying “bets are off if they disagree” doesn’t work, since only tails can disagree. Saying “Monday takes precedence” kind of obviates the whole excercise.
Your mechanism of “days are ORed together” would normally show the weirdness of the thirder position, because a random choice by Beauty would lead to taking the bet 50% of the time for heads and 75% for tails. But painting the walls gives a coordination mechanism between Monday and Tuesday, without revealing which day it actually is.
I think “bet if the wall is red” works here. If it’s heads (50% of the time, to an outside observer, and to the bet payoff), you’ll take the bet 50% of the time and lose $200. If it’s tails, you’ll take the bet 100% of the time and win $300. 0.5*0.5*-200 + 0.5*300 = +100.
I don’t think it sheds any light on the halfer vs thirder debate—this encourages an outside view, because the payoff is outside and the coordination is outside (which color to pick in any world on any day). But I guess that means it supports my position that it’s a modeling choice rather than a true distinction.