I’m not convinced this is actually the appropriate way to interpret most two-boxers. I’ve read papers that say things that sound like this claim but I think the distinction that it generally being gestured at is the distinction I’m making here (with different terminology). I even think we get hints of that with the last sentence of your post where you start to talk about agent’s being rewards for their decision theory rather than their decision.
I’m not convinced this is actually the appropriate way to interpret most two-boxers. I’ve read papers that say things that sound like this claim but I think the distinction that it generally being gestured at is the distinction I’m making here (with different terminology). I even think we get hints of that with the last sentence of your post where you start to talk about agent’s being rewards for their decision theory rather than their decision.