No, wait. Thinking about this some more, I realize I’m being goofy.
You offered me a series of bets about “twice as good as the one you’re at right now: 2000 utils” vs “a point that sucked as much as that post-stroke month”. I interpreted that as “I have another stroke” vs. “things suddenly get as much better than they are now as now is better than then” and evaluated those bets based on that interpretation.
But that was a false interpretation, and my results are internally inconsistent. If how-things-were-then is −64.5K, then 2000 is not as much better than how things are now as now is better than then… the improvement is merely 1/65th the size of that gap. In which case I don’t accept that bet, after all… a 1% chance of another stroke vs a 99% chance of a 1/65th improvement in my life is not nearly as compelling.
More generally, I accepted the initial statement that the state we labeled 2000 is “twice as good as” the state we labeled 1000, because that seemed to make sense when we were talking about numbers. But now that I’m trying to actually map those numbers to something, it’s less clear to me that it makes sense.
I mean, it follows that my stroke was “-64 times worse” than how things are now, and… well, what does that even mean?
Sorry… I’m not trying to be a pedant here, I’m just trying to make sure I actually understand what we’re talking about, and it’s pretty clear that I don’t.
Yeah, the notion of “twice as good as things are now” doesn’t actually make sense, because utility is only defined up to positive affine transformations. (That is, if you decided to raise your utility for every outcome by 1000, or to multiply them all by the same positive constant, you’d make the same decisions afterward as you did before; it’s the relative distances that matter, not the scaling or the place you call 0. It’s rather like the Fahrenheit and Celsius scales for temperature.)
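To make the affine point concrete, here is a minimal sketch in Python. The outcome labels, the 99%/1% bet, and the two rescalings (a shift by 1000 and a Celsius-to-Fahrenheit-style map) are illustrative assumptions, not something worked out in the thread; the only claim is that the preferred option does not change when every utility is rescaled the same way.

```python
# Illustrative utilities on the scale used in this thread (labels are hypothetical).
utilities = {"now": 1000.0, "awesome": 2000.0, "stroke": -64500.0}

# Each option is a lottery: a list of (probability, outcome) pairs.
options = {
    "stay_put": [(1.0, "now")],
    "take_bet": [(0.99, "awesome"), (0.01, "stroke")],
}

def expected_utility(lottery, u):
    """Probability-weighted sum of the utilities of a lottery's outcomes."""
    return sum(p * u[outcome] for p, outcome in lottery)

def best_option(options, u):
    """Name of the option with the highest expected utility under u."""
    return max(options, key=lambda name: expected_utility(options[name], u))

def affine(u, a, b):
    """Apply u -> a*u + b to every outcome; a must be positive."""
    if a <= 0:
        raise ValueError("a must be positive to preserve preferences")
    return {k: a * v + b for k, v in u.items()}

original_choice = best_option(options, utilities)
shifted_choice = best_option(options, affine(utilities, 1.0, 1000.0))   # add 1000 to everything
rescaled_choice = best_option(options, affine(utilities, 1.8, 32.0))    # Celsius-to-Fahrenheit style

# All three scales pick the same option.
print(original_choice, shifted_choice, rescaled_choice)
```

What the rescalings do change is any statement of the form "X is twice as good as Y", since that depends on where 0 sits.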
But anyway, you can figure out the relative distances in the same way; call what you have right now 1000, imagine some particular awesome scenario and call that 2000, and then figure out the utility of having another stroke, relative to that. For any plausible scenario (excluding things that could only happen post-Singularity), you should wind up again with an extremely negative (but not ridiculous) number for a stroke.
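One way to read "figure out the utility of having another stroke, relative to that" is as an indifference question: at what probability of a stroke would you be indifferent between staying put and taking the gamble on the awesome scenario? That reading is an assumption here, since the thread does not spell out the procedure, and the function name and the example probability below are hypothetical. A rough sketch:

```python
def utility_from_indifference(u_sure, u_good, p_bad):
    """Solve u_sure = p_bad * u_bad + (1 - p_bad) * u_good for u_bad.

    u_sure: utility of the status quo (1000 in the thread)
    u_good: utility of the awesome scenario (2000 in the thread)
    p_bad:  probability of the bad outcome at which you are indifferent
            between the status quo and the gamble (assumed, not measured)
    """
    if not 0.0 < p_bad < 1.0:
        raise ValueError("p_bad must be strictly between 0 and 1")
    return (u_sure - (1.0 - p_bad) * u_good) / p_bad

# Hypothetical indifference point: roughly a 1.5% chance of another stroke.
p = 1000.0 / 66500.0
u_stroke = utility_from_indifference(1000.0, 2000.0, p)
print(round(u_stroke))
```

With that particular indifference probability, the stroke lands at about −64.5K on the 1000/2000 scale; a different answer to the indifference question gives a different number.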
On the other hand, conscious introspection is a very poor tool for figuring out our relative utilities (to the degree that our decisions can be said to flow from a utility function at all!), because of signaling reasons in particular.
Certainly. Or, really, much of anything else. Is there a better tool available in this case?
Not that I know of. Just a warning not to be too certain of the results you get from this algorithm: your extrapolations to actual decisions may be far from what you’d actually do.