As a more general/tangential comment, I’m a bit confused about how “elevate hypothesis to our attention” is supposed to work. I mean, it took some conscious effort to come up with a possible mechanistic story about how an “inner reward optimizer” might arise, so how were we supposed to come up with such a story without paying attention to “inner reward optimizer” in the first place?
Perhaps it’s not that we should literally pay no attention to “inner reward optimizer” until we have a good mechanistic story for it, but more that we are (or were) paying too much attention to it, given that we don’t (didn’t) yet have a good mechanistic story? (But if so, how do we decide how much is too much?)
I think this tangential comment is good; strong-upvote. I was hyperbolic in implying “don’t even raise the reward-optimizer hypothesis to your attention”, and will edit the post accordingly.