it seems like the details of humans’ desire for their children’s success, or their fear of death, don’t seem to match well with the theory that all human desires come from RL on intrinsic reward. I guess you probably think they do?
That’s the foundational assumption of the shard theory that this sequence is introducing, yes. Here’s the draft of a fuller overview that goes into some detail as to how that’s supposed to work. (Uh, to avoid confusion: I’m not affiliated with the theory. Just spreading information.)
I would disagree that it is an assumption. That same draft talks about the outsized role of self-supervised learning on determining particular ordering and kinds of concepts that humans desires latch onto. Learning from reinforcement is a core component in value formation (under shard theory), but not the only one.
That’s the foundational assumption of the shard theory that this sequence is introducing, yes. Here’s the draft of a fuller overview that goes into some detail as to how that’s supposed to work. (Uh, to avoid confusion: I’m not affiliated with the theory. Just spreading information.)
I would disagree that it is an assumption. That same draft talks about the outsized role of self-supervised learning on determining particular ordering and kinds of concepts that humans desires latch onto. Learning from reinforcement is a core component in value formation (under shard theory), but not the only one.