I think you're pointing in the same direction as Steven Byrnes' brain-like-AGI safety: we can learn from how human motivation systems are set up to produce exactly the outcomes you mention. We can run simulations and quantify how stable selected motivation systems are under optimization pressure. We will not build the same motivation systems into AGI, but maybe a subset that is even more stable.
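As a rough illustration of what "quantify stability under optimization pressure" could mean (this is a minimal sketch with made-up names and setup, not the project's actual code): measure how much of the intended motivation is still satisfied once a policy is optimized as hard as possible against an imperfect proxy of that motivation.

```python
# Hypothetical sketch: "stability" as the fraction of true utility retained
# when an action is fully optimized against a noisy proxy of the motivation.
import numpy as np

rng = np.random.default_rng(0)
DIM = 50  # number of world features the policy can push on

def stability_under_pressure(noise, trials=200):
    """Average fraction of true utility retained when the action is optimized
    against a noisy proxy instead of the true motivation weights."""
    retained = []
    for _ in range(trials):
        true_w = rng.normal(size=DIM)
        proxy_w = true_w + noise * rng.normal(size=DIM)
        # With a bounded action norm, the proxy-optimal action is the unit
        # vector along the proxy weights (i.e. maximal optimization pressure).
        proxy_opt = proxy_w / np.linalg.norm(proxy_w)
        true_opt = true_w / np.linalg.norm(true_w)
        retained.append((true_w @ proxy_opt) / (true_w @ true_opt))
    return float(np.mean(retained))

for noise in [0.0, 0.1, 0.3, 1.0, 3.0]:
    print(f"proxy noise {noise:>4}: retained true utility ~ {stability_under_pressure(noise):.2f}")
```

The retained fraction drops as the proxy gets noisier, which is one crude way to compare how gracefully different motivation setups degrade when optimized against.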
Yes, that’s exactly the direction this line of thought is pulling me in! Although perhaps I am less certain we can copy the mechanics of the brain, and more keen on looking at the environments that led to human intelligence developing the way it did, and whether we can do the same with AI.
Agree. The project I'm working on primarily tries to model the attention and reward systems. We don't try to model the brain closely, only the structures that are relevant.