tailcalled comments on The shard theory of human values

tailcalled 9 Sep 2022 14:23 UTC
3 points
0
I agree that you need more than just reinforcement learning.

I’m sympathetic to your broader point, but until somebody says exactly what the rewards (a.k.a. “reinforcement events”) are, I’m withholding judgment.

So in a sense this is what I’m getting at. “This resembles prior ideas which seem flawed; how do you intend on avoiding those flaws?”.