As I understand it, the distinction is that “Goalcraft” is the problem of deciding what we want, while Outer Alignment is the problem of encoding that goal into the reward function of a Reinforcement Learning process.So they’re at different abstraction levels, or steps in the process.
As I understand it, the distinction is that “Goalcraft” is the problem of deciding what we want, while Outer Alignment is the problem of encoding that goal into the reward function of a Reinforcement Learning process.So they’re at different abstraction levels, or steps in the process.