>One complicating factor is that agents change their opinions about these matters over time.
Yep! This is one of the major issues, and one that I’ll try to model in an upcoming post. The whole issue of rigged and influenceable learning processes is connected with trying to learn the preferences of such an agent.
>Or it may be fundamentally indeterminate whether some drives are values or biases.
I think it’s fundamentally indeterminate in principle, but we can make some good judgements in practice.