Gurkenglas comments on Approval-directed agents

Gurkenglas 24 Nov 2018 14:56 UTC
2 points

We then loop over each action a and take the action with the highest expected answer.

Wasn’t the whole point that we want to avoid such goal-direction?
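
For concreteness, the quoted step can be sketched as a plain loop over candidate actions. This is only an illustrative sketch, not the post's implementation: `choose_action`, `expected_answer`, and the toy ratings are hypothetical names and made-up numbers.

```python
from typing import Callable, Iterable, TypeVar

A = TypeVar("A")


def choose_action(actions: Iterable[A], expected_answer: Callable[[A], float]) -> A:
    """Loop over each action and return the one with the highest expected answer.

    `expected_answer` is a hypothetical stand-in for whatever estimator
    predicts the rating an action would receive.
    """
    found = False
    best_action = None
    best_score = float("-inf")
    for a in actions:
        score = expected_answer(a)
        # Keep the first action seen, then any action that scores strictly higher.
        if not found or score > best_score:
            best_action, best_score, found = a, score, True
    if not found:
        raise ValueError("no actions to choose from")
    return best_action


# Toy ratings, invented purely for illustration.
toy_ratings = {"wait": 0.2, "ask": 0.9, "act": 0.5}
chosen = choose_action(toy_ratings, toy_ratings.get)
```

Written this way, the selection is a one-step argmax over the estimated rating of each immediate action, which is the part the question above is pointing at.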