I like the way you are almost able to turn this into a ‘positive’ account (the way generalized objectives are a positive account of myopic goals, but speaking in terms of failure to make certain pareto improvements is not). However, I worry that any goal over stated can be converted to a goal over outputs which amounts to the same thing, by calculating the expected value of the action according to the old goal. Presumably you mean some sufficiently simple action-goal so as to exclude this.
Yeah, I agree. I almost said “simple function of the output,” but I don’t actually think simplicity is the right metric here. It’s more like “a function of the output that doesn’t go through the consequences of said output.”
I like the way you are almost able to turn this into a ‘positive’ account (the way generalized objectives are a positive account of myopic goals, but speaking in terms of failure to make certain pareto improvements is not). However, I worry that any goal over stated can be converted to a goal over outputs which amounts to the same thing, by calculating the expected value of the action according to the old goal. Presumably you mean some sufficiently simple action-goal so as to exclude this.
Yeah, I agree. I almost said “simple function of the output,” but I don’t actually think simplicity is the right metric here. It’s more like “a function of the output that doesn’t go through the consequences of said output.”