Well, transparency is definitely a challenge. I’m mostly saying this is a technical challenge even if you have magical transparency tools, and I’m kind of trying to design the system you would want to use if you had magical transparency tools.
But I don’t think it’s difficult for the reason you say. I don’t think multi-level feedback or whole-process feedback should be construed as requiring the levels to be sorted out nicely. Whole-process feedback in particular just means that you can give feedback on the whole chain of computation; it’s basically against sorting into levels.
Multi-level feedback means, to me, that if we have an insight about, EG, how to think about value uncertainty (which is something like a 3rd-level thought: 1st level is information about object level; 2nd level is information about the value function; 3rd level is information about how to learn the value function), we can give the system feedback about that. So the system doesn’t need to sort things out into levels; it just needs to be capable of accepting feedback of each type.
Well, transparency is definitely a challenge. I’m mostly saying this is a technical challenge even if you have magical transparency tools, and I’m kind of trying to design the system you would want to use if you had magical transparency tools.
But I don’t think it’s difficult for the reason you say. I don’t think multi-level feedback or whole-process feedback should be construed as requiring the levels to be sorted out nicely. Whole-process feedback in particular just means that you can give feedback on the whole chain of computation; it’s basically against sorting into levels.
Multi-level feedback means, to me, that if we have an insight about, EG, how to think about value uncertainty (which is something like a 3rd-level thought: 1st level is information about object level; 2nd level is information about the value function; 3rd level is information about how to learn the value function), we can give the system feedback about that. So the system doesn’t need to sort things out into levels; it just needs to be capable of accepting feedback of each type.