I didn’t methodically order the experiment ideas, but they are meant to be roughly presented in order of some combination of concreteness/tractability and importance.
What do you think of my speculation about the tagging/switching/routing internal mechanism?
Regarding to how they do it, I believe the relevant passage to be:
Because distinct tasks within a domain can share identical embodiments, observation formats and action specifications, the model sometimes needs further context to disambiguate tasks. Rather than providing e.g. one-hot task identifiers, we instead take inspiration from (Brown et al., 2020; Sanh et al., 2022; Wei et al., 2021) and use prompt conditioning.
I guess it should be possible to locate the activation paths for different tasks, as the tasks are pretty well separated. Something on the lines of https://github.com/jalammar/ecco
I didn’t methodically order the experiment ideas, but they are meant to be roughly presented in order of some combination of concreteness/tractability and importance.
What do you think of my speculation about the tagging/switching/routing internal mechanism?
When you say “switching” it reminds me of the “big switch” approach of https://en.wikipedia.org/wiki/General_Problem_Solver.
Regarding to how they do it, I believe the relevant passage to be:
I guess it should be possible to locate the activation paths for different tasks, as the tasks are pretty well separated. Something on the lines of https://github.com/jalammar/ecco