The strict version of the simulation objective is optimized by the actual “time evolution” rule that created the training samples. For most datasets, we don’t know what the “true” generative rule is, except in synthetic datasets, where we specify the rule.
I hope I read this before while doing my research proposal. But pretty much have arrived to the same conclusion that I believe alignment research is missing out—the pattern recognition learning systems being researched/deployed currently seems to lack a firm grounding on other fields of sciences like biology or pyschology that at the very least links to chemistry and physics.
I hope I read this before while doing my research proposal. But pretty much have arrived to the same conclusion that I believe alignment research is missing out—the pattern recognition learning systems being researched/deployed currently seems to lack a firm grounding on other fields of sciences like biology or pyschology that at the very least links to chemistry and physics.