I’ll focus on 2 first given that it’s the most important.
2. I would expect sim2real to not be too hard for foundations models because they’re trained over massive distributions which allow and force to generalize to near neighbours. E.g. I think that it wouldn’t be too hard for a LLMbto generalize some knowledge from stories to real life if it had an external memory for instance.
I’m not certain but I feel like robotics is more sensitive to details than plans (which is why I’m mentioning a simulation here).
Finally regarding long horizon I agree that it seems hard but I worry that at current capabilities level you can already build ~any reward model because LLMs, given enough inferences seem generally very capable atb evaluating stuff.
I agree that it’s not something which is very likely. But I disagree that “nobody would do that”. People would do that if it were useful.
I’ve asked some ML engineers and it happens that you don’t look at it for a day. I don’t think that deploying it in the real world changes much. Once again you’re also assuming a pretty advanced formb of security mindset.
I’ll focus on 2 first given that it’s the most important. 2. I would expect sim2real to not be too hard for foundations models because they’re trained over massive distributions which allow and force to generalize to near neighbours. E.g. I think that it wouldn’t be too hard for a LLMbto generalize some knowledge from stories to real life if it had an external memory for instance. I’m not certain but I feel like robotics is more sensitive to details than plans (which is why I’m mentioning a simulation here). Finally regarding long horizon I agree that it seems hard but I worry that at current capabilities level you can already build ~any reward model because LLMs, given enough inferences seem generally very capable atb evaluating stuff.
I agree that it’s not something which is very likely. But I disagree that “nobody would do that”. People would do that if it were useful.
I’ve asked some ML engineers and it happens that you don’t look at it for a day. I don’t think that deploying it in the real world changes much. Once again you’re also assuming a pretty advanced formb of security mindset.