Replied here: “Aligning an H-JEPA agent via training on the outputs of an LLM-based “exemplary actor”.
Replied here: “Aligning an H-JEPA agent via training on the outputs of an LLM-based “exemplary actor”.