Update: I wrote a big article, “Aligning an H-JEPA agent via training on the outputs of an LLM-based ‘exemplary actor’”, in which I develop the thinking behind the comment above (but also update it significantly).