I think I don’t know the solution, and if so it’s impossible for me to guess what he thinks if he’s right :)
But maybe he’s thinking of something vague like CIRL, or hierarchical self-supervised learning with generation, etc. But I think he’s thinking of some kind of recurrent network. So maybe he has some clever idea for unsupervised credit assignment?
I think I don’t know the solution, and if so it’s impossible for me to guess what he thinks if he’s right :)
But maybe he’s thinking of something vague like CIRL, or hierarchical self-supervised learning with generation, etc. But I think he’s thinking of some kind of recurrent network. So maybe he has some clever idea for unsupervised credit assignment?