This is a fairly straightforward point, but one I haven’t seen written up before and I’ve personally been wondering a bunch about. I appreciated this post both for laying out the considerations pretty thoroughly, including a bunch or related reading, and laying out some concrete predictions at the end.
I hadn’t seen your posts either (despite searching; I think the lack of widely shared terminology around this problem gets in the way). I’d be very interested to learn more about how your research agenda has progressed since that first post.
This post was mostly intended to be broad audience / narrow message, just (as Raemon says) pointing to the crux here, breaking it down, and giving a sense of the arguments on each side.
I’d be very interested to learn more about how your research agenda has progressed since that first post.
The post about learned lookahead in Leela has kind of galvanised me into finally finishing an investigation I have worked on for too long already. (Partly because I think that finding is incorrect, but also because using Leela is a great idea, I had got stuck with LLMs requiring a full game for each puzzle position).
It so happens I hadn’t seen your other posts, although I think there is something that this post was aiming at, that yours weren’t quite pointed at, which is laying out “this is a crux for timelines, these are the subcomponents of the crux.” (But, I haven’t read your posts in detail yet and thought about what else they might be good at that this post wasn’t aiming for)
Curated.
This is a fairly straightforward point, but one I haven’t seen written up before and I’ve personally been wondering a bunch about. I appreciated this post both for laying out the considerations pretty thoroughly, including a bunch or related reading, and laying out some concrete predictions at the end.
I feel like I have been going on about this for years. Like here, here or here. But I’d be the first to admit, that I don’t really do effort posts.
I hadn’t seen your posts either (despite searching; I think the lack of widely shared terminology around this problem gets in the way). I’d be very interested to learn more about how your research agenda has progressed since that first post. This post was mostly intended to be broad audience / narrow message, just (as Raemon says) pointing to the crux here, breaking it down, and giving a sense of the arguments on each side.
The post about learned lookahead in Leela has kind of galvanised me into finally finishing an investigation I have worked on for too long already. (Partly because I think that finding is incorrect, but also because using Leela is a great idea, I had got stuck with LLMs requiring a full game for each puzzle position).
I will ping you when I write it up.
I’m looking forward to it!
It so happens I hadn’t seen your other posts, although I think there is something that this post was aiming at, that yours weren’t quite pointed at, which is laying out “this is a crux for timelines, these are the subcomponents of the crux.” (But, I haven’t read your posts in detail yet and thought about what else they might be good at that this post wasn’t aiming for)