Raemon comments on LLM Generality is a Timeline Crux

Raemon 24 Jun 2024 20:51 UTC
12 points
7
Curated.
This is a fairly straightforward point, but one I haven’t seen written up before and I’ve personally been wondering a bunch about. I appreciated this post both for laying out the considerations pretty thoroughly, including a bunch of related reading, and for laying out some concrete predictions at the end.
- p.b. 25 Jun 2024 12:35 UTC
  4 points
  0
  Parent
  This is a fairly straightforward point, but one I haven’t seen written up before and I’ve personally been wondering a bunch about.
  I feel like I have been going on about this for years. Like here, here or here. But I’d be the first to admit that I don’t really do effort posts.
  - eggsyntax 26 Jun 2024 8:38 UTC
    4 points
    0
    Parent
    I hadn’t seen your posts either (despite searching; I think the lack of widely shared terminology around this problem gets in the way). I’d be very interested to learn more about how your research agenda has progressed since that first post. This post was mostly intended to be broad audience / narrow message, just (as Raemon says) pointing to the crux here, breaking it down, and giving a sense of the arguments on each side.
    - p.b. 26 Jun 2024 16:49 UTC
      2 points
      0
      Parent
      I’d be very interested to learn more about how your research agenda has progressed since that first post.
      The post about learned lookahead in Leela has kind of galvanised me into finally finishing an investigation I have worked on for too long already. (Partly because I think that finding is incorrect, but also because using Leela is a great idea; I had got stuck with LLMs requiring a full game for each puzzle position.)
      I will ping you when I write it up.
      - eggsyntax 26 Jun 2024 18:43 UTC
        1 point
        0
        Parent
        I’m looking forward to it!
  - Raemon 25 Jun 2024 18:06 UTC
    3 points
    0
    Parent
    It so happens I hadn’t seen your other posts, although I think there is something this post was aiming at that yours weren’t quite pointed at, which is laying out “this is a crux for timelines, these are the subcomponents of the crux.” (But I haven’t yet read your posts in detail or thought about what else they might be good at that this post wasn’t aiming for.)