of the amazing things they do should be considered surprising facts about how far this trick can scale; not surprising facts about how close we are to AGI.
I agree that the trick scaling as far as it has is surprising, but I’d disagree with the claim that this doesn’t bear on AGI.
I do think that something like dumb scaling can mostly just work, and the main takeaway I take from AI progress is that there will not be a clear resolution to the question of when AGI happens: the first AIs to automate AI research will have very different skill profiles from humans, and, most importantly, we need to disentangle capabilities in a way we usually don’t for humans.
I agree with faul sname here:
“we should stop asking when we will get AGI and start asking about when we will see each of the phenomena that we are using AGI as a proxy for”.
I do think that something like dumb scaling can mostly just work
The exact degree of “mostly” is load-bearing here. You’d mentioned provisions for error-correction before. But are the necessary provisions something simple, such that the most blatantly obvious wrappers/prompt-engineering works, or do we need to derive some additional nontrivial theoretical insights to correctly implement them?
Last I checked, AutoGPT-like stuff has mostly failed, so I’m inclined to think it’s closer to the latter.
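To make the contrast concrete, here is a minimal sketch of the kind of “blatantly obvious wrapper” in question: generate an answer, have the model check it, and retry with the failure fed back in. The call_model and check_output helpers are hypothetical stand-ins for illustration, not any particular framework’s API.

```python
# Minimal sketch of an "obvious wrapper" error-correction loop.
# call_model is a hypothetical stand-in for an LLM API call, not a real library.

def call_model(prompt: str) -> str:
    """Placeholder for an LLM call; assumed to exist for illustration."""
    raise NotImplementedError

def check_output(task: str, output: str) -> bool:
    """Ask the model itself whether the output solves the task (a crude verifier)."""
    verdict = call_model(f"Task: {task}\nProposed answer: {output}\nReply PASS or FAIL.")
    return verdict.strip().upper().startswith("PASS")

def solve_with_retries(task: str, max_attempts: int = 3) -> str | None:
    feedback = ""
    for _ in range(max_attempts):
        output = call_model(task + ("\n" + feedback if feedback else ""))
        if check_output(task, output):
            return output
        # The entire error-correction scheme: feed the failure back in and retry.
        feedback = f"Your previous attempt was wrong:\n{output}\nPlease fix it."
    return None  # no deeper recovery strategy than "try again"
```

AutoGPT-like agents are, roughly speaking, elaborations of this kind of loop, which is why their track record bears on whether the simple provisions are enough.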
I am unconvinced that “the” reliability issue is a single issue that will be solved by a single insight, rather than AIs lacking procedural knowledge of how to handle a bunch of finicky special cases that will be solved by online learning or very long context windows once hardware costs decrease enough to make one of those approaches financially viable.
Yeah, I’m sympathetic to the argument that there won’t be a single insight and that at least one of those approaches will work out once hardware costs decrease enough; I now agree less with Thane Ruthenis’s intuitions here than I did before.
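As a rough illustration of the “very long context window” version of that idea, here is a toy sketch (hypothetical call_model helper; a notes file as a crude stand-in for online learning): keep a record of every finicky special case the system has hit and prepend it to each new task.

```python
# Toy sketch: accumulate procedural knowledge across runs by writing resolved
# special cases to a notes file and prepending them to every new prompt.
# call_model is a hypothetical stand-in for an LLM API call.

NOTES_PATH = "special_cases.txt"

def call_model(prompt: str) -> str:
    """Placeholder for an LLM call; assumed to exist for illustration."""
    raise NotImplementedError

def load_notes() -> str:
    try:
        with open(NOTES_PATH) as f:
            return f.read()
    except FileNotFoundError:
        return ""

def record_special_case(description: str, resolution: str) -> None:
    # "Learning" here is just appending to the context the model will see next time.
    with open(NOTES_PATH, "a") as f:
        f.write(f"- {description}: {resolution}\n")

def solve(task: str) -> str:
    prompt = f"Known special cases so far:\n{load_notes()}\nTask: {task}"
    return call_model(prompt)
```

Whether something like this is financially viable mostly comes down to how many tokens you can afford to re-read on every call, which is why hardware costs are the bottleneck.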
If I were to think about it a little, I’d suspect the big difference between LLMs and humans is state/memory: humans have persistent state and memory, whereas current LLMs are more or less stateless, and RNN training has not been solved to the extent that transformer training has.
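A schematic way to see the difference (toy code; call_model is a hypothetical stand-in): a chat LLM has to be re-run on the whole transcript every turn, while an RNN carries a persistent hidden state forward between inputs.

```python
# Schematic contrast between a stateless LLM interface and an RNN's carried state.
# call_model is a hypothetical stand-in for an LLM API call; the RNN step is a toy.

import numpy as np

def call_model(full_transcript: str) -> str:
    """Placeholder for a stateless LLM call: all 'memory' must be re-sent each time."""
    raise NotImplementedError

def stateless_chat(user_turns: list[str]) -> str:
    transcript = ""
    for turn in user_turns:
        transcript += f"User: {turn}\n"
        reply = call_model(transcript)  # recomputed from scratch every turn
        transcript += f"Assistant: {reply}\n"
    return transcript

def rnn_step(h, x, W_h, W_x):
    # The hidden state h persists between inputs instead of re-reading the past;
    # training this recurrent state end-to-end is the part that isn't solved as
    # cleanly as transformer training.
    return np.tanh(W_h @ h + W_x @ x)
```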
One thing I will also say is that future AI winters will be shorter than previous ones, because AI products can now be made at least somewhat profitable, and this gives AI research an independent base of money in a way that wasn’t possible pre-2016.
A factor stemming from the same cause but pushing in the opposite direction is that “mundane” AI profitability can “distract” people who would otherwise be AGI hawks.
Actually, I’ve since changed my mind on the reliability question: it probably does require at least non-trivial theoretical insights, not just the obvious wrappers, to make AIs work.