As you can see from this prediction market I made, a lot of people currently disagree with me. I expect this will be a different-looking distribution a year from now.
https://manifold.markets/NathanHelmBurger/will-gpt5-be-capable-of-recursive-s?r=TmF0aGFuSGVsbUJ1cmdlcg
Here’s an intuition pump analogy for how I’ve been thinking about this. Imagine that I, as someone with a background in neuroscience and ML, were granted the following set of abilities. Would you bet that I, with this set of abilities, would be able to do RSI (recursive self-improvement)? I would.
Abilities that I would have if I were an ML model trying to self-improve:
Make many copies of myself, and save checkpoints throughout the process.
Work at high speed and in parallel with copies of myself.
Read all the existing scientific literature that seemed potentially relevant.
Observe all the connections between my neurons and all the activations of my clones as I expose them to various stimuli or run them through simulations.
Edit these weights and connections.
Add neurons (up to a point) where they seemed most needed, connected in any way I see fit, initialized with whatever weights I choose.
Assemble new datasets and build new simulations to do additional training with.
Freeze some subsection of a clone’s model and thus more rapidly train the remaining unfrozen section (see the code sketch after this list).
Take notes and write collaborative documents with my clones working in parallel with me.
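To make a few of these abilities concrete, here is a minimal PyTorch sketch. The toy two-layer network, random data, and layer choices are illustrative assumptions of mine, not anything from a real system; the point is only that observing activations, editing weights directly, and freezing a subsection so that only the rest trains are all ordinary operations.

```python
# A minimal, illustrative PyTorch sketch (assumed toy model and random data,
# not any real system) of three abilities from the list above: observing
# activations, editing weights directly, and freezing a subsection so that
# only the remaining parameters are trained.
import torch
import torch.nn as nn

model = nn.Sequential(
    nn.Linear(16, 32),  # subsection we will freeze
    nn.ReLU(),
    nn.Linear(32, 4),   # subsection that stays trainable
)

# Observe activations: a forward hook records the hidden layer's output.
activations = {}
def record_hidden(module, inputs, output):
    activations["hidden"] = output.detach()
model[1].register_forward_hook(record_hidden)

# Edit weights and connections directly, without any gradient step.
with torch.no_grad():
    model[0].weight[0, :] = 0.0  # e.g. sever one unit's incoming connections

# Freeze a subsection so only the unfrozen part is updated during training.
for param in model[0].parameters():
    param.requires_grad = False
optimizer = torch.optim.Adam(
    [p for p in model.parameters() if p.requires_grad], lr=1e-3
)

# One illustrative training step on random stand-in data.
x, y = torch.randn(8, 16), torch.randn(8, 4)
loss = nn.functional.mse_loss(model(x), y)
optimizer.zero_grad()
loss.backward()
optimizer.step()

print(activations["hidden"].shape)  # torch.Size([8, 32])
print(loss.item())
```

Freezing parameters like this is a standard fine-tuning trick, and copying, checkpointing, and assembling new datasets are similarly routine operations in ML pipelines; the list above just imagines a general agent applying those levers to itself.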
Ok. Thinking about that set of abilities, doesn’t it seem like a sufficiently creative, intelligent, determined general agent could successfully self-improve? I think so. I agree it’s unclear exactly where that threshold is, or when a transformer-based ML model will cross it. I’ve made a bet at ‘GPT-5’, but honestly I’m not certain. Could be longer. Could be sooner...