I was including the current level of RLHF as already not qualifying as “pure autoregressive LLMs”. IMO the RLHF is doing a bunch of important work, at least at current capability levels (and my guess is that it will also do some important work at the first dangerous capability levels).
Oh, ok, I retract my claim.
Also, I feel like you forgot the context of the original message, which said “all the way to superintelligence”.
I didn’t, I provided various caveats in parentheticals about the exact level of danger.
Oh, ok, I retract my claim.
I didn’t, I provided various caveats in parentheticals about the exact level of danger.
Oops, mea culpa. I skipped your last parenthetical when reading your comment, so I missed that.