If I’m reading your question right, I think the answer is:
I’m going to make a bunch of claims about the algorithms underlying human intelligence, and then talk about safely using algorithms with those properties. If our future AGI algorithms have those properties, then this series will be useful, and I would be inclined to call such an algorithm “brain-like”.
I.e., the distinction depends on whether a given architecture has certain properties that Steve will describe later. Given Steve’s work, those are probably the key properties of “A learned population of Compositional Generative Models + A largely hardcoded Steering Subsystem”.
OK that’s fair, I didn’t really answer Jon’s question.
So: What makes an AI algorithm brain-like for present purposes? Probably the biggest single thing is whether it’s an actor-critic model-based RL algorithm. If yes, there’s a good chance that some of the things I talk about in this series will be at least somewhat applicable to it. If no, probably not.
But I’m not too sure and wouldn’t want to make any promises. “Actor-critic model-based RL” is AFAIK a big diverse group of a thousand different models that work in a thousand different ways. Probably all of them have some kinds of safety-relevant differences from what I’m gonna talk about. But it’s hard for me to say anything in general.