That’s not a thing that people know how to do. A task, maybe? Wrapper-minds don’t seem likely as first AGIs. But they might come later, heralding transitive AI risk.
arguably likely to show non-aligned behaviour, if it isn’t aligned
What’s “aligned” here? It’s an umbrella term for things that are good with respect to AI risk, and means very little in particular. In the context of something feasible in practice, it means even less. Like, are you aligned?
In this context, what I mean by “aligned” is something like won’t prevent itself being shut off and will not do things that could be considered bad, such as hacking or manipulating people.
My impression was that actually being able to give an AI a goal is something that might be learnt at some point. You said “A task, maybe?”. I don’t know what the meaningful distinction is between a task and a goal in this case.
I won’t be able to keep up with the technical side of things here, I just wanted my idea to be out there, in case it is helpful in some way.
I won’t be able to keep up with the technical side of things here
What’s the pointof that? It’s mostly unfamiliar, not really technical. I wish there was something technical that would be relevant to say on the topic.
That’s not a thing that people know how to do. A task, maybe? Wrapper-minds don’t seem likely as first AGIs. But they might come later, heralding transitive AI risk.
What’s “aligned” here? It’s an umbrella term for things that are good with respect to AI risk, and means very little in particular. In the context of something feasible in practice, it means even less. Like, are you aligned?
In this context, what I mean by “aligned” is something like won’t prevent itself being shut off and will not do things that could be considered bad, such as hacking or manipulating people.
My impression was that actually being able to give an AI a goal is something that might be learnt at some point. You said “A task, maybe?”. I don’t know what the meaningful distinction is between a task and a goal in this case.
I won’t be able to keep up with the technical side of things here, I just wanted my idea to be out there, in case it is helpful in some way.
What’s the point of that? It’s mostly unfamiliar, not really technical. I wish there was something technical that would be relevant to say on the topic.