I’m talking about publishing a technical design of Friendliness that’s conserved under self-improving optimization without also publishing (in math and code) exactly what is meant by self-improving optimization. CEV is a good first step, but a programmatically reusable solution it is not.
Before you the terrible blank wall stretches up and up and up, unimaginably far out of reach. And there is also the need to solve it, really solve it, not “try your best”.
Isn’t CEV an attempt to separate F and AI parts?
It’s half of the F. Between the CEV and the AGI is the ‘goal stability under recursion’ part.
It’s a good first step.
I don’t understand your impossibility comment, then.
I’m talking about publishing a technical design of Friendliness that’s conserved under self-improving optimization without also publishing (in math and code) exactly what is meant by self-improving optimization. CEV is a good first step, but a programmatically reusable solution it is not.
On doing the impossible:
OK, I understand that much better now. Great point.