I’d say these are sensible enough thoughts on the social and ethical aspects of alignment. But that’s already only one half of the process compared to the other side, the technical aspects, which include simply “ok, we’ve decided on a few principles, now how the hell do we guarantee the AI actually sticks to them?”.
I’d say these are sensible enough thoughts on the social and ethical aspects of alignment. But that’s already only one half of the process compared to the other side, the technical aspects, which include simply “ok, we’ve decided on a few principles, now how the hell do we guarantee the AI actually sticks to them?”.