By “AI design” I’m assuming you are referring to the learning algorithm and runtime/inference algorithm of the agent A in the amplification scheme.
In that case, I hadn’t thought of the system as only needing to work with respect to the learning algorithm. Maybe it’s possible/useful to reason about limited versions which are corrigible with respect to some simple current technique (just not very competent).
By “AI design” I’m assuming you are referring to the learning algorithm and runtime/inference algorithm of the agent A in the amplification scheme.
In that case, I hadn’t thought of the system as only needing to work with respect to the learning algorithm. Maybe it’s possible/useful to reason about limited versions which are corrigible with respect to some simple current technique (just not very competent).