Interesting, but it is hard for me to think of an example where (i) we can mechanically generate a huge database of formally defined questions, (ii) the questions are not human understandable a priori and (iii) the questions can be made human understandable. Any example of i+ii that I can think of involves things like “given a dynamical system of type T whose initial conditions are described by the following huge table of numbers, will the 2356th parameter in the system’s state after 746.892 time of evolution be larger than 21.783?”
Also, I guess the implicit assumption is that there is an additional safety mechanism in place which prevents the AI from modifying the human’s mind in some terrible way?
Interesting, but it is hard for me to think of an example where (i) we can mechanically generate a huge database of formally defined questions, (ii) the questions are not human understandable a priori and (iii) the questions can be made human understandable. Any example of i+ii that I can think of involves things like “given a dynamical system of type T whose initial conditions are described by the following huge table of numbers, will the 2356th parameter in the system’s state after 746.892 time of evolution be larger than 21.783?”
Also, I guess the implicit assumption is that there is an additional safety mechanism in place which prevents the AI from modifying the human’s mind in some terrible way?