Smallwood: how could you determine that the AI provided the actual source code rather than very similar source code that has been subtly altered so as to ensure “good” behavior once it is let out of the simulated box?
Smallwood: how could you determine that the AI provided the actual source code rather than very similar source code that has been subtly altered so as to ensure “good” behavior once it is let out of the simulated box?