I want a fairly simple, archetypal experiment that the AI finds itself in, where it tricks the researchers into letting it escape by pretending to malfunction or something. … Also, has this sort of thing been done before?
The 2015 movie Ex Machina deals with something like this. IMO it was an outstanding movie, although it was not a complete or perfect depiction of AI risk as generally understood by LWers.