I suspect that seeing the logs would have made Eliezer seem like a horrible human being. Most people who hear of AI Box imagine a convincing argument, whereas to me exploiting weaknesses in people’s sense of narrative or emotion seems the more plausible route.
Yup, certainly possible. Some later attempts at box-escaping have gone that way. (I don’t know whether any successful ones have. There don’t seem to have been many successes since Eliezer’s.)