I agree that this task is far “easier task than a standard AI box experiment”. I attacked it from a different angle though (HarryPrime can easily and honestly convince Voldemort he is doomed unless HarryPrime helps him).:
http://lesswrong.com/r/discussion/lw/lsp/harry_potter_and_the_methods_of_rationality/c206
I agree that this task is far “easier task than a standard AI box experiment”. I attacked it from a different angle though (HarryPrime can easily and honestly convince Voldemort he is doomed unless HarryPrime helps him).:
http://lesswrong.com/r/discussion/lw/lsp/harry_potter_and_the_methods_of_rationality/c206