in domains complicated enough that perfect play is not possible, less intelligent agents will not be able to predict the exact moves made by more intelligent agents.
Which is why you can’t use decision theory to solve AI safety. But everyone seems to have quietly given up on that idea.
Interesting point, I haven’t thought about it from that perspective yet! Do you happen to have a reference at hand, I’d love to read more about that. (No worries if not, then I’ll do some digging myself).
Which is why you can’t use decision theory to solve AI safety. But everyone seems to have quietly given up on that idea.
Interesting point, I haven’t thought about it from that perspective yet! Do you happen to have a reference at hand, I’d love to read more about that. (No worries if not, then I’ll do some digging myself).