Charlie Steiner comments on ELK prize results

Charlie Steiner 9 Mar 2022 2:44 UTC
LW: 10 AF: 4
AF
Bravo! Honestly the thing I’m most impressed with here is your blazing speed.

I like the “make it useful to another AI” idea, in part because I think it has interesting failure modes. The dynamic between the predictor and the user is apparently adversarial (so you might imagine that training the predictor on a fixed user will lead to the user getting deceived, while training the user on a fixed predictor leads to deceptions being uncovered). But also, there’s a cooperative dynamic where given a fixed evaluation function for how well the user does, both the predictor and the user are trying to find exploits in the evaluator.