I don’t think the game is an alarming capability gain at all; I agree with LawrenceC’s comment below. It’s more of a “gain-of-function research” scenario to me. Like, maybe we shouldn’t deliberately try to train a model to be good at this? If you’ve ever played Diplomacy, you know the whole point of the game is manipulating and backstabbing your way to world domination. I think it’s great that the research didn’t actually seem to come up with any scary generalizable techniques or dangerous memetics, but I think ideally we shouldn’t even be trying in the first place.