I’m curious: what value does “this thing that isn’t learned cooperation” fail to capture?
It suggests that in other environments that aren’t tragedies of the commons, the technique won’t lead to cooperation. It also suggests that you could get the same result by giving the agents any sort of extra reward (that influences their actions somehow).
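To make that concrete, here is a minimal, purely hypothetical sketch (not the paper’s method) of what “any sort of extra reward that influences their actions” could look like: an arbitrary per-action bonus added to each agent’s environment reward before the learning update sees it.

```python
import numpy as np

N_ACTIONS = 5  # hypothetical discrete action space size
rng = np.random.default_rng(0)
# A fixed, arbitrary per-action bonus: it nudges behaviour, but it encodes
# nothing about other agents' welfare or about cooperation.
BONUS = rng.normal(scale=1.0, size=N_ACTIONS)

def shaped_rewards(env_rewards, actions, coeff=0.1):
    """Return each agent's environment reward plus an arbitrary extra term.

    env_rewards: shape (n_agents,), rewards from the environment.
    actions:     shape (n_agents,), discrete action indices in [0, N_ACTIONS).
    """
    return np.asarray(env_rewards) + coeff * BONUS[np.asarray(actions)]

# Three agents: their raw rewards and the shaped rewards the learner would
# actually be trained on.
print(shaped_rewards([1.0, 0.0, -0.5], [2, 0, 1]))
```

The sketch’s only point is that the shaping term is arbitrary; if something like this produced similar behaviour, that would support reading the result as “extra reward changes behaviour” rather than learned cooperation.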
Is “useful” a global improvement, or a local improvement?
Also not clear what the answer to this is.
Hm, I thought of them as things that would require looking at:
1) Behavior in environments constructed for that purpose.
2) The information the agents communicate.
The agents won’t work in any environment other than the one they were trained in, and the information they communicate is probably in the form of vectors of numbers that are not human-interpretable. It’s not impossible to analyze them, but it would be difficult.
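That said, here is a hedged sketch of how such an analysis could start, assuming you can log the communication vectors during rollouts together with some candidate state feature (the logging, the feature, and the shapes below are all hypothetical stand-ins): fit a simple linear probe and check whether the feature can be read off the messages.

```python
import numpy as np

def linear_probe_r2(messages, feature, l2=1e-3):
    """Fit a ridge-regression probe: can `feature` be predicted from the messages?

    messages: shape (n_samples, msg_dim), logged communication vectors.
    feature:  shape (n_samples,), a scalar state feature the messages might encode.
    Returns the R^2 of the fit: near 1 suggests the messages (linearly) carry
    that information; near 0 suggests they do not, or not linearly.
    """
    X = np.hstack([messages, np.ones((len(messages), 1))])  # add a bias column
    w = np.linalg.solve(X.T @ X + l2 * np.eye(X.shape[1]), X.T @ feature)
    pred = X @ w
    ss_res = np.sum((feature - pred) ** 2)
    ss_tot = np.sum((feature - feature.mean()) ** 2)
    return 1.0 - ss_res / ss_tot

# Toy usage with synthetic data standing in for logged rollouts.
rng = np.random.default_rng(0)
msgs = rng.normal(size=(500, 16))                                 # 16-dim messages
resource_level = msgs @ rng.normal(size=16) + 0.1 * rng.normal(size=500)
print(f"probe R^2: {linear_probe_r2(msgs, resource_level):.3f}")
```

A high probe score would only show the information is present in the messages, not that the agents use it cooperatively, which is part of why the analysis would be difficult.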