Rohin Shah comments on Mesa-Search vs Mesa-Control

Rohin Shah Aug 19, 2020, 6:05 PM
LW: 14 AF: 9
AF
Here on LW / AF, “mesa optimization” seems to only apply if there’s some sort of “general” learning algorithm, especially one that is “using search”, for reasons that have always been unclear to me. Some relevant posts taking the opposite perspective (which I endorse):
Is the term mesa optimizer too narrow?
Why is pseudo-alignment “worse” than other ways ML can fail to generalize?
What links here?
- abramdemski's comment on Mesa-Search vs Mesa-Control by abramdemski (Aug 19, 2020, 8:21 PM; 23 points)