On page 8 of the paper they say, “our work does not demonstrate or address mesa-optimization”. I think it’s because none of the agents in their paper has learned an optimization process (i.e. is running something like a search algorithm on the inside).
FWIW I believe I wrote that sentence and I now think this is a matter of definition, and that it’s actually reasonable to think of an agent that e.g. reliably solves a maze as an optimizer even if it does not use explicit search internally.
I am really curious about the disagree votes here: do people think this is not an empirical work demonstrating mesa-optimization?