Evan Hubinger: In my paper, I theorized about the mesa optimizer as a cautionary tale
Capabilities researchers: At long last, we have created the Mesa Layer from classic alignment paper Risks From Learned Optimization (Hubinger, 2019).
Evan Hubinger: In my paper, I theorized about the mesa optimizer as a cautionary tale
Capabilities researchers: At long last, we have created the Mesa Layer from classic alignment paper Risks From Learned Optimization (Hubinger, 2019).