Interesting, thanks for the reply. I agree that it could develop superhuman ability in some domains, even if that ability doesn’t manifest in the model’s output, so that seems promising (although not very scaleable). I haven’t read on mesa optimizers yet.
Interesting, thanks for the reply. I agree that it could develop superhuman ability in some domains, even if that ability doesn’t manifest in the model’s output, so that seems promising (although not very scaleable). I haven’t read on mesa optimizers yet.