hannagabor comments on Frontier Models are Capable of In-context Scheming

hannagabor 9 Dec 2024 15:35 UTC
1 point
I was wondering how the models perform on the multiplication test by default. If they perform better when incentivized to do well than they do by default, that might mean they are not using their full capabilities by default.