p.b. comments on Rauno’s Shortform

p.b. 18 Nov 2024 16:29 UTC
2 points
There was a comment on Twitter saying that RLHF-finetuned models also still have the ability to play chess pretty well, but their input/output formatting makes it impossible for them to access this ability (or something along these lines). Apparently it can be recovered with a little finetuning.
- ZY 18 Nov 2024 19:42 UTC
  1 point
  Yeah, that makes sense; the knowledge should still be there, you just need to shift the distribution “back”.