habryka comments on Jemist’s Shortform

habryka 28 Oct 2024 19:58 UTC
5 points
2
True if you don’t count the training process as part of the optimizer (which is a choice that sometimes makes sense and sometimes doesn’t). If you do count the training process as part of the optimizer, then most of the time you can of course just flip the sign of your loss function or RL signal.
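
A toy sketch of the second case, under assumptions that are not in the comment: the "training process" is plain gradient descent on one parameter, the loss is (x - 3)^2, and the parameter is clamped to [-10, 10] so the flipped objective has a well-defined optimum. The names `train`, `grad` and `flipped_grad` are hypothetical, chosen only for illustration.

```python
def train(loss_grad, x0=0.0, lr=0.1, steps=200, lo=-10.0, hi=10.0):
    """Gradient descent on one parameter, clamped to [lo, hi]."""
    x = x0
    for _ in range(steps):
        x = x - lr * loss_grad(x)   # step downhill on whatever loss we were given
        x = max(lo, min(hi, x))     # keep the parameter inside the box
    return x


def grad(x):
    """Gradient of the toy loss (x - 3)^2."""
    return 2.0 * (x - 3.0)


def flipped_grad(x):
    """Gradient of the negated loss -(x - 3)^2."""
    return -grad(x)


minimizer = train(grad)          # settles near x = 3, where the loss is lowest
maximizer = train(flipped_grad)  # pushed to the boundary where the loss is highest
```

The training loop is identical in both calls; only the sign of the signal changes, and the result goes from the point that minimizes the loss to the point in the box that maximizes it. This only works because `train` is treated as part of the optimizer: the already-trained `minimizer` value on its own has no sign to flip.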