Thank you. I was completely missing that they used a second ‘preference’ model to score outputs for the RL. I’m surprised that works!