Funnily enough, Nvidia’s recent 340B parameter chat assistant release didboast about being number one on the reward model leaderboard, however, the reward model only claims to capture helpfulness and a bunch of other metrics of usefulness to the individual user. But that’s still pretty good.
Funnily enough, Nvidia’s recent 340B parameter chat assistant release did boast about being number one on the reward model leaderboard, however, the reward model only claims to capture helpfulness and a bunch of other metrics of usefulness to the individual user. But that’s still pretty good.