samarnesen comments on GPT-3.5 judges can supervise GPT-4o debaters in capability asymmetric debates

samarnesen 28 Aug 2024 3:25 UTC
5 points
0
This seems like really interesting work! Would you be able to share any example transcripts from some of these debates? Since RLHF’ed models often shy away from combativeness, I’m curious as to the form of GPT-4′s rebuttals (especially for questions where the judge gets it right after reading the debate but wrong otherwise)
- Charlie George 4 Sep 2024 20:48 UTC
  2 points
  0
  Parent
  I’ve added a markdown file with transcripts to the repo.