RSS

Jonah Brown-Cohen

Karma: 77

On scal­able over­sight with weak LLMs judg­ing strong LLMs

8 Jul 2024 8:59 UTC
48 points
18 comments7 min readLW link
(arxiv.org)

De­bate, Or­a­cles, and Obfus­cated Arguments

20 Jun 2024 23:14 UTC
40 points
2 comments21 min readLW link