Jonah Brown-Cohen
Karma: 77
On scalable oversight with weak LLMs judging strong LLMs
zac_kenton, Noah Siegel, janos, Jonah Brown-Cohen, Samuel Albanie, David Lindner and Rohin Shah
8 Jul 2024 8:59 UTC · 48 points · 18 comments · 7 min read · LW link (arxiv.org)
Debate, Oracles, and Obfuscated Arguments
Jonah Brown-Cohen and Geoffrey Irving
20 Jun 2024 23:14 UTC · 40 points · 2 comments · 21 min read · LW link