RSS

Sam F. Brown

Karma: 378

March 2025 Oxford Ra­tion­al­ish Pub Social

Mar 5, 2025, 8:07 AM
1 point
0 comments1 min readLW link

Feb 2025 Oxford Ra­tion­al­ish Pub Social

Feb 6, 2025, 12:53 PM
1 point
0 comments1 min readLW link

Oxford Ra­tion­al­ish Pub Social

Jan 2, 2025, 12:10 AM
1 point
0 comments1 min readLW link

OxRat De­cem­ber Pub Social

Nov 28, 2024, 1:13 PM
1 point
0 comments1 min readLW link

Oxford Ra­tion­al­ish—Novem­ber Pub

Nov 4, 2024, 8:25 PM
1 point
0 comments1 min readLW link

Oxford ACX Any­where—OxRat 2024

Sep 3, 2024, 7:18 PM
1 point
0 comments1 min readLW link

OxRat Septem­ber 2024 Pub Social

Sep 3, 2024, 6:57 PM
1 point
0 comments1 min readLW link

OxRat Au­gust Pub Social

Aug 5, 2024, 8:16 PM
1 point
0 comments1 min readLW link

Auto-En­hance: Devel­op­ing a meta-bench­mark to mea­sure LLM agents’ abil­ity to im­prove other agents

Jul 22, 2024, 12:33 PM
20 points
0 comments14 min readLW link

OxRat July Pub Social

Jul 4, 2024, 2:36 PM
1 point
0 comments1 min readLW link

[Paper] AI Sand­bag­ging: Lan­guage Models can Strate­gi­cally Un­der­perform on Evaluations

Jun 13, 2024, 10:04 AM
84 points
10 comments2 min readLW link
(arxiv.org)

Oxford Ra­tion­al­ish—June Pub

Jun 10, 2024, 11:44 AM
1 point
0 comments1 min readLW link

OxRat ACX Mee­tups Every­where—Spring 2024

Mar 16, 2024, 7:41 PM
7 points
0 comments1 min readLW link

OxRat March Pub Social

Mar 10, 2024, 9:27 PM
1 point
0 comments1 min readLW link

Oxford Ra­tion­al­ish—Dec Pub

Dec 8, 2023, 8:20 PM
1 point
0 comments1 min readLW link

Tall Tales at Differ­ent Scales: Eval­u­at­ing Scal­ing Trends For De­cep­tion In Lan­guage Models

Nov 8, 2023, 11:37 AM
49 points
0 comments18 min readLW link

Oxford Ra­tion­al­ish—Sept Pub

Sam F. BrownSep 19, 2023, 10:03 AM
4 points
0 comments1 min readLW link

OxRat ACX Mee­tups Every­where 2023

Sam F. BrownAug 30, 2023, 3:15 AM
4 points
0 comments1 min readLW link

Oxford, UK – ACX Mee­tups Every­where Fall 2023

Sam F. BrownAug 25, 2023, 11:33 PM
4 points
0 comments1 min readLW link

Oxford Ra­tion­al­ish—July Pub

Sam F. BrownJul 15, 2023, 10:10 AM
4 points
0 comments1 min readLW link