Ollie J

Karma: 186

[Paper] AI Sandbagging: Language Models can Strategically Underperform on Evaluations

Jun 13, 2024, 10:04 AM
84 points
10 comments, 2 min read, LW link
(arxiv.org)

Tall Tales at Different Scales: Evaluating Scaling Trends For Deception In Language Models

Nov 8, 2023, 11:37 AM
49 points
0 comments, 18 min read, LW link

ChatGPT banned in Italy over privacy concerns

Ollie J, Mar 31, 2023, 5:33 PM
18 points
4 comments, 1 min read, LW link
(www.bbc.co.uk)

Whisper’s Wild Implications

Ollie J, Jan 3, 2023, 12:17 PM
19 points
6 comments, 5 min read, LW link