RSS

DanielFilan

Karma: 8,626

AXRP Epi­sode 40 - Ja­son Gross on Com­pact Proofs and Interpretability

DanielFilanMar 28, 2025, 6:40 PM
22 points
0 comments89 min readLW link

AXRP Epi­sode 38.8 - David Du­ve­naud on Sab­o­tage Eval­u­a­tions and the Post-AGI Future

DanielFilanMar 1, 2025, 1:20 AM
13 points
0 comments13 min readLW link

AXRP Epi­sode 38.7 - An­thony Aguirre on the Fu­ture of Life Institute

DanielFilanFeb 9, 2025, 1:10 AM
10 points
0 comments12 min readLW link

AXRP Epi­sode 38.6 - Joel Lehman on Pos­i­tive Vi­sions of AI

DanielFilanJan 24, 2025, 11:00 PM
10 points
0 comments9 min readLW link

AXRP Epi­sode 38.5 - Adrià Gar­riga-Alonso on De­tect­ing AI Scheming

DanielFilanJan 20, 2025, 12:40 AM
9 points
0 comments16 min readLW link

MATS men­tor selection

Jan 10, 2025, 3:12 AM
43 points
11 comments6 min readLW link

AXRP Epi­sode 38.4 - Sha­keel Hashim on AI Journalism

DanielFilanJan 5, 2025, 12:20 AM
11 points
0 comments12 min readLW link

AXRP Epi­sode 38.3 - Erik Jen­ner on Learned Look-Ahead

DanielFilanDec 12, 2024, 5:40 AM
20 points
0 comments16 min readLW link

AXRP Epi­sode 39 - Evan Hub­inger on Model Or­ganisms of Misalignment

DanielFilanDec 1, 2024, 6:00 AM
41 points
0 comments67 min readLW link

AXRP Epi­sode 38.2 - Jesse Hoogland on Sin­gu­lar Learn­ing Theory

DanielFilanNov 27, 2024, 6:30 AM
34 points
0 comments10 min readLW link

AXRP Epi­sode 38.1 - Alan Chan on Agent Infrastructure

DanielFilanNov 16, 2024, 11:30 PM
12 points
0 comments14 min readLW link

AXRP Epi­sode 38.0 - Zhijing Jin on LLMs, Causal­ity, and Multi-Agent Systems

DanielFilanNov 14, 2024, 7:00 AM
14 points
0 comments12 min readLW link

MATS AI Safety Strat­egy Cur­ricu­lum v2

Oct 7, 2024, 10:44 PM
42 points
6 comments13 min readLW link

AXRP Epi­sode 37 - Jaime Sevilla on Fore­cast­ing AI

DanielFilanOct 4, 2024, 9:00 PM
21 points
3 comments56 min readLW link

AXRP Epi­sode 36 - Adam Shai and Paul Riech­ers on Com­pu­ta­tional Mechanics

DanielFilanSep 29, 2024, 5:50 AM
25 points
0 comments55 min readLW link

AXRP Epi­sode 35 - Peter Hase on LLM Beliefs and Easy-to-Hard Generalization

DanielFilanAug 24, 2024, 10:30 PM
21 points
0 comments74 min readLW link

AXRP Epi­sode 34 - AI Eval­u­a­tions with Beth Barnes

DanielFilanJul 28, 2024, 3:30 AM
23 points
0 comments69 min readLW link

Why keep a di­ary, and why wish for large lan­guage models

DanielFilanJun 14, 2024, 4:10 PM
9 points
1 comment2 min readLW link
(danielfilan.com)

AXRP Epi­sode 33 - RLHF Prob­lems with Scott Emmons

DanielFilanJun 12, 2024, 3:30 AM
34 points
0 comments56 min readLW link

AXRP Epi­sode 32 - Un­der­stand­ing Agency with Jan Kulveit

DanielFilanMay 30, 2024, 3:50 AM
20 points
0 comments53 min readLW link