RSS

McKennaFitzgerald

Karma: 246

Eval­u­at­ing Over­sight Ro­bust­ness with In­cen­tivized Re­ward Hacking

Apr 20, 2025, 4:53 PM
1 point
0 comments15 min readLW link

Ta­lent Needs of Tech­ni­cal AI Safety Teams

May 24, 2024, 12:36 AM
117 points
65 comments14 min readLW link

MATS Win­ter 2023-24 Retrospective

May 11, 2024, 12:09 AM
86 points
28 comments49 min readLW link

MATS Sum­mer 2023 Retrospective

Dec 1, 2023, 11:29 PM
77 points
34 comments26 min readLW link