RSS

AdamGleave

Karma: 913

AI Safety in a World of Vuln­er­a­ble Ma­chine Learn­ing Systems

Mar 8, 2023, 2:40 AM
70 points
28 comments29 min readLW link
(far.ai)

CIRL Cor­rigi­bil­ity is Fragile

Dec 21, 2022, 1:40 AM
58 points
8 comments12 min readLW link