Buck

Karma: 11,174

CEO at Redwood Research.

AI safety is a highly collaborative field—almost all the points I make were either explained to me by someone else, or developed in conversation with other people. I’m saying this here because it would feel repetitive to say “these ideas were developed in collaboration with various people” in all my comments, but I want to have it on the record that the ideas I present were almost entirely not developed by me in isolation.

A sketch of an AI control safety case

Jan 30, 2025, 5:28 PM
57 points
0 comments · 5 min read · LW link

Ten people on the inside

Buck · Jan 28, 2025, 4:41 PM
139 points
28 comments · 4 min read · LW link

Early Experiments in Human Auditing for AI Control

Jan 23, 2025, 1:34 AM
27 points
0 comments · 7 min read · LW link