
Benjamin Hilton

Karma: 381

Head of Alignment at UK AI Security Institute (AISI). Previously 80,000 Hours, HM Treasury, Cabinet Office, Department for International Trade, Imperial College London.

An alignment safety case sketch based on debate

May 8, 2025, 3:02 PM
57 points
21 comments, 25 min read
(arxiv.org)

UK AISI’s Alignment Team: Research Agenda

May 7, 2025, 4:33 PM
113 points
2 comments, 11 min read

A sketch of an AI control safety case

Jan 30, 2025, 5:28 PM
57 points
0 comments, 5 min read

Automation collapse

Oct 21, 2024, 2:50 PM
72 points
9 comments, 7 min read

Should you work at a leading AI lab? (including in non-safety roles)

Benjamin Hilton, Jul 25, 2023, 4:29 PM
7 points
0 comments, 12 min read

AI safety technical research - Career review

Benjamin Hilton, Jul 17, 2023, 3:34 PM
14 points
0 comments, 29 min read

How many people are working (directly) on reducing existential risk from AI?

Benjamin Hilton, Jan 18, 2023, 8:46 AM
20 points
1 comment, 4 min read
(80000hours.org)

New 80,000 Hours problem profile on existential risks from AI

Benjamin Hilton, Aug 31, 2022, 5:36 PM
28 points
6 comments, 7 min read
(80000hours.org)