Benjamin Hilton

Karma: 381

Head of Alignment at UK AI Security Institute (AISI). Previously 80,000 Hours, HM Treasury, Cabinet Office, Department for International Trade, Imperial College London.

An alignment safety case sketch based on debate

Marie_DB, Jacob Pfau, Benjamin Hilton and Geoffrey Irving

May 8, 2025, 3:02 PM

57 points

21 comments25 min readLW link

(arxiv.org)

UK AISI’s Alignment Team: Research Agenda

Benjamin Hilton, Jacob Pfau, Marie_DB and Geoffrey Irving

May 7, 2025, 4:33 PM

113 points

2 comments11 min readLW link

A sketch of an AI control safety case

Tomek Korbak, joshc, Benjamin Hilton, Buck and Geoffrey Irving

Jan 30, 2025, 5:28 PM

57 points

0 comments5 min readLW link

Automation collapse

Geoffrey Irving, Tomek Korbak and Benjamin Hilton

Oct 21, 2024, 2:50 PM

72 points

9 comments7 min readLW link

Benjamin Hilton Feb 7, 2024, 7:15 PM
3 points
−2
in reply to: Remmelt’s comment on: Why I think it’s net harmful to do technical safety research at AGI labs
[x-posted from EA forum]

Hi Remmelt,
Thanks for sharing your concerns, both with us privately and here on the forum. These are tricky issues and we expect people to disagree about how to about how to weigh all the considerations — so it’s really good to have open conversations about them.
Ultimately, we disagree with you that it’s net harmful to do technical safety research at AGI labs. In fact, we think it can be the best career step for some of our readers to work in labs, even in non-safety roles. That’s the core reason why we list these roles on our job board.
We argue for this position extensively in my article on the topic (and we only list roles consistent with the considerations in that article).
Some other things we’ve published on this topic in the last year or so:
- A range of opinions from anonymous experts about the upsides and downsides of working on AI capabilities
- How policy roles in AI companies can be valuable for career capital and for direct impact (as well as the potential downsides)
- We recently released a podcast episode with Nathan Labenz on some of the controversy around OpenAI, including his concerns about some of their past safety practices, whether ChatGPT’s release was good or bad, and why its mission of developing AGI may be too risky.
Benjamin

Should you work at a leading AI lab? (including in non-safety roles)

Benjamin HiltonJul 25, 2023, 4:29 PM

7 points

0 comments12 min readLW link

AI safety technical research—Career review

Benjamin HiltonJul 17, 2023, 3:34 PM

14 points

0 comments29 min readLW link

How many people are working (directly) on reducing existential risk from AI?

Benjamin HiltonJan 18, 2023, 8:46 AM

20 points

1 comment4 min readLW link

(80000hours.org)

New 80,000 Hours problem profile on existential risks from AI

Benjamin HiltonAug 31, 2022, 5:36 PM

28 points

6 comments7 min readLW link

(80000hours.org)