RSS

Rohin Shah

Karma: 15,605

Research Scientist at Google DeepMind. Creator of the Alignment Newsletter. http://​​rohinshah.com/​​

Google Deep­Mind: An Ap­proach to Tech­ni­cal AGI Safety and Security

Rohin ShahApr 5, 2025, 10:00 PM
73 points
12 comments18 min readLW link
(arxiv.org)

Nega­tive Re­sults for SAEs On Down­stream Tasks and Depri­ori­tis­ing SAE Re­search (GDM Mech In­terp Team Progress Up­date #2)

Mar 26, 2025, 7:07 PM
111 points
15 comments29 min readLW link
(deepmindsafetyresearch.medium.com)

AGI Safety & Align­ment @ Google Deep­Mind is hiring

Rohin Shah17 Feb 2025 21:11 UTC
102 points
19 comments10 min readLW link

A short course on AGI safety from the GDM Align­ment team

14 Feb 2025 15:43 UTC
103 points
2 comments1 min readLW link
(deepmindsafetyresearch.medium.com)