RSS

Jeremy Gillen

Karma: 1,786

I’m interested in doing in-depth dialogues to find cruxes. Message me if you are interested in doing this.

I do alignment research, mostly stuff that is vaguely agent foundations. Currently doing independent alignment research on ontology identification. Formerly on Vivek’s team at MIRI. Most of my writing before mid 2023 is not representative of my current views about alignment difficulty.

De­tect Good­hart and shut down

Jeremy Gillen22 Jan 2025 18:45 UTC
60 points
17 comments7 min readLW link