Thomas Kwa

Karma: 4,966

Was on Vivek Hebbar’s team at MIRI, now working with Adrià Garriga-Alonso on various empirical alignment projects.

I’m looking for projects in interpretability, activation engineering, and control/oversight; DM me if you’re interested in working with me.

I have signed no contracts or agreements whose existence I cannot mention.