RSS

tamera

Karma: 262

ML engineer turned AI safety researcher.

Reach out to me via email (tamera.lanham at gmail.com) or facebook /​​ messenger (Tamera Lanham).

Mea­sur­ing and Im­prov­ing the Faith­ful­ness of Model-Gen­er­ated Rea­son­ing

18 Jul 2023 16:36 UTC
111 points
14 comments6 min readLW link

Ex­ter­nal­ized rea­son­ing over­sight: a re­search di­rec­tion for lan­guage model alignment

tamera3 Aug 2022 12:03 UTC
130 points
23 comments6 min readLW link