RSS

tamera

Karma: 267

ML engineer turned AI safety researcher.

Reach out to me via email (tamera.lanham at gmail.com) or facebook /​​ messenger (Tamera Lanham).

Mea­sur­ing and Im­prov­ing the Faith­ful­ness of Model-Gen­er­ated Rea­son­ing

Jul 18, 2023, 4:36 PM
111 points
15 comments6 min readLW link1 review

Ex­ter­nal­ized rea­son­ing over­sight: a re­search di­rec­tion for lan­guage model alignment

tameraAug 3, 2022, 12:03 PM
135 points
23 comments6 min readLW link