
Daniel Tan

Karma: 1,087

AI alignment researcher. Interested in understanding reasoning in language models.

https://dtch1997.github.io/

Open problems in emergent misalignment

Mar 1, 2025, 9:47 AM
76 points
13 comments
7 min read