RSS

Leon Lang

Karma: 1,375

I’m a PhD student at the University of Amsterdam. I have research experience in multivariate information theory and equivariant deep learning and recently got very interested into AI alignment. https://​​langleon.github.io/​​

[Paper Blog­post] When Your AIs De­ceive You: Challenges with Par­tial Ob­serv­abil­ity in RLHF

Leon Lang22 Oct 2024 13:57 UTC
47 points
0 comments18 min readLW link
(arxiv.org)