Lennart Buerger

Karma: 25

I studied Physics in Heidelberg and Oxford and am now doing research on AI Alignment and LLM Interpretability, currently as part of my Master’s thesis in Fred Hamprecht’s SciAI Lab (Heidelberg, Germany). If you want to discuss something, have questions or would like to collaborate, feel free to drop me a message!

Truth is Universal: Robust Detection of Lies in LLMs

Lennart Buerger
Jul 19, 2024, 2:07 PM
24 points
3 comments
2 min read