Edoardo Pona

Karma: 47

I am interested in Alignment, Mechanistic Interpretability, Agents, and the theory of how neural networks work.

Thinking About Propensity Evaluations

Aug 19, 2024, 9:23 AM

10 points

Aug 19, 2024, 9:07 AM

13 points

Edoardo Pona

May 16, 2023, 7:24 AM

21 points