Esben Kran comments on Is there a list of projects to get started with Interpretability?

Esben Kran 8 Sep 2022 13:55 UTC
2 points
There are a few interpretability ideas on aisafetyideas.com, for example in the mechanistic interpretability list, the blackbox investigations list, and the automating auditing list.