DanielFilan comments on EIS II: What is “Interpretability”?

DanielFilan 28 Mar 2023 23:21 UTC
LW: 4 AF: 3
2
AF
I guess this proves the superiority of the mechanistic interpretability technique “note that it is mechanistically possible for your model to say that things are gorillas” :P