It’s true that gazing at the cosmos has a history of leading to important discoveries, but mechanistic interpretability isn’t gazing at the cosmos, it’s gazing at the weights of the neural network.
It’s true that gazing at the cosmos has a history of leading to important discoveries, but mechanistic interpretability isn’t gazing at the cosmos, it’s gazing at the weights of the neural network.