Gurkenglas comments on Speculations against GPT-n writing alignment papers

Gurkenglas 8 Jun 2021 12:59 UTC
2 points
Take into account that the AI doing the interpreting need not be the same as the network being interpreted.
Why do you think that a mere autocomplete engine could not do interpretability work? Such an engine has already been shown to write comments for code and code for specs.