Agreed, though of course as always, there is the issue that that’s an intentional-stance way to describe what a language model does: “they will generally try to continue it in that style.” Hence mechinterp, which tries to (heh) move to a mechanical stance, which will likely be something like “when you give them a [whatever] text to continue, it will match [some list of features], which will then activate [some part of the network that we will name later], which implements the style that matches those features”.
(incidentally, I think there’s some degree to which people who strongly believe that artificial NNs are alien shoggoths are underestimating the degree to which their own brains are also alien shoggoths. but that doesn’t make it a good model of either thing. the only reason it was ever an improvement over a previous word was when people had even more misleading intuitive-sketch models.)
Agreed, though of course as always, there is the issue that that’s an intentional-stance way to describe what a language model does: “they will generally try to continue it in that style.” Hence mechinterp, which tries to (heh) move to a mechanical stance, which will likely be something like “when you give them a [whatever] text to continue, it will match [some list of features], which will then activate [some part of the network that we will name later], which implements the style that matches those features”.
(incidentally, I think there’s some degree to which people who strongly believe that artificial NNs are alien shoggoths are underestimating the degree to which their own brains are also alien shoggoths. but that doesn’t make it a good model of either thing. the only reason it was ever an improvement over a previous word was when people had even more misleading intuitive-sketch models.)