Buck comments on The case for becoming a black-box investigator of language models

Buck 13 May 2022 16:00 UTC
LW: 2 AF: 1
AF
Yeah I think things like this are reasonable. I think that these are maybe too hard and high-level for a lot of the things I care about—I’m really interested in questions like “how much less reliable is the model about repeating names when the names are 100 tokens in the past instead of 50”, which are much simpler and lower level.