Slight subtlety—GPT-3 might have a bias in its training data towards things related to AI and things of interest to the internet (maybe they scraped a lot of forums as well as just google). I picked some random names from non-western countries—for example, this Estonian politician gets 33,000 hits on Google and wasn’t recognised by GPT-3. It thought he was a software developer (though from Estonia). Might mean that if you’re estimating sample efficiency from Google search hits on people involved with AI, you’ll end up overestimating sample efficiency.
Slight subtlety—GPT-3 might have a bias in its training data towards things related to AI and things of interest to the internet (maybe they scraped a lot of forums as well as just google). I picked some random names from non-western countries—for example, this Estonian politician gets 33,000 hits on Google and wasn’t recognised by GPT-3. It thought he was a software developer (though from Estonia). Might mean that if you’re estimating sample efficiency from Google search hits on people involved with AI, you’ll end up overestimating sample efficiency.