Yeah for sure!
For PII—A relatively recent survey paper: https://arxiv.org/pdf/2403.05156
For pii/memorization generally:
Lab’s LLM safety section typically has a pii/memorization section
For demographics inference:
For bias/fairness—survey paper: https://arxiv.org/pdf/2309.00770
This is probably far from complete, but I think the references in the survey paper, and in the Staab et al. paper should have some additional good ones as well.
It’s probably less on all internet but more on the rlhf guidelines (I imagine the human reviewers receive a guideline based on the LLM-training company’s policy, legal, and safety experts’ advice). I don’t disagree though that it could present a relatively more objective view on some topics than a particular individual (depending on the definition of bias).