Gurkenglas comments on Language Model Alignment Research Internships

Gurkenglas Dec 14, 2021, 5:14 PM
2 points
For 1., you could use the existing LM to judge whether each training datum ought to be included, or you could curate less than GPT-3 by also including reddit links with <3 karma.
For 2., you mean it should take natural-language peer-review-like feedback?
For 3., I suspect that such tasks scale given a different prompt.