Yeah. Just off the top of my head, OpenAI’s safety group has put some work into language models (e.g.), and there’s a new group called Preamble working on helping recommender systems meet certain desiderata.
Also see the most recent post on LW from Beth Barnes.
Yeah. Just off the top of my head, OpenAI’s safety group has put some work into language models (e.g.), and there’s a new group called Preamble working on helping recommender systems meet certain desiderata.
Also see the most recent post on LW from Beth Barnes.