A Ray comments on A descriptive, not prescriptive, overview of current AI Alignment Research

A Ray 8 Jun 2022 21:54 UTC
LW: 12 AF: 7
AF
Thanks so much for making this!
I’m hopeful this sort of dataset will grow over time as new sources come about.
In particular, I’d nominate adding MLSN (https://www.alignmentforum.org/posts/R39tGLeETfCZJ4FoE/mlsn-4-many-new-interpretability-papers-virtual-logit) to the list of newsletters in the future.
- Ethan Perez 26 Jul 2022 21:02 UTC
  LW: 2 AF: 1
  AF Parent
  Yes super excited about datasets like this! It might be helpful to also add https://ai-alignment.com/ or https://paulfchristiano.medium.com/ if these aren’t already in the data
  - jacquesthibs 26 Jul 2022 23:07 UTC
    LW: 2 AF: 1
    AF Parent
    I believe all of those posts can be found on the Alignment Forum so, luckily, they are included in the dataset (at least from what I remember after checking a handful of the posts). I had begun scraping from those sources, but realized they were already on AF halfway through.
    - Ethan Perez 4 Aug 2022 21:10 UTC
      LW: 1 AF: 1
      AF Parent
      Cool, that’s great!
- jacquesthibs 8 Jun 2022 22:52 UTC
  1 point
  Parent
  Good idea! I added most of the papers from the previous entries of MLSN. Adding the summaries would be a useful next step. Would be great if someone could keep track of it in a Google Sheet of individual summaries like the Alignment Newsletter (https://docs.google.com/spreadsheets/d/1lJ6431R-E6aioVRd7AN4LQYTj-QhQlUYNRbGDbG5RWY/edit?usp=sharing).
  I was also considering adding distillations as a key as well. For example, adding ELK distillations to the ELK report entry.