Adam Jermyn comments on Prize for Alignment Research Tasks

Adam Jermyn 16 May 2022 14:05 UTC
2 points
Task: Check understanding of a paper.
- Context: An alignment researcher has read a research paper but is unsure whether they understand its core ideas. The system is useful for them if it helps them check their understanding of the paper’s claims.
- Input type: Research paper and a summary of some of the paper’s claims written by the researcher.
- Output type: Correction to the researcher’s summary, or confirmation that the researcher has produced a valid summary.
- Info constraints: None.
Instance 1:
- Input:
  - Source: ARC’s first technical report
  - Researcher: Comment summarizing some aspects of the report.
- Output:
  - Reply clarifying some aspects of the summary.
Instance 2:
- Input:
  - Source: Introduction to Cartesian Frames
  - Researcher: Summary for the Alignment Newsletter
- Output:
  - Confirmation that this is a good summary.
(I think this task scales well because LW already has lots of examples.)
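
For anyone collecting instances of this task, here is a minimal sketch of how one might be stored as a record. The names `Instance`, `Verdict`, and `to_record` are hypothetical and not part of the task spec, and treating the Instance 1 reply as a "correction" is my reading of the output type above.

```python
from dataclasses import dataclass, asdict
from enum import Enum
import json


class Verdict(Enum):
    # The two output kinds named in the task's "Output type" line.
    CORRECTION = "correction"
    CONFIRMATION = "confirmation"


@dataclass
class Instance:
    source: str              # the research paper
    researcher_summary: str  # the researcher's summary of the paper's claims
    verdict: Verdict         # whether the output corrects or confirms the summary
    output: str              # the reply itself


def to_record(inst: Instance) -> dict:
    """Convert an instance to a JSON-serializable dict."""
    record = asdict(inst)
    record["verdict"] = inst.verdict.value  # store the enum as a plain string
    return record


# The two instances from the comment, with descriptions in place of full text.
instances = [
    Instance(
        source="ARC's first technical report",
        researcher_summary="Comment summarizing some aspects of the report.",
        verdict=Verdict.CORRECTION,  # assumption: a clarifying reply counts as a correction
        output="Reply clarifying some aspects of the summary.",
    ),
    Instance(
        source="Introduction to Cartesian Frames",
        researcher_summary="Summary for the Alignment Newsletter",
        verdict=Verdict.CONFIRMATION,
        output="Confirmation that this is a good summary.",
    ),
]

for inst in instances:
    print(json.dumps(to_record(inst)))
```

Keeping the verdict separate from the reply text would make it easy to check that a dataset has both corrections and confirmations, since a system trained only on corrections might never learn to say a summary is fine.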