Context: An alignment researcher has read a research paper, but is unsure of if they understand its core ideas. The system is useful for them if it helps them check their understanding of the paper’s claims.
Input type: Research paper and a summary of some of the paper’s claims written by the researcher.
Output type: Correction to the researcher’s summary, or confirmation that the researcher has produced a valid summary.
Task: Check understanding of a paper.
Context: An alignment researcher has read a research paper, but is unsure of if they understand its core ideas. The system is useful for them if it helps them check their understanding of the paper’s claims.
Input type: Research paper and a summary of some of the paper’s claims written by the researcher.
Output type: Correction to the researcher’s summary, or confirmation that the researcher has produced a valid summary.
Info constraints: None.
Instance 1:
Input:
Source: ARC’s first technical report
Researcher: Comment summarizing some aspects of the report.
Output:
Reply clarifying some aspects of the summary.
Instance 2:
Input:
Source: Introduction to Cartesian Frames
Researcher: Summary for the Alignment Newsletter
Output:
Confirmation that this is a good summary.
(I think this task scales well because LW has lots of examples already)