This is a really interesting interview with lots of great ideas. Thanks for taking notes on this!
The only point I don’t really agree with is the idea that Redwood Research, Anthropic, and ARC are correlated. Although they are all in the same geographic area, they seem to be working on fairly different projects to me:
Redwood Research: controlling the output of language models.
Anthropic: deep transformer interpretability work.
This is a really interesting interview with lots of great ideas. Thanks for taking notes on this!
The only point I don’t really agree with is the idea that Redwood Research, Anthropic, and ARC are correlated. Although they are all in the same geographic area, they seem to be working on fairly different projects to me:
Redwood Research: controlling the output of language models.
Anthropic: deep transformer interpretability work.
ARC: theoretical alignment research (e.g. ELK).