Zach Stein-Perlman comments on Zach Stein-Perlman’s Shortform

Zach Stein-Perlman 6 Sep 2024 20:11 UTC
4 points
2
1. I tentatively think this is a high-priority ask
2. Capabilities research isn’t a monolith and improving capabilities without increasing spooky black-box reasoning seems pretty fine
3. If you’re right, I think the upshot is (a) Anthropic should figure out whether to publish stuff rather than let it languish and/or (b) it would be better for lots of Anthropic safety researchers to instead do research that’s safe to share (rather than research that only has value if Anthropic wins the race)