I think this post gives a good description of a way of thinking about the usefulness of transparency and interpretability for AI alignment, one that strikes me as underrated by the LW-y AI safety community.