David Scott Krueger (formerly: capybaralet) comments on capybaralet’s Shortform

David Scott Krueger (formerly: capybaralet) 16 Sep 2020 9:22 UTC
5 points
As alignment techniques improve, they’ll get good enough to solve new tasks before they get good enough to do so safely. This is a source of x-risk.