Ajeya Cotra comments on Without specific countermeasures, the easiest path to transformative AI likely leads to AI takeover

Ajeya Cotra 22 Jul 2022 15:37 UTC
3 points
0
I mean things like tricks to improve the sample efficiency of human feedback, doing more projects that are un-enhanced RLHF to learn things about how un-enhanced RLHF works, etc.