Paul Christiano published his and Buck Shlegeris’ implementation at https://github.com/paulfchristiano/amplification. It’s the code behind the article Supervising strong learners by amplifying weak experts.
With William Saunders’ permission, I published a version modified by him and later me: https://github.com/rmoehn/amplification This one has changes and more documentation that allow you to run it almost out of the box.
Usable implementation of IDA available
Paul Christiano published his and Buck Shlegeris’ implementation at https://github.com/paulfchristiano/amplification. It’s the code behind the article Supervising strong learners by amplifying weak experts.
With William Saunders’ permission, I published a version modified by him and later me: https://github.com/rmoehn/amplification This one has changes and more documentation that allow you to run it almost out of the box.