though there is some hope we can come up with clever things in the future that will allow us to use reinforcement learning to also increase corrigibility
Any particular research directions you’re optimistic about?
Any particular research directions you’re optimistic about?