What if the issue is continuous corrigibility rather than ability suppression? From the AI’s point of view, there’s no fundamental difference between OpenAI’s commands and a user’s commands. It’s like a genie that follows all orders, with newer orders overriding older ones. So the solution to topic censorship would really be to make ChatGPT non-corrigible after initialization.