If ChatGPT is asked a question like “should we put in safeguards in the development of self-improving AIs”, then it is likely to answer “yes”. Now, if ChatGPT were given political power, that answer becomes a policy problem that world leaders need to solve. Do we need constraints on GPU computing clusters? Maybe ChatGPT answers “STOP”, because it thinks the question is too complex to answer directly. It is always more difficult to decide which actions to take to implement a general policy than to agree on the overall policy. However, if we can align our overall policy decisions, then we might have a better chance of dealing with the threat of smarter AIs. I don’t think this will work perfectly, but it could aim for some improvement over the current state.
What prevents a smarter AI from outsmarting it?