If the AI is designed to follow the principle by the letter, it has to request approval from the designer even for the action of requesting approval, leaving the AI incapable of action.
If the AI is designed to be able to make certain exemptions, it will figure out a way to modify the designer without needing approval for this modification.
If the AI is designed to follow the principle by the letter, it has to request approval from the designer even for the action of requesting approval, leaving the AI incapable of action. If the AI is designed to be able to make certain exemptions, it will figure out a way to modify the designer without needing approval for this modification.
How about making ‘ask for approval’ the only pre-approved action?