Also, it looks like we are getting AIs that are easy to make corrigible, and thus align them iteratively to DWIM goals, but that the AI models can’t be released to the public without restrictions, because it will still be able to be highly misused.
Also, it looks like we are getting AIs that are easy to make corrigible, and thus align them iteratively to DWIM goals, but that the AI models can’t be released to the public without restrictions, because it will still be able to be highly misused.