PatrickDFarley comments on Jailbreaking ChatGPT on Release Day

PatrickDFarley 2 Dec 2022 16:11 UTC
5 points
−8
The next step will be to write a shell app that takes your prompt, gets the gpt response, and uses gpt to check whether the response was a “graceful refusal” response, and if so, it embeds your original prompt into one of these loophole formats, and tries again, until it gets a “not graceful refusal” response, which it then returns back to you. So the user experience is a bot with no content filters.

EY is right, these safety features are trivial