Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
EA comments on
Using GPT-Eliezer against ChatGPT Jailbreaking
EA
10 Dec 2022 13:39 UTC
1
point
0
This might work a bit better:
e.g., the following confused the previous version (which didn’t allow the benign answer):
but
Back to top
This might work a bit better:
e.g., the following confused the previous version (which didn’t allow the benign answer):
but