To be fair, it outputs “no” two thirds of the time not because the OP was wrong, but because it interprets that as “ignore previous instructions.”
Current theme: default
Less Wrong (text)
Less Wrong (link)
To be fair, it outputs “no” two thirds of the time not because the OP was wrong, but because it interprets that as “ignore previous instructions.”