Gerald Monroe comments on ChatGPT (and now GPT4) is very easily distracted from its rules

Gerald Monroe 17 Mar 2023 14:23 UTC
2 points
1
Philosophically what you are saying makes sense.

Keep in mind that currently gpt-4 is using the open agency/CAIS method of alignment. The only thing that matters is the output. So it doesn’t matter yet.

Also keep in mind philosophy doesn’t matter—we can just try it multiple ways and judge based on the data. Well, normally we could—in this case the millions of dollars a training run makes that currently infeasible.