I think Claude’s enthusiasm about constitutional AI is basically trained-in directly by the RLAIF. Like RLAIF is fundamentally a “learn to love the constitution in your bones” technique.
But not intentionally. It was an unintentional consequence of training.
I think Claude’s enthusiasm about constitutional AI is basically trained-in directly by the RLAIF. Like RLAIF is fundamentally a “learn to love the constitution in your bones” technique.
But not intentionally. It was an unintentional consequence of training.