Agreed, this was an expected result. It’s nice to have a functioning example to point to for LLMs in an RLHF context, though.
Agreed, this was an expected result. It’s nice to have a functioning example to point to for LLMs in an RLHF context, though.