Maybe others are using it in secret but don’t want to admit it for some reason? I can’t find any mention of Anthropic having filed a patent on the idea, but maybe other companies would feel too much like it looked like they were second-rate imitators if they said they were copying Anthropic’s idea?
Just speculating, I don’t know. Sure seems like a useful idea to copy.
Maybe others are using it in secret but don’t want to admit it for some reason? I can’t find any mention of Anthropic having filed a patent on the idea, but maybe other companies would feel too much like it looked like they were second-rate imitators if they said they were copying Anthropic’s idea?
Just speculating, I don’t know. Sure seems like a useful idea to copy.
AI companies don’t seem to be shy about copying RLHF though. Llama, Gemini, and Grok are all explicitly labeled as using RLHF.