What they’re saying is I got a semi-objective answer fast.
If they’d googled for the answer all the same concerns would apply. You’d need to know the biases of whoever wrote the web content they read to get an answer.
I doubt the orgs got much of their own bias into the RLHF/RLAIF process. There are real cultural biases from the humans doing the RLHF annotation, and from the LLM itself via its training set and how it interpreted its constitution.
Exactly. Please stop saying this. It’s not semi-objective. The trend of casually treating LLMs as an arbiter of truth leads to moral decay.
This is obviously untrue, orgs spend lots of effort making sure their AI doesn’t say things that would give them bad press for example.
I should’ve specified—the orgs carefully train them to refuse to say things. I don’t think they specifically train them to say things the orgs like or believe. The refusals are intentional; the bias is accidental IMO.
And every source has bias.
So, do you want people to quit saying they googled for an answer? I’d just like them to say where they got the answer so I can judge how biased it might be.