LW server reports: not allowed.
This probably means the post has been deleted or moved back to the author's drafts.
I’d rather say that RLHF+’ed chatbots are upon-reflection-not-so-shockingly sycophantic, since they have been trained to satisfy their conversational partner.
I’d rather say that RLHF+’ed chatbots are upon-reflection-not-so-shockingly sycophantic, since they have been trained to satisfy their conversational partner.