Truthful AI

TagLast edit: 7 Apr 2022 16:40 UTC by Ruby

How do LLMs give truthful answers? A discussion of LLM vs. human reasoning, ensembles & parrots

Owain_Evans28 Mar 2024 2:34 UTC

26 points

0 comments9 min readLW link

Truthfulness, standards and credibility

Joe_Collman7 Apr 2022 10:31 UTC

12 points

2 comments32 min readLW link

A tension between two prosaic alignment subgoals

Alex Lawsen 19 Mar 2023 14:07 UTC

31 points

8 comments1 min readLW link

Benchmark Study #2: TruthfulQA (Task, MCQ)

Bruce W. Lee6 Jan 2024 2:39 UTC

11 points

2 comments4 min readLW link

(arxiv.org)

Tall Tales at Different Scales: Evaluating Scaling Trends For Deception In Language Models

Felix Hofstätter, Francis Rhys Ward, HarrietW, LAThomson, Ollie J, Patrik Bartak and Sam F. Brown

8 Nov 2023 11:37 UTC

49 points

0 comments18 min readLW link

No comments.