Honesty

TagLast edit: Mar 3, 2021, 4:47 PM by Yoav Ravid

Honesty means telling the truth and not being deceptive.

External Links:
Against Lie Inflation by Scott Alexander

Related Pages: Meta-Honesty, Deception.

Notes on Honesty

David GrossOct 28, 2020, 12:54 AM

46 points

6 comments20 min readLW link

Deep Honesty

AletheophileMay 7, 2024, 8:31 PM

158 points

25 comments9 min readLW link

Meta-Honesty: Firming Up Honesty Around Its Edge-Cases

Eliezer YudkowskyMay 29, 2018, 12:59 AM

142 points

155 comments27 min readLW link 4 reviews

Speaking Truth to Power Is a Schelling Point

Zack_M_DavisDec 30, 2019, 6:12 AM

52 points

19 comments2 min readLW link

Honesty: Beyond Internal Truth

Eliezer YudkowskyJun 6, 2009, 2:59 AM

67 points

87 comments4 min readLW link

Assume Bad Faith

Zack_M_DavisAug 25, 2023, 5:36 PM

150 points

63 comments7 min readLW link 3 reviews

“PR” is corrosive; “reputation” is not.

AnnaSalamonFeb 14, 2021, 3:32 AM

322 points

95 comments2 min readLW link 3 reviews

The Forces of Blandness and the Disagreeable Majority

sarahconstantinApr 28, 2019, 7:44 PM

132 points

27 comments3 min readLW link 2 reviews

(srconstantin.wordpress.com)

Truthful LMs as a warm-up for aligned AGI

Jacob_HiltonJan 17, 2022, 4:49 PM

65 points

14 comments13 min readLW link

Firming Up Not-Lying Around Its Edge-Cases Is Less Broadly Useful Than One Might Initially Think

Zack_M_DavisDec 27, 2019, 5:09 AM

128 points

43 comments8 min readLW link 2 reviews

On Bounded Distrust

ZviFeb 3, 2022, 2:50 PM

137 points

19 comments56 min readLW link 1 review

(thezvi.wordpress.com)

How do new models from OpenAI, DeepMind and Anthropic perform on TruthfulQA?

Owain_EvansFeb 26, 2022, 12:46 PM

44 points

3 comments11 min readLW link

Paper: Teaching GPT3 to express uncertainty in words

Owain_EvansMay 31, 2022, 1:27 PM

97 points

7 comments4 min readLW link

Marriage, the Giving What We Can Pledge, and the damage caused by vague public commitments

Jeffrey LadishJul 11, 2022, 7:38 PM

98 points

27 comments6 min readLW link 1 review

Optimized Propaganda with Bayesian Networks: Comment on “Articulating Lay Theories Through Graphical Models”

Zack_M_DavisJun 29, 2020, 2:45 AM

105 points

10 comments4 min readLW link

Maybe Lying Can’t Exist?!

Zack_M_DavisAug 23, 2020, 12:36 AM

58 points

16 comments5 min readLW link

“Desperate Honesty” by Agnes Callard

David GrossAug 1, 2023, 1:34 PM

11 points

0 comments2 min readLW link

(dailynous.com)

Argue Politics* With Your Best Friends

sarahconstantinDec 15, 2018, 7:00 PM

75 points

6 comments6 min readLW link

(srconstantin.wordpress.com)

“Status” can be corrosive; here’s how I handle it

Orpheus16Jan 24, 2023, 1:25 AM

71 points

8 comments6 min readLW link

[Question] How “honest” is GPT-3?

abramdemskiJul 8, 2020, 7:38 PM

72 points

18 comments5 min readLW link

Honest Friends Don’t Tell Comforting Lies

Serpent-StareApr 19, 2018, 4:34 PM

21 points

11 comments5 min readLW link

Notes on Sincerity and such

David GrossDec 1, 2020, 5:09 AM

9 points

2 comments10 min readLW link

Radical Honesty

Eliezer YudkowskySep 10, 2007, 6:09 AM

43 points

37 comments2 min readLW link

Degrees of Radical Honesty

MBlumeMar 31, 2009, 8:36 PM

34 points

51 comments3 min readLW link

Integrity and accountability are core parts of rationality

habrykaJul 15, 2019, 8:22 PM

169 points

68 comments6 min readLW link 1 review

How to Corner Liars: A Miasma-Clearing Protocol

ymeskhoutFeb 27, 2025, 5:18 PM

60 points

23 comments7 min readLW link

(www.ymeskhout.com)

The Good Try Rule

DirectedEvolutionDec 27, 2020, 2:38 AM

56 points

4 comments4 min readLW link

Lying is Cowardice, not Strategy

Connor Leahy and Gabriel Alfour

Oct 24, 2023, 1:24 PM

29 points

73 comments5 min readLW link

(cognition.cafe)

Communication Requires Common Interests or Differential Signal Costs

Zack_M_DavisMar 26, 2021, 6:41 AM

40 points

13 comments3 min readLW link 1 review

Maybe Lying Doesn’t Exist

Zack_M_DavisOct 14, 2019, 7:04 AM

70 points

59 comments8 min readLW link

[Question] How to build common knowledge of rationality and honesty?

MikkWFeb 21, 2021, 6:07 AM

5 points

3 comments1 min readLW link

Neo-Mohism

Bae's TheoremJun 16, 2021, 9:57 PM

5 points

11 comments7 min readLW link

Truthful AI: Developing and governing AI that does not lie

Owain_Evans, owencb and Lukas Finnveden

Oct 18, 2021, 6:37 PM

82 points

9 comments10 min readLW link

Layers Of Mind

PeteGOct 4, 2022, 4:52 PM

−8 points

4 comments2 min readLW link

Glomarization FAQ

ZaneNov 15, 2023, 8:20 PM

33 points

5 comments5 min readLW link

How “Discovering Latent Knowledge in Language Models Without Supervision” Fits Into a Broader Alignment Scheme

CollinDec 15, 2022, 6:22 PM

244 points

39 comments16 min readLW link 1 review

Honesty, Openness, Trustworthiness, and Secrets

NormanPerlmutterMar 6, 2023, 9:03 AM

13 points

0 comments9 min readLW link

Five Reasons to Lie

DzoldzayaJan 17, 2023, 4:53 PM

0 points

19 comments3 min readLW link

How to find cool things in a new place

Sam F. BrownJan 24, 2023, 11:20 AM

12 points

0 comments1 min readLW link

[RFC] Possible ways to expand on “Discovering Latent Knowledge in Language Models Without Supervision”.

gekaklam, Walter Laurito , Kaarel and Kay Kozaronek

Jan 25, 2023, 7:03 PM

48 points

6 comments12 min readLW link

Discussion: Was SBF a naive utilitarian, or a sociopath?

Nicholas / Heather KrossNov 17, 2022, 2:52 AM

0 points

4 comments1 min readLW link

Control Vectors as Dispositional Traits

Gianluca CalcagniJun 23, 2024, 9:34 PM

10 points

0 comments11 min readLW link

Truth is Universal: Robust Detection of Lies in LLMs

Lennart BuergerJul 19, 2024, 2:07 PM

24 points

3 comments2 min readLW link

(arxiv.org)

On Intentionality, or: Towards a More Inclusive Concept of Lying

Cornelius DybdahlOct 18, 2024, 10:37 AM

8 points

0 comments4 min readLW link

Tall Tales at Different Scales: Evaluating Scaling Trends For Deception In Language Models

Felix Hofstätter, Francis Rhys Ward, HarrietW, LAThomson, Ollie J, Patrik Bartak and Sam F. Brown

Nov 8, 2023, 11:37 AM

49 points

0 comments18 min readLW link

The Jordan Peterson Mask

Jacob FalkovichMar 3, 2018, 7:49 PM

61 points

154 comments12 min readLW link

Civility Is Never Neutral

ozymandiasNov 22, 2017, 4:54 PM

57 points

15 comments4 min readLW link

The Importance of Saying “Oops”

Eliezer YudkowskyAug 5, 2007, 3:17 AM

268 points

35 comments2 min readLW link

How to parent more predictably

jefftkJul 10, 2018, 3:18 PM

78 points

1 comment4 min readLW link

Individual Deniability, Statistical Honesty

AlicornAug 9, 2011, 4:17 AM

62 points

8 comments1 min readLW link

White Lies

ChrisHallquistFeb 8, 2014, 1:20 AM

60 points

903 comments5 min readLW link

Hufflepuff Cynicism

abramdemskiFeb 13, 2018, 2:15 AM

25 points

17 comments6 min readLW link

Contrast Pairs Drive the Empirical Performance of Contrast Consistent Search (CCS)

Scott EmmonsMay 31, 2023, 5:09 PM

97 points

1 comment6 min readLW link 1 review

Speaking up publicly is heroic

jefftkNov 2, 2019, 12:00 PM

44 points

2 comments1 min readLW link

(www.jefftk.com)

Protected From Myself

Eliezer YudkowskyOct 19, 2008, 12:09 AM

48 points

30 comments6 min readLW link

Avoiding Selection Bias

the gears to ascensionOct 4, 2017, 7:10 PM

20 points

17 comments1 min readLW link

Ground-Truth Label Imbalance Impairs the Performance of Contrast-Consistent Search (and Other Contrast-Pair-Based Unsupervised Methods)

Tom Angsten and Ami Hays

Aug 5, 2023, 5:55 PM

6 points

2 comments7 min readLW link

(drive.google.com)

Ethics Notes

Eliezer YudkowskyOct 21, 2008, 9:57 PM

20 points

46 comments11 min readLW link

You don’t need Kant

Apr 1, 2009, 6:09 PM

2 points

59 comments5 min readLW link

Lies and Secrets

steven0461Mar 8, 2009, 2:43 PM

19 points

21 comments2 min readLW link

Declare your signaling and hidden agendas

Kaj_SotalaApr 13, 2009, 12:01 PM

25 points

21 comments3 min readLW link

Toxic Truth

MichaelHowardApr 11, 2009, 11:25 AM

16 points

31 comments1 min readLW link

Discovering Latent Knowledge in the Human Brain: Part 1 – Clarifying the concepts of belief and knowledge

Joseph EmersonOct 15, 2023, 9:02 AM

5 points

0 comments12 min readLW link

parenting rules

Dave OrrDec 21, 2020, 7:48 PM

156 points

9 comments5 min readLW link

No comments.

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer