RSS

Honesty

TagLast edit: Mar 3, 2021, 4:47 PM by Yoav Ravid

Honesty means telling the truth and not being deceptive.

External Links:
Against Lie Inflation by Scott Alexander

Related Pages: Meta-Honesty, Deception.

Notes on Honesty

David GrossOct 28, 2020, 12:54 AM
46 points
6 comments20 min readLW link

Deep Honesty

AletheophileMay 7, 2024, 8:31 PM
158 points
25 comments9 min readLW link

Meta-Hon­esty: Firm­ing Up Hon­esty Around Its Edge-Cases

Eliezer YudkowskyMay 29, 2018, 12:59 AM
142 points
155 comments27 min readLW link4 reviews

Speak­ing Truth to Power Is a Schel­ling Point

Zack_M_DavisDec 30, 2019, 6:12 AM
52 points
19 comments2 min readLW link

Hon­esty: Beyond In­ter­nal Truth

Eliezer YudkowskyJun 6, 2009, 2:59 AM
67 points
87 comments4 min readLW link

As­sume Bad Faith

Zack_M_DavisAug 25, 2023, 5:36 PM
152 points
63 comments7 min readLW link3 reviews

“PR” is cor­ro­sive; “rep­u­ta­tion” is not.

AnnaSalamonFeb 14, 2021, 3:32 AM
321 points
95 comments2 min readLW link3 reviews

The Forces of Bland­ness and the Disagree­able Majority

sarahconstantinApr 28, 2019, 7:44 PM
132 points
27 comments3 min readLW link2 reviews
(srconstantin.wordpress.com)

Truth­ful LMs as a warm-up for al­igned AGI

Jacob_HiltonJan 17, 2022, 4:49 PM
65 points
14 comments13 min readLW link

Firm­ing Up Not-Ly­ing Around Its Edge-Cases Is Less Broadly Use­ful Than One Might Ini­tially Think

Zack_M_DavisDec 27, 2019, 5:09 AM
127 points
43 comments8 min readLW link2 reviews

On Bounded Distrust

ZviFeb 3, 2022, 2:50 PM
135 points
19 comments56 min readLW link1 review
(thezvi.wordpress.com)

How do new mod­els from OpenAI, Deep­Mind and An­thropic perform on Truth­fulQA?

Owain_EvansFeb 26, 2022, 12:46 PM
44 points
3 comments11 min readLW link

Paper: Teach­ing GPT3 to ex­press un­cer­tainty in words

Owain_EvansMay 31, 2022, 1:27 PM
97 points
7 comments4 min readLW link

Mar­riage, the Giv­ing What We Can Pledge, and the dam­age caused by vague pub­lic commitments

Jeffrey LadishJul 11, 2022, 7:38 PM
98 points
27 comments6 min readLW link1 review

Op­ti­mized Pro­pa­ganda with Bayesian Net­works: Com­ment on “Ar­tic­u­lat­ing Lay The­o­ries Through Graph­i­cal Models”

Zack_M_DavisJun 29, 2020, 2:45 AM
105 points
10 comments4 min readLW link

Maybe Ly­ing Can’t Ex­ist?!

Zack_M_DavisAug 23, 2020, 12:36 AM
58 points
16 comments5 min readLW link

“Des­per­ate Hon­esty” by Agnes Callard

David GrossAug 1, 2023, 1:34 PM
11 points
0 comments2 min readLW link
(dailynous.com)

Ar­gue Poli­tics* With Your Best Friends

sarahconstantinDec 15, 2018, 7:00 PM
75 points
6 comments6 min readLW link
(srconstantin.wordpress.com)

“Sta­tus” can be cor­ro­sive; here’s how I han­dle it

AkashJan 24, 2023, 1:25 AM
71 points
8 comments6 min readLW link

[Question] How “hon­est” is GPT-3?

abramdemskiJul 8, 2020, 7:38 PM
72 points
18 comments5 min readLW link

Hon­est Friends Don’t Tell Com­fort­ing Lies

Serpent-StareApr 19, 2018, 4:34 PM
21 points
11 comments5 min readLW link

Notes on Sincer­ity and such

David GrossDec 1, 2020, 5:09 AM
9 points
2 comments10 min readLW link

Rad­i­cal Honesty

Eliezer YudkowskySep 10, 2007, 6:09 AM
42 points
37 comments2 min readLW link

De­grees of Rad­i­cal Honesty

MBlumeMar 31, 2009, 8:36 PM
34 points
51 comments3 min readLW link

In­tegrity and ac­countabil­ity are core parts of rationality

habrykaJul 15, 2019, 8:22 PM
169 points
68 comments6 min readLW link1 review

How to Corner Liars: A Mi­asma-Clear­ing Protocol

ymeskhoutFeb 27, 2025, 5:18 PM
59 points
23 comments7 min readLW link
(www.ymeskhout.com)

The Good Try Rule

DirectedEvolutionDec 27, 2020, 2:38 AM
56 points
4 comments4 min readLW link

Ly­ing is Cowardice, not Strategy

Oct 24, 2023, 1:24 PM
31 points
73 comments5 min readLW link
(cognition.cafe)

Com­mu­ni­ca­tion Re­quires Com­mon In­ter­ests or Differ­en­tial Sig­nal Costs

Zack_M_DavisMar 26, 2021, 6:41 AM
40 points
13 comments3 min readLW link1 review

Maybe Ly­ing Doesn’t Exist

Zack_M_DavisOct 14, 2019, 7:04 AM
70 points
59 comments8 min readLW link

[Question] How to build com­mon knowl­edge of ra­tio­nal­ity and hon­esty?

MikkWFeb 21, 2021, 6:07 AM
5 points
3 comments1 min readLW link

Neo-Mohism

Bae's TheoremJun 16, 2021, 9:57 PM
5 points
11 comments7 min readLW link

Truth­ful AI: Devel­op­ing and gov­ern­ing AI that does not lie

Oct 18, 2021, 6:37 PM
82 points
9 comments10 min readLW link

Lay­ers Of Mind

PeteGOct 4, 2022, 4:52 PM
−8 points
4 comments2 min readLW link

Glo­ma­riza­tion FAQ

ZaneNov 15, 2023, 8:20 PM
33 points
5 comments5 min readLW link

How “Dis­cov­er­ing La­tent Knowl­edge in Lan­guage Models Without Su­per­vi­sion” Fits Into a Broader Align­ment Scheme

CollinDec 15, 2022, 6:22 PM
244 points
39 comments16 min readLW link1 review

Hon­esty, Open­ness, Trust­wor­thi­ness, and Secrets

NormanPerlmutterMar 6, 2023, 9:03 AM
13 points
0 comments9 min readLW link

Five Rea­sons to Lie

DzoldzayaJan 17, 2023, 4:53 PM
0 points
19 comments3 min readLW link

How to find cool things in a new place

Sam F. BrownJan 24, 2023, 11:20 AM
12 points
0 comments1 min readLW link

[RFC] Pos­si­ble ways to ex­pand on “Dis­cov­er­ing La­tent Knowl­edge in Lan­guage Models Without Su­per­vi­sion”.

Jan 25, 2023, 7:03 PM
48 points
6 comments12 min readLW link

Dis­cus­sion: Was SBF a naive util­i­tar­ian, or a so­ciopath?

Nicholas / Heather KrossNov 17, 2022, 2:52 AM
0 points
4 comments1 min readLW link

Con­trol Vec­tors as Dis­po­si­tional Traits

Gianluca CalcagniJun 23, 2024, 9:34 PM
10 points
0 comments11 min readLW link

Truth is Univer­sal: Ro­bust De­tec­tion of Lies in LLMs

Lennart BuergerJul 19, 2024, 2:07 PM
24 points
3 comments2 min readLW link
(arxiv.org)

On In­ten­tion­al­ity, or: Towards a More In­clu­sive Con­cept of Lying

Cornelius DybdahlOct 18, 2024, 10:37 AM
8 points
0 comments4 min readLW link

Tall Tales at Differ­ent Scales: Eval­u­at­ing Scal­ing Trends For De­cep­tion In Lan­guage Models

Nov 8, 2023, 11:37 AM
49 points
0 comments18 min readLW link

The Jor­dan Peter­son Mask

Jacob FalkovichMar 3, 2018, 7:49 PM
61 points
154 comments12 min readLW link

Ci­vil­ity Is Never Neutral

ozymandiasNov 22, 2017, 4:54 PM
57 points
15 comments4 min readLW link

The Im­por­tance of Say­ing “Oops”

Eliezer YudkowskyAug 5, 2007, 3:17 AM
267 points
35 comments2 min readLW link

How to par­ent more predictably

jefftkJul 10, 2018, 3:18 PM
78 points
1 comment4 min readLW link

In­di­vi­d­ual De­ni­a­bil­ity, Statis­ti­cal Honesty

AlicornAug 9, 2011, 4:17 AM
62 points
8 comments1 min readLW link

White Lies

ChrisHallquistFeb 8, 2014, 1:20 AM
60 points
903 comments5 min readLW link

Hufflepuff Cynicism

abramdemskiFeb 13, 2018, 2:15 AM
25 points
17 comments6 min readLW link

Con­trast Pairs Drive the Em­piri­cal Perfor­mance of Con­trast Con­sis­tent Search (CCS)

Scott EmmonsMay 31, 2023, 5:09 PM
97 points
1 comment6 min readLW link1 review

Speak­ing up pub­li­cly is heroic

jefftkNov 2, 2019, 12:00 PM
44 points
2 comments1 min readLW link
(www.jefftk.com)

Pro­tected From Myself

Eliezer YudkowskyOct 19, 2008, 12:09 AM
48 points
30 comments6 min readLW link

Avoid­ing Selec­tion Bias

the gears to ascensionOct 4, 2017, 7:10 PM
20 points
17 comments1 min readLW link

Ground-Truth La­bel Im­bal­ance Im­pairs the Perfor­mance of Con­trast-Con­sis­tent Search (and Other Con­trast-Pair-Based Un­su­per­vised Meth­ods)

Aug 5, 2023, 5:55 PM
6 points
2 comments7 min readLW link
(drive.google.com)

Ethics Notes

Eliezer YudkowskyOct 21, 2008, 9:57 PM
20 points
46 comments11 min readLW link

You don’t need Kant

Apr 1, 2009, 6:09 PM
2 points
59 comments5 min readLW link

Lies and Secrets

steven0461Mar 8, 2009, 2:43 PM
19 points
21 comments2 min readLW link

De­clare your sig­nal­ing and hid­den agen­das

Kaj_SotalaApr 13, 2009, 12:01 PM
25 points
21 comments3 min readLW link

Toxic Truth

MichaelHowardApr 11, 2009, 11:25 AM
16 points
31 comments1 min readLW link

Dis­cov­er­ing La­tent Knowl­edge in the Hu­man Brain: Part 1 – Clar­ify­ing the con­cepts of be­lief and knowledge

Joseph EmersonOct 15, 2023, 9:02 AM
5 points
0 comments12 min readLW link

par­ent­ing rules

Dave OrrDec 21, 2020, 7:48 PM
156 points
9 comments5 min readLW link
No comments.