RSS

Counterfactuals

Tag

Au­dit­ing LMs with coun­ter­fac­tual search: a tool for con­trol and ELK

Jacob PfauFeb 20, 2024, 12:02 AM
28 points
6 comments10 min readLW link

Causal­ity and de­ter­minism in so­cial sci­ence—An in­ves­ti­ga­tion us­ing Pearl’s causal ladder

tailcalledJan 3, 2022, 5:51 PM
12 points
10 comments9 min readLW link

Some thoughts on “The Na­ture of Coun­ter­fac­tu­als”

tailcalledJan 16, 2022, 6:12 PM
20 points
11 comments11 min readLW link

Prob­a­bil­ity The­ory Fun­da­men­tals 102: Ter­ri­tory that Prob­a­bil­ity is in the Map of

Ape in the coatMar 26, 2025, 6:40 AM
10 points
5 comments9 min readLW link

Cir­cu­lar Coun­ter­fac­tu­als “Only that which Hap­pens is Pos­si­ble”

SebastianG Mar 23, 2022, 2:40 PM
4 points
15 comments9 min readLW link

Re­sults: Cir­cu­lar Depen­dency of Coun­ter­fac­tu­als Prize

Chris_LeongApr 5, 2022, 6:29 AM
19 points
0 comments1 min readLW link

Four fac­tors that mod­er­ate the in­ten­sity of emotions

RubyNov 24, 2018, 8:40 PM
63 points
11 comments8 min readLW link

Get­ting Un­stuck on Counterfactuals

Chris_LeongJul 20, 2022, 5:31 AM
7 points
1 comment2 min readLW link

Ap­ply­ing the Coun­ter­fac­tual Pri­soner’s Dilemma to Log­i­cal Uncertainty

Chris_LeongSep 16, 2020, 10:34 AM
9 points
5 comments2 min readLW link

Coun­ter­fac­tu­als from en­sem­bles of peers

David JohnstonJan 4, 2022, 7:01 AM
3 points
4 comments7 min readLW link

The Many Faces of In­fra-Beliefs

DiffractorApr 6, 2021, 10:43 AM
30 points
6 comments63 min readLW link

My Cur­rent Take on Counterfactuals

abramdemskiApr 9, 2021, 5:51 PM
54 points
57 comments25 min readLW link

The Na­ture of Counterfactuals

Chris_LeongJun 5, 2021, 9:18 AM
16 points
18 comments4 min readLW link

Agency and the un­re­li­able au­tonomous car

Alex FlintJul 7, 2021, 2:58 PM
29 points
24 comments10 min readLW link

Coun­ter­fac­tual Contracts

harsimonySep 16, 2021, 3:20 PM
10 points
4 comments9 min readLW link
(harsimony.wordpress.com)

De­ci­sions: On­tolog­i­cally Shift­ing to Determinism

Chris_LeongDec 21, 2022, 12:41 PM
8 points
11 comments6 min readLW link

Coun­ter­fac­tu­als are Con­fus­ing be­cause of an On­tolog­i­cal Shift

Chris_LeongAug 5, 2022, 7:03 PM
17 points
35 comments2 min readLW link

Coun­ter­fac­tu­als for Perfect Predictors

Chris_LeongAug 6, 2018, 12:24 PM
12 points
17 comments6 min readLW link

Coun­ter­fac­tual Induction

DiffractorDec 17, 2019, 5:03 AM
22 points
7 comments6 min readLW link

Coun­ter­fac­tual Re­pro­gram­ming De­ci­sion Theory

lukeprogSep 10, 2012, 1:35 AM
18 points
8 comments1 min readLW link

Coun­ter­fac­tual Calcu­la­tion and Ob­ser­va­tional Knowledge

Vladimir_NesovJan 31, 2011, 4:28 PM
20 points
188 comments1 min readLW link

Coun­ter­fac­tu­als as a mat­ter of So­cial Convention

Chris_LeongNov 30, 2019, 10:35 AM
10 points
4 comments2 min readLW link

Coun­ter­fac­tual trade

owencbMar 9, 2015, 1:23 PM
22 points
19 comments3 min readLW link

Coun­ter­fac­tual Mug­ging and Log­i­cal Uncertainty

Vladimir_NesovSep 5, 2009, 10:31 PM
16 points
21 comments3 min readLW link

Coun­ter­fac­tu­als: Smok­ing Le­sion vs. New­comb’s

Chris_LeongDec 8, 2019, 9:02 PM
9 points
24 comments3 min readLW link

Coun­ter­fac­tu­ally un­in­fluence­able agents

Stuart_ArmstrongJun 2, 2017, 4:17 PM
11 points
0 comments2 min readLW link

Coun­ter­fac­tu­als and re­flec­tive oracles

NisanSep 5, 2018, 8:54 AM
9 points
0 comments6 min readLW link

Coun­ter­fac­tual In­duc­tion (Al­gorithm Sketch, Fix­point proof)

DiffractorDec 17, 2019, 5:04 AM
5 points
2 comments7 min readLW link

[Question] Coun­ter­fac­tual Mug­ging: Why should you pay?

Chris_LeongDec 17, 2019, 10:16 PM
7 points
59 comments3 min readLW link

Coun­ter­fac­tual mug­ging: alien ab­duc­tion edition

EmileSep 28, 2010, 9:25 PM
4 points
18 comments1 min readLW link

Coun­ter­fac­tual In­duc­tion (Lemma 4)

DiffractorDec 17, 2019, 5:05 AM
4 points
0 comments7 min readLW link

Coun­ter­fac­tual do-what-I-mean

Stuart_ArmstrongOct 27, 2016, 1:54 PM
5 points
3 comments1 min readLW link

Coun­ter­fac­tual Mug­ging v. Sub­jec­tive Probability

MBlumeJul 20, 2009, 4:31 PM
4 points
32 comments1 min readLW link

Coun­ter­fac­tu­als on POMDP

Stuart_ArmstrongJun 2, 2017, 4:30 PM
2 points
0 comments2 min readLW link

Coun­ter­fac­tual self-defense

MrMindNov 23, 2012, 10:15 AM
2 points
9 comments1 min readLW link

Coun­ter­fac­tual do-what-I-mean

Stuart_ArmstrongOct 27, 2016, 1:53 PM
0 points
3 comments1 min readLW link

Log­i­cal Coun­ter­fac­tu­als and Propo­si­tion graphs, Part 1

Donald HobsonAug 22, 2019, 10:06 PM
20 points
0 comments3 min readLW link

Log­i­cal Coun­ter­fac­tu­als are low-res

ShmiOct 15, 2018, 3:36 AM
23 points
14 comments1 min readLW link
(donerkebabphilosophy.wordpress.com)

The Coun­ter­fac­tual Pri­soner’s Dilemma

Chris_LeongDec 21, 2019, 1:44 AM
21 points
17 comments3 min readLW link

Log­i­cal Coun­ter­fac­tu­als & the Co­op­er­a­tion Game

Chris_LeongAug 14, 2018, 2:00 PM
16 points
26 comments2 min readLW link

Log­i­cal Coun­ter­fac­tu­als and Propo­si­tion graphs, Part 2

Donald HobsonAug 31, 2019, 8:58 PM
13 points
0 comments3 min readLW link

Can Coun­ter­fac­tu­als Be True?

Eliezer YudkowskyJul 24, 2008, 4:40 AM
33 points
47 comments4 min readLW link

A coun­ter­fac­tual and hy­po­thet­i­cal note on AI safety design

Stuart_ArmstrongMar 11, 2015, 4:20 PM
13 points
1 comment1 min readLW link

Log­i­cal Coun­ter­fac­tu­als and Propo­si­tion graphs, Part 3

Donald HobsonSep 5, 2019, 3:03 PM
6 points
0 comments4 min readLW link

Ex­tremely Coun­ter­fac­tual Mug­ging or: the gist of Trans­par­ent Newcomb

BongoFeb 9, 2011, 3:20 PM
10 points
79 comments1 min readLW link

Prov­abil­ity Coun­ter­fac­tu­als vs Three Ax­ioms of Galles and Pearl

IAFF-User-52Aug 30, 2015, 2:48 AM
6 points
0 comments1 min readLW link
(epsilonofdoom.blogspot.com)

Log­i­cal coun­ter­fac­tu­als for ran­dom algorithms

Vanessa KosoyJan 6, 2016, 1:29 PM
5 points
0 comments10 min readLW link

[LINK] Coun­ter­fac­tual Strategies

StrilancJun 17, 2014, 7:29 PM
5 points
14 comments1 min readLW link

Log­i­cal Coun­ter­fac­tu­als Con­sis­tent Un­der Self-Modification

abramdemskiDec 15, 2015, 6:38 AM
3 points
2 comments8 min readLW link

Log­i­cal coun­ter­fac­tu­als and differ­en­tial privacy

NisanFeb 4, 2018, 12:17 AM
1 point
1 comment5 min readLW link

What makes coun­ter­fac­tu­als com­pa­rable?

Chris_LeongApr 24, 2020, 10:47 PM
11 points
6 comments3 min readLW link

The odd coun­ter­fac­tu­als of play­ing chicken

Benya_FallensteinFeb 2, 2015, 7:15 AM
6 points
0 comments8 min readLW link

Haz­ing as Coun­ter­fac­tual Mug­ging?

SilasBartaOct 11, 2010, 2:17 PM
5 points
8 comments1 min readLW link

Third-per­son counterfactuals

Benya_FallensteinFeb 3, 2015, 1:13 AM
4 points
4 comments6 min readLW link

The many coun­ter­fac­tu­als of coun­ter­fac­tual mugging

Scott GarrabrantApr 12, 2016, 8:04 PM
2 points
3 comments2 min readLW link

Sta­bi­liz­ing log­i­cal coun­ter­fac­tu­als by pseudorandomization

Vanessa KosoyMay 25, 2016, 12:05 PM
1 point
2 comments8 min readLW link

Un-ma­nipu­la­ble counterfactuals

Stuart_ArmstrongFeb 12, 2015, 7:51 PM
1 point
5 comments1 min readLW link

Orthog­o­nal­ity: ac­tion counterfactuals

Stuart_ArmstrongFeb 17, 2015, 9:04 PM
0 points
0 comments1 min readLW link

New­comblike prob­lem: Coun­ter­fac­tual Informant

ClippyApr 12, 2012, 8:25 PM
−3 points
24 comments1 min readLW link

[Question] Would solv­ing log­i­cal coun­ter­fac­tu­als solve an­throp­ics?

Chris_LeongApr 5, 2019, 11:08 AM
20 points
52 comments1 min readLW link

Op­ti­mal and Causal Coun­ter­fac­tual Worlds

Scott GarrabrantMay 12, 2015, 3:16 AM
14 points
4 comments3 min readLW link

Sleep­ing Beauty gets coun­ter­fac­tu­ally mugged

Stuart_ArmstrongMar 26, 2009, 11:44 AM
6 points
34 comments2 min readLW link

Causal graphs and counterfactuals

Stuart_ArmstrongAug 30, 2016, 4:12 PM
7 points
2 comments1 min readLW link

Tran­si­tive ne­go­ti­a­tions with coun­ter­fac­tual agents

Scott GarrabrantOct 20, 2016, 11:27 PM
4 points
0 comments1 min readLW link

Agents de­tect­ing agents: coun­ter­fac­tual ver­sus influence

Stuart_ArmstrongSep 18, 2015, 4:17 PM
5 points
4 comments7 min readLW link

Hu­mans get differ­ent counterfactuals

Stuart_ArmstrongMar 23, 2015, 2:54 PM
4 points
2 comments1 min readLW link

Causal graphs and counterfactuals

Stuart_ArmstrongAug 30, 2016, 4:06 PM
0 points
2 comments1 min readLW link

The Curse Of The Counterfactual

pjebyNov 1, 2019, 6:34 PM
140 points
35 comments19 min readLW link1 review

Two Alter­na­tives to Log­i­cal Counterfactuals

jessicataApr 1, 2020, 9:48 AM
39 points
61 comments5 min readLW link
(unstableontology.com)

Ad­dress­ing three prob­lems with coun­ter­fac­tual cor­rigi­bil­ity: bad bets, defend­ing against back­stops, and over­con­fi­dence.

RyanCareyOct 21, 2018, 12:03 PM
23 points
1 comment6 min readLW link

Stan­dard ML Or­a­cles vs Coun­ter­fac­tual ones

Stuart_ArmstrongOct 10, 2018, 8:01 PM
18 points
5 comments6 min readLW link

An en­vi­ron­ment for study­ing counterfactuals

NisanJul 11, 2018, 12:14 AM
15 points
6 comments3 min readLW link

On the Role of Coun­ter­fac­tu­als in Learning

Max KanwalJul 11, 2018, 2:45 AM
11 points
2 comments3 min readLW link

Does TDT pay in Coun­ter­fac­tual Mug­ging?

BongoNov 29, 2010, 9:31 PM
4 points
5 comments1 min readLW link

You have just been Coun­ter­fac­tu­ally Mugged!

CronoDASAug 19, 2009, 10:24 PM
7 points
25 comments1 min readLW link

[Question] De­ci­sions with Non-Log­i­cal Coun­ter­fac­tu­als: re­quest for input

reavowedOct 24, 2019, 5:23 PM
3 points
11 comments3 min readLW link

[Question] What are some con­crete prob­lems about log­i­cal coun­ter­fac­tu­als?

Chris_LeongDec 16, 2018, 10:20 AM
25 points
4 comments1 min readLW link

I Was Not Al­most Wrong But I Was Al­most Right: Close-Call Coun­ter­fac­tu­als and Bias

Kaj_SotalaMar 8, 2012, 5:39 AM
86 points
40 comments9 min readLW link

Diver­gence on Ev­i­dence Due to Differ­ing Pri­ors—A Poli­ti­cal Case Study

DavidmanheimSep 16, 2019, 11:01 AM
27 points
3 comments3 min readLW link

A use­ful level distinction

Charlie SteinerFeb 24, 2018, 6:39 AM
8 points
4 comments2 min readLW link

JFK was not as­sas­si­nated: prior prob­a­bil­ity zero events

Stuart_ArmstrongApr 27, 2016, 11:47 AM
38 points
38 comments3 min readLW link

Mo­ti­vat­ing a Se­man­tics of Log­i­cal Counterfactuals

Sam_A_BarnettSep 22, 2017, 1:10 AM
22 points
3 comments2 min readLW link

Open Prob­lems Re­gard­ing Coun­ter­fac­tu­als: An In­tro­duc­tion For Beginners

DiffractorJul 18, 2017, 2:21 AM
21 points
6 comments1 min readLW link
(www.overleaf.com)

UDT might not pay a Coun­ter­fac­tual Mugger

winwonceNov 21, 2020, 11:27 PM
5 points
18 comments2 min readLW link

Coun­ter­fac­tual Plan­ning in AGI Systems

Koen.HoltmanFeb 3, 2021, 1:54 PM
10 points
0 comments5 min readLW link

Graph­i­cal World Models, Coun­ter­fac­tu­als, and Ma­chine Learn­ing Agents

Koen.HoltmanFeb 17, 2021, 11:07 AM
6 points
2 comments10 min readLW link

Creat­ing AGI Safety Interlocks

Koen.HoltmanFeb 5, 2021, 12:01 PM
7 points
4 comments8 min readLW link

Safely con­trol­ling the AGI agent re­ward function

Koen.HoltmanFeb 17, 2021, 2:47 PM
8 points
0 comments5 min readLW link

What is a Coun­ter­fac­tual: An Ele­men­tary In­tro­duc­tion to the Causal Hierarchy

DarmaniJan 2, 2022, 3:46 AM
11 points
2 comments5 min readLW link

Ini­tial Thoughts on Dis­solv­ing “Could­ness”

DragonGodSep 22, 2022, 9:23 PM
6 points
1 comment3 min readLW link

[Sketch] Val­idity Cri­te­rion for Log­i­cal Counterfactuals

DragonGodOct 11, 2022, 1:31 PM
6 points
0 comments6 min readLW link

Against the nor­ma­tive re­al­ist’s wager

Joe CarlsmithOct 13, 2022, 4:35 PM
16 points
9 comments23 min readLW link

An On­tol­ogy for Strate­gic Epistemology

StrivingForLegibilityDec 28, 2023, 10:11 PM
9 points
0 comments5 min readLW link

Why are coun­ter­fac­tu­als elu­sive?

Martín SotoMar 3, 2023, 8:13 PM
14 points
6 comments2 min readLW link

Distributed Strate­gic Epistemology

StrivingForLegibilityDec 28, 2023, 10:12 PM
11 points
0 comments3 min readLW link

Log­i­cal Line-Of-Sight Makes Games Se­quen­tial or Loopy

StrivingForLegibilityJan 19, 2024, 4:05 AM
40 points
0 comments7 min readLW link

Coun­ter­fac­tual Mechanism Networks

StrivingForLegibilityJan 30, 2024, 8:30 PM
4 points
0 comments5 min readLW link

To Boldly Code

StrivingForLegibilityJan 26, 2024, 6:25 PM
25 points
4 comments3 min readLW link

In­cor­po­rat­ing Mechanism De­sign Into De­ci­sion Theory

StrivingForLegibilityJan 26, 2024, 6:25 PM
17 points
4 comments4 min readLW link

(Some) Hu­mans do choose 5 dol­lars in the 5-and-10 problem

Akira PyinyaApr 7, 2025, 10:56 PM
−5 points
1 comment1 min readLW link

Time­less Control

Eliezer YudkowskyJun 7, 2008, 5:16 AM
47 points
69 comments9 min readLW link

Time­less De­ci­sion The­ory and Meta-Cir­cu­lar De­ci­sion Theory

Eliezer YudkowskyAug 20, 2009, 10:07 PM
42 points
37 comments10 min readLW link

Coun­ter­fac­tu­als, thick and thin

NisanJul 31, 2018, 3:43 PM
28 points
11 comments2 min readLW link

De­con­fus­ing Log­i­cal Counterfactuals

Chris_LeongJan 30, 2019, 3:13 PM
27 points
16 comments11 min readLW link

Con­di­tion­ing, Coun­ter­fac­tu­als, Ex­plo­ra­tion, and Gears

DiffractorJul 10, 2018, 10:11 PM
28 points
1 comment5 min readLW link

Coun­ter­fac­tual Mug­ging Poker Game

Scott GarrabrantJun 13, 2018, 11:34 PM
122 points
3 comments1 min readLW link

Coun­ter­fac­tual Mugging

Vladimir_NesovMar 19, 2009, 6:08 AM
82 points
296 comments2 min readLW link

Coun­ter­fac­tual Or­a­cles = on­line su­per­vised learn­ing with ran­dom se­lec­tion of train­ing episodes

Wei DaiSep 10, 2019, 8:29 AM
52 points
26 comments3 min readLW link

Coun­ter­fac­tual out­come state tran­si­tion parameters

Anders_HJul 27, 2018, 9:13 PM
37 points
1 comment6 min readLW link

Coun­ter­fac­tual re­siliency test for non-causal models

Stuart_ArmstrongAug 30, 2012, 5:30 PM
34 points
78 comments7 min readLW link

Coun­ter­fac­tu­als ver­sus the laws of physics

Stuart_ArmstrongFeb 18, 2020, 1:21 PM
16 points
0 comments1 min readLW link

Coun­ter­fac­tu­als are an An­swer, Not a Question

Chris_LeongSep 3, 2019, 3:36 PM
14 points
6 comments4 min readLW link