RSS

AlexMennen(Alex Mennen)

Karma: 4,451

What is cal­ibra­tion?

AlexMennen13 Mar 2023 6:30 UTC
27 points
1 comment4 min readLW link

Search­ing for a model’s con­cepts by their shape – a the­o­ret­i­cal framework

23 Feb 2023 20:14 UTC
51 points
0 comments19 min readLW link

Event [Berkeley]: Align­ment Col­lab­o­ra­tor Speed-Meeting

19 Dec 2022 2:24 UTC
18 points
2 comments1 min readLW link

Why bet Kelly?

AlexMennen15 Nov 2022 18:12 UTC
32 points
14 comments5 min readLW link

Aver­age prob­a­bil­ities, not log odds

AlexMennen12 Nov 2021 21:39 UTC
27 points
20 comments5 min readLW link

Map­ping Out Alignment

15 Aug 2020 1:02 UTC
43 points
0 comments5 min readLW link

AlexMen­nen’s Shortform

AlexMennen8 Dec 2019 4:51 UTC
7 points
1 comment1 min readLW link

When wish­ful think­ing works

AlexMennen1 Sep 2018 23:43 UTC
41 points
1 comment3 min readLW link

Safely and use­fully spec­tat­ing on AIs op­ti­miz­ing over toy worlds

AlexMennen31 Jul 2018 18:30 UTC
24 points
16 comments2 min readLW link

Com­pu­ta­tional effi­ciency rea­sons not to model VNM-ra­tio­nal prefer­ence re­la­tions with util­ity functions

AlexMennen25 Jul 2018 2:11 UTC
16 points
5 comments3 min readLW link

A com­ment on the IDA-AlphaGoZero metaphor; ca­pa­bil­ities ver­sus alignment

AlexMennen11 Jul 2018 1:03 UTC
40 points
1 comment1 min readLW link

Log­i­cal un­cer­tainty and math­e­mat­i­cal uncertainty

AlexMennen1 Jul 2018 0:33 UTC
0 points
0 comments1 min readLW link
(www.lesswrong.com)

Log­i­cal un­cer­tainty and Math­e­mat­i­cal uncertainty

AlexMennen26 Jun 2018 1:08 UTC
35 points
6 comments4 min readLW link

More on the Lin­ear Utility Hy­poth­e­sis and the Lev­er­age Prior

AlexMennen26 Feb 2018 23:53 UTC
16 points
4 comments9 min readLW link

Value learn­ing sub­prob­lem: learn­ing goals of sim­ple agents

AlexMennen18 Dec 2017 2:05 UTC
0 points
0 comments2 min readLW link

Against the Lin­ear Utility Hy­poth­e­sis and the Lev­er­age Penalty

AlexMennen14 Dec 2017 18:38 UTC
39 points
47 comments11 min readLW link

Be­ing leg­ible to other agents by com­mit­ting to us­ing weaker rea­son­ing systems

AlexMennen3 Dec 2017 7:49 UTC
4 points
1 comment3 min readLW link

Me­ta­math­e­mat­ics and probability

AlexMennen22 Sep 2017 4:04 UTC
1 point
0 comments1 min readLW link
(alexmennen.com)

Me­ta­math­e­mat­ics and Probability

AlexMennen22 Sep 2017 3:07 UTC
1 point
0 comments1 min readLW link
(alexmennen.com)

Den­sity Zero Exploration

AlexMennen17 Aug 2017 0:43 UTC
4 points
0 comments2 min readLW link