AlexMennen (Alex Mennen) · Karma: 4,451
Posts
What is calibration? — AlexMennen · 13 Mar 2023 6:30 UTC · 27 points · 1 comment · 4 min read
Searching for a model’s concepts by their shape – a theoretical framework — Kaarel, gekaklam, Walter Laurito, Kay Kozaronek, AlexMennen and June Ku · 23 Feb 2023 20:14 UTC · 51 points · 0 comments · 19 min read
Event [Berkeley]: Alignment Collaborator Speed-Meeting — AlexMennen and Carson Jones · 19 Dec 2022 2:24 UTC · 18 points · 2 comments · 1 min read
Why bet Kelly? — AlexMennen · 15 Nov 2022 18:12 UTC · 32 points · 14 comments · 5 min read
Average probabilities, not log odds — AlexMennen · 12 Nov 2021 21:39 UTC · 27 points · 20 comments · 5 min read
Mapping Out Alignment — Logan Riggs, adamShimi, Gurkenglas, AlexMennen and Gyrodiot · 15 Aug 2020 1:02 UTC · 43 points · 0 comments · 5 min read
AlexMennen’s Shortform — AlexMennen · 8 Dec 2019 4:51 UTC · 7 points · 1 comment · 1 min read
When wishful thinking works — AlexMennen · 1 Sep 2018 23:43 UTC · 41 points · 1 comment · 3 min read
Safely and usefully spectating on AIs optimizing over toy worlds — AlexMennen · 31 Jul 2018 18:30 UTC · 24 points · 16 comments · 2 min read
Computational efficiency reasons not to model VNM-rational preference relations with utility functions — AlexMennen · 25 Jul 2018 2:11 UTC · 16 points · 5 comments · 3 min read
A comment on the IDA-AlphaGoZero metaphor; capabilities versus alignment — AlexMennen · 11 Jul 2018 1:03 UTC · 40 points · 1 comment · 1 min read
Logical uncertainty and mathematical uncertainty — AlexMennen · 1 Jul 2018 0:33 UTC · 0 points · 0 comments · 1 min read · (www.lesswrong.com)
Logical uncertainty and Mathematical uncertainty — AlexMennen · 26 Jun 2018 1:08 UTC · 35 points · 6 comments · 4 min read
More on the Linear Utility Hypothesis and the Leverage Prior — AlexMennen · 26 Feb 2018 23:53 UTC · 16 points · 4 comments · 9 min read
Value learning subproblem: learning goals of simple agents — AlexMennen · 18 Dec 2017 2:05 UTC · 0 points · 0 comments · 2 min read
Against the Linear Utility Hypothesis and the Leverage Penalty — AlexMennen · 14 Dec 2017 18:38 UTC · 39 points · 47 comments · 11 min read
Being legible to other agents by committing to using weaker reasoning systems — AlexMennen · 3 Dec 2017 7:49 UTC · 4 points · 1 comment · 3 min read
Metamathematics and probability — AlexMennen · 22 Sep 2017 4:04 UTC · 1 point · 0 comments · 1 min read · (alexmennen.com)
Metamathematics and Probability — AlexMennen · 22 Sep 2017 3:07 UTC · 1 point · 0 comments · 1 min read · (alexmennen.com)
Density Zero Exploration — AlexMennen · 17 Aug 2017 0:43 UTC · 4 points · 0 comments · 2 min read