sbenthall

Karma: 365

Reward Hacking from a Causal Perspective

21 Jul 2023 18:27 UTC
29 points
5 comments · 7 min read

Incentives from a causal perspective

10 Jul 2023 17:16 UTC
27 points
0 comments · 6 min read

Causality: A Brief Introduction

20 Jun 2023 15:01 UTC
48 points
18 comments · 6 min read

Introduction to Towards Causal Foundations of Safe AGI

12 Jun 2023 17:55 UTC
67 points
6 comments · 4 min read

Don’t Fear the Reaper: Refuting Bostrom’s Superintelligence Argument

sbenthall · 1 Mar 2017 14:28 UTC
9 points
20 comments · 1 min read

Autonomy, utility, and desire; against consequentialism in AI design

sbenthall · 3 Dec 2014 17:34 UTC
7 points
5 comments · 3 min read

more on predicting agents

sbenthall · 8 Nov 2014 6:43 UTC
1 point
11 comments · 2 min read

prediction and capacity to represent

sbenthall · 4 Nov 2014 6:09 UTC
−9 points
20 comments · 1 min read

AI Tao

sbenthall · 21 Oct 2014 1:15 UTC
−17 points
3 comments · 1 min read