sbenthall

Karma: 370

Reward Hacking from a Causal Perspective

tom4everitt, Francis Rhys Ward, sbenthall, James Fox, mattmacdermott and RyanCarey

Jul 21, 2023, 6:27 PM

29 points

6 comments7 min readLW link

Incentives from a causal perspective

tom4everitt, James Fox, RyanCarey, mattmacdermott, sbenthall and Jonathan Richens

Jul 10, 2023, 5:16 PM

27 points

0 comments6 min readLW link

Causality: A Brief Introduction

tom4everitt, Lewis Hammond, Jonathan Richens, Francis Rhys Ward, RyanCarey, sbenthall and James Fox

Jun 20, 2023, 3:01 PM

49 points

18 comments6 min readLW link

Introduction to Towards Causal Foundations of Safe AGI

tom4everitt, Lewis Hammond, Francis Rhys Ward, RyanCarey, James Fox, mattmacdermott and sbenthall

Jun 12, 2023, 5:55 PM

70 points

6 comments4 min readLW link

sbenthall Oct 11, 2022, 6:07 PM
0 points
0
in reply to: EniScien’s comment on: Ukraine Situation Report 2022/03/01
This point about Ukrainian neo-Nazis is very misunderstood by the West.
During the Maidan revolution in Ukraine in 2014, neo-Nazi groups occupied government buildings and brought about a transition of government.
Why are there neo-Nazis in Ukraine? Because during WWII, the Nazis and the USSR were fighting over Ukraine. Ukraine is today quite ethnically diverse, and some of the ‘western’ Ukrainians who were resentful of USSR rule and, later, Russian influence, have reclaimed nazi ideas as part of a far-right Ukrainian nationalism. Some of these nazi groups that were originally militias have been incorporated into the Ukrainian military.
This is all quite well documented:
https://en.wikipedia.org/wiki/2014_Euromaidan_regional_state_administration_occupations
https://jacobin.com/2022/02/maidan-protests-neo-nazis-russia-nato-crimea
One of the regiments most well known to have Nazi ties was defeated at the Siege of Mariupol
https://en.wikipedia.org/wiki/Azov_Regiment
Naturally, this history is downplayed in presentations of Ukrainian nationalism targeted at the West, and emphasized in Russia depictions of Ukraine.

sbenthall Oct 11, 2022, 5:54 PM
1 point
0
on: Ukraine Post #12
Thanks for writing this. I have been fretting for some time and realized that what I needed was a rational take on the war. I appreciate the time you’ve taken you write this out and I’ll check out your other posts on this.

Don’t Fear the Reaper: Refuting Bostrom’s Superintelligence Argument

sbenthallMar 1, 2017, 2:28 PM

9 points

20 comments1 min readLW link

Autonomy, utility, and desire; against consequentialism in AI design

sbenthallDec 3, 2014, 5:34 PM

7 points

5 comments3 min readLW link

more on predicting agents

sbenthallNov 8, 2014, 6:43 AM

1 point

11 comments2 min readLW link

sbenthall Nov 8, 2014, 6:14 AM
0 points
0
in reply to: Dagon’s comment on: prediction and capacity to represent
This seems correct to me. Thank you.

sbenthall Nov 8, 2014, 6:13 AM
0 points
0
in reply to: Wes_W’s comment on: prediction and capacity to represent
You don’t know anything about how cars work?

sbenthall Nov 8, 2014, 6:11 AM
0 points
0
in reply to: ChristianKl’s comment on: prediction and capacity to represent

It’s possible to predict the behavior of black boxes without knowing anything about their internal structure.

Elaborate?

That says a lot more about your personal values then the general human condition.

I suppose you are right.

The models of worms might be a bit better at predicting worm behavior but they are not perfect.

They are significantly closer to being perfect than our models of humans. I think you are right in pointing out that where you draw the line is somewhat arbitrary. But the point is the variation on the continuum.

sbenthall Nov 8, 2014, 6:07 AM
0 points
0
in reply to: SolveIt’s comment on: prediction and capacity to represent
Do you think it is something external to the birds that make them migrate?

prediction and capacity to represent

sbenthallNov 4, 2014, 6:09 AM

−9 points

20 comments1 min readLW link

AI Tao

sbenthallOct 21, 2014, 1:15 AM

−17 points

3 comments1 min readLW link

sbenthall Oct 21, 2014, 12:16 AM
3 points
0
in reply to: Gunnar_Zarncke’s comment on: What is optimization power, formally?
Norbert Wiener is where it all starts. This book has a lot of essays. It’s interesting—he’s talking about learning machines before “machine learning” was a household word, but envisioning it as electrical circuits.

http://www.amazon.com/Cybernetics-Second-Edition-Control-Communication/dp/026273009X

I think that it’s important to look inside the boxes. We know a lot about the mathematical limits of boxes which could help us understand whether and how they might go foom.

Thank you for introducing me to that Concrete Mathematics book. That looks cool.

I would be really interested to see how you model this problem. I’m afraid that op-amps are not something I’m familiar with but it sounds like you are onto something.

sbenthall Oct 21, 2014, 12:10 AM
2 points
0
on: Four things every community should do
Do you think that rationalism is becoming a religion, or should become one?

sbenthall Oct 21, 2014, 12:08 AM
2 points
0
in reply to: Stuart_Armstrong’s comment on: What is optimization power, formally?
Thanks. That criticism makes sense to me. You put the point very concretely.

What do you think of the use of optimization power in arguments about takeoff speed and x-risk?

Or do you have a different research agenda altogether?

sbenthall Oct 21, 2014, 12:05 AM
2 points
0
in reply to: lukeprog’s comment on: What is optimization power, formally?
That makes sense. I’m surprised that I haven’t found any explicit reference to that in the literature I’ve been looking at. Is that because it is considered to be implicitly understood?

One way to talk about optimization power, maybe, would be to consider a spectrum between unbounded, LaPlacean rationality and the dumbest things around. There seems to be a move away from this though, because it’s too tied to notions of intelligence and doesn’t look enough at outcomes?

It’s this move that I find confusing.

sbenthall Oct 20, 2014, 11:56 PM
5 points
0
in reply to: DavidLS’s comment on: Fixing Moral Hazards In Business Science
There are people in my department who do work in this area. I can reach out and ask them.

I think Mechanical Turk gets used a lot for survey experiments because it has a built-in compensation mechanism and there are ways to ask questions in ways that filter people into precisely what you want.

I wouldn’t dismiss Facebook ads so quickly. I bet there is a way to target mobile app developers on that.

My hunch is that like survey questions, sampling methods are going to need to be tuned case-by-case and patterns extracted inductively from that. Good social scientific experiment design is very hard. Standardizing it is a noble but difficult task.

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer