sbenthall

Karma: 365

Reward Hacking from a Causal Perspective

tom4everitt, Francis Rhys Ward, sbenthall, James Fox, mattmacdermott and RyanCarey

21 Jul 2023 18:27 UTC

29 points

5 comments7 min readLW link

Incentives from a causal perspective

tom4everitt, James Fox, RyanCarey, mattmacdermott, sbenthall and Jonathan Richens

10 Jul 2023 17:16 UTC

27 points

0 comments6 min readLW link

Causality: A Brief Introduction

tom4everitt, Lewis Hammond, Jonathan Richens, Francis Rhys Ward, RyanCarey, sbenthall and James Fox

20 Jun 2023 15:01 UTC

48 points

18 comments6 min readLW link

Introduction to Towards Causal Foundations of Safe AGI

tom4everitt, Lewis Hammond, Francis Rhys Ward, RyanCarey, James Fox, mattmacdermott and sbenthall

12 Jun 2023 17:55 UTC

67 points

6 comments4 min readLW link

sbenthall 11 Oct 2022 18:07 UTC
0 points
in reply to: EniScien’s comment on: Ukraine Situation Report 2022/03/01
This point about Ukrainian neo-Nazis is very misunderstood by the West.
During the Maidan revolution in Ukraine in 2014, neo-Nazi groups occupied government buildings and brought about a transition of government.
Why are there neo-Nazis in Ukraine? Because during WWII, the Nazis and the USSR were fighting over Ukraine. Ukraine is today quite ethnically diverse, and some of the ‘western’ Ukrainians who were resentful of USSR rule and, later, Russian influence, have reclaimed nazi ideas as part of a far-right Ukrainian nationalism. Some of these nazi groups that were originally militias have been incorporated into the Ukrainian military.
This is all quite well documented:
https://en.wikipedia.org/wiki/2014_Euromaidan_regional_state_administration_occupations
https://jacobin.com/2022/02/maidan-protests-neo-nazis-russia-nato-crimea
One of the regiments most well known to have Nazi ties was defeated at the Siege of Mariupol
https://en.wikipedia.org/wiki/Azov_Regiment
Naturally, this history is downplayed in presentations of Ukrainian nationalism targeted at the West, and emphasized in Russia depictions of Ukraine.

sbenthall 11 Oct 2022 17:54 UTC
1 point
0
on: Ukraine Post #12
Thanks for writing this. I have been fretting for some time and realized that what I needed was a rational take on the war. I appreciate the time you’ve taken you write this out and I’ll check out your other posts on this.

Don’t Fear the Reaper: Refuting Bostrom’s Superintelligence Argument

sbenthall1 Mar 2017 14:28 UTC

9 points

20 comments1 min readLW link

Autonomy, utility, and desire; against consequentialism in AI design

sbenthall3 Dec 2014 17:34 UTC

7 points

5 comments3 min readLW link

more on predicting agents

sbenthall8 Nov 2014 6:43 UTC

1 point

11 comments2 min readLW link

sbenthall 8 Nov 2014 6:14 UTC
0 points
in reply to: Dagon’s comment on: prediction and capacity to represent
This seems correct to me. Thank you.

sbenthall 8 Nov 2014 6:13 UTC
0 points
in reply to: Wes_W’s comment on: prediction and capacity to represent
You don’t know anything about how cars work?

sbenthall 8 Nov 2014 6:11 UTC
0 points
in reply to: ChristianKl’s comment on: prediction and capacity to represent

It’s possible to predict the behavior of black boxes without knowing anything about their internal structure.

Elaborate?

That says a lot more about your personal values then the general human condition.

I suppose you are right.

The models of worms might be a bit better at predicting worm behavior but they are not perfect.

They are significantly closer to being perfect than our models of humans. I think you are right in pointing out that where you draw the line is somewhat arbitrary. But the point is the variation on the continuum.

sbenthall 8 Nov 2014 6:07 UTC
0 points
in reply to: SolveIt’s comment on: prediction and capacity to represent
Do you think it is something external to the birds that make them migrate?

prediction and capacity to represent

sbenthall4 Nov 2014 6:09 UTC

−9 points

20 comments1 min readLW link

AI Tao

sbenthall21 Oct 2014 1:15 UTC

−17 points

3 comments1 min readLW link

sbenthall 21 Oct 2014 0:16 UTC
3 points
in reply to: Gunnar_Zarncke’s comment on: What is optimization power, formally?
Norbert Wiener is where it all starts. This book has a lot of essays. It’s interesting—he’s talking about learning machines before “machine learning” was a household word, but envisioning it as electrical circuits.

http://www.amazon.com/Cybernetics-Second-Edition-Control-Communication/dp/026273009X

I think that it’s important to look inside the boxes. We know a lot about the mathematical limits of boxes which could help us understand whether and how they might go foom.

Thank you for introducing me to that Concrete Mathematics book. That looks cool.

I would be really interested to see how you model this problem. I’m afraid that op-amps are not something I’m familiar with but it sounds like you are onto something.

sbenthall 21 Oct 2014 0:10 UTC
2 points
on: Four things every community should do
Do you think that rationalism is becoming a religion, or should become one?

sbenthall 21 Oct 2014 0:08 UTC
2 points
in reply to: Stuart_Armstrong’s comment on: What is optimization power, formally?
Thanks. That criticism makes sense to me. You put the point very concretely.

What do you think of the use of optimization power in arguments about takeoff speed and x-risk?

Or do you have a different research agenda altogether?

sbenthall 21 Oct 2014 0:05 UTC
2 points
in reply to: lukeprog’s comment on: What is optimization power, formally?
That makes sense. I’m surprised that I haven’t found any explicit reference to that in the literature I’ve been looking at. Is that because it is considered to be implicitly understood?

One way to talk about optimization power, maybe, would be to consider a spectrum between unbounded, LaPlacean rationality and the dumbest things around. There seems to be a move away from this though, because it’s too tied to notions of intelligence and doesn’t look enough at outcomes?

It’s this move that I find confusing.

sbenthall 20 Oct 2014 23:56 UTC
5 points
in reply to: DavidLS’s comment on: Fixing Moral Hazards In Business Science
There are people in my department who do work in this area. I can reach out and ask them.

I think Mechanical Turk gets used a lot for survey experiments because it has a built-in compensation mechanism and there are ways to ask questions in ways that filter people into precisely what you want.

I wouldn’t dismiss Facebook ads so quickly. I bet there is a way to target mobile app developers on that.

My hunch is that like survey questions, sampling methods are going to need to be tuned case-by-case and patterns extracted inductively from that. Good social scientific experiment design is very hard. Standardizing it is a noble but difficult task.