Wuschel Schulz

Karma: 327

Wuschel Schulz Nov 14, 2022, 6:25 PM
2 points
0
in reply to: eva_’s comment on: A caveat to the Orthogonality Thesis
Yes, I would consider humans to already be unsafe, as we already made a sharp left turn that left us unaligned relative to our outer optimiser.

Dogs are a good point, thank you for that example. Not sure if dogs have our exact notion of corrigibility, but they definitely seem to be friendly in some relevant sense.

A caveat to the Orthogonality Thesis

Wuschel Schulz Nov 9, 2022, 3:06 PM

38 points

10 comments, 2 min read

Wuschel Schulz Nov 4, 2022, 8:30 AM
LW: 9 AF: 4
2
AF
on: Understanding and avoiding value drift
I am confused by the part where the Rick-shard can anticipate which plan the other shards will bid for. If I understood shard theory correctly, shards do not have their own world model; they can only bid actions up or down according to the consequences those actions might have under the world model that is available to all shards. Please correct me if I am wrong about this point.

So I don’t see how the Rick-shard could really “trick” the atheism-shard via rationalisation.

If the Rick-shard sees that “church-going for respect reasons” will lead to conversion, then the atheism-shard has to see that too, because they query the same world model. So the atheism-shard should bid against that plan just as heavily as against “going to church for conversion reasons”.

I think there is something else going on here. I think the Rick-shard does not trick the atheism-shard, but rather the conscious part that is not described by shard theory.
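To make the shared-world-model argument concrete, here is a minimal toy sketch in Python. Everything in it is an assumption of mine for illustration (the plan names, the `WORLD_MODEL` table, the bid values); it is not how shard theory is formally specified. It only shows that if bids depend solely on the outcome predicted by a world model every shard shares, two plans with the same predicted outcome get the same bid, whatever the stated reason for the plan.

```python
from typing import Dict

# Hypothetical shared world model: maps each plan to its predicted outcome.
# All shards query this same table; none has a private model.
WORLD_MODEL: Dict[str, str] = {
    "go to church for conversion reasons": "conversion",
    "go to church for respect reasons": "conversion",
    "stay home": "no conversion",
}


def predict(plan: str) -> str:
    """Return the outcome the shared world model predicts for a plan."""
    return WORLD_MODEL[plan]


def atheism_shard_bid(plan: str) -> float:
    """Toy bid: strongly against any plan predicted to end in conversion."""
    return -1.0 if predict(plan) == "conversion" else 0.0


# The bid depends only on the predicted outcome, not on the plan's framing.
bids = {plan: atheism_shard_bid(plan) for plan in WORLD_MODEL}
for plan, bid in bids.items():
    print(f"{plan}: {bid}")
```

In this toy setup the “respect reasons” framing buys the Rick-shard nothing, which is why I suspect the rationalisation is aimed at something outside the shards.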

Wuschel Schulz Oct 19, 2022, 1:22 PM
1 point
0
on: We may be able to see sharp left turns coming
In particular, these results suggest that we may be able to predict power-seeking, situational awareness, etc. in future models by evaluating those behaviors in terms of log-likelihood.
I am skeptical that this methodology could work for the following reason:
I think it is generally useful, when thinking about the sharp left turn, to keep the example of chimps and humans in mind: chimps as a pre-sharp-left-turn example and humans as a post-sharp-left-turn example.
Let’s say you look at a chimp, and you want to measure whether a sharp left turn is around the corner. You reason that post-sharp-left-turn animals should be able to come up with algebra. (So far, so correct.)
What you then do is measure the log-likelihood that a chimp would come up with algebra. I expect you would get a value pretty close to -inf, even though the post-sharp-left-turn Homo sapiens is only one species down the line.

[Question] Who is doing Cryonics-relevant research?

Wuschel Schulz Mar 15, 2022, 10:26 AM

32 points

4 comments, 1 min read

There is a line in the sand, just not where you think it is

Wuschel Schulz Jan 22, 2022, 10:33 AM

46 points

3 comments, 2 min read

Wuschel Schulz Mar 9, 2021, 4:00 PM
1 point
in reply to: Eric3’s comment on: The Halo Effect
I am also still looking for a reference on that one...

Wuschel Schulz Feb 18, 2021, 9:07 AM
1 point
on: The LessWrong 2018 Book is Available for Pre-order
You could make it even more accessible if a credit card were not the only payment option. In some places (like here in Germany), having a credit card is somewhat less common. Adding PayPal would be nice.

Wuschel Schulz Mar 27, 2020, 8:40 PM
8 points
on: Hammertime Final Exam
Rationality framework: The Greenland effect:
Remember the first time you looked at a world map: one thing that maybe caught your eye was Greenland, that huge island, almost as big as Africa, up there in the north.
Now remember the first time you took a closer look at a globe (or a non-Mercator projection, for that matter). Greenland is a bit disappointing, isn’t it? Doesn’t seem to be THAT big at all.
Now remember that time in geography class when you held presentations on the countries in Europe: in comparison to these folks, the icy plains of Denmark’s pet island seem gigantic. Not as gigantic as Africa, but still…
Depending on how much time you spent with geography, I can well imagine that cycle going back and forth some more.
What is important here is the following: even though your knowledge about the size of Greenland only ever increased over your life, your emotional attitude (“oh, quite big” or “nah, it’s an island, bruh”) switched around quite a lot in both directions.
Now, in the case of Greenland this is all well and fine, but in other scenarios it can lead to pseudo-disagreements or confused arguments: beware the Greenland effect. Beware that your emotional disposition towards an issue often reflects your last update on that issue (which should vary unpredictably) and not your overall beliefs on that issue (which should converge).
Examples of Greenland effects:
“The church is good, it teaches me about God” → “God is fake, the priest must be a moron, the world lied to me” → “These religious people are actually using a lot of their resources to help people in need” → “All those religious charities are so ineffective.” …
“I can’t stop this project now, I have already invested so many resources” → “I know about sunk cost bias. I will abandon my projects whenever they seem to be a bad idea” → “I should carry through projects despite having downs: sunk cost faith.” …
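A toy simulation of the distinction between the last update and the overall belief, as a rough sketch only. The numbers are made up: `TRUE_SIZE`, the Gaussian noise, and the running-mean update rule are my assumptions, not a claim about how people actually update. The overall belief settles near the true value, while the direction of the most recent update keeps flipping between the two attitudes.

```python
import random

TRUE_SIZE = 2.0          # hypothetical "true" quantity being estimated
rng = random.Random(0)   # fixed seed so the run is repeatable

belief = 0.0
beliefs = []    # overall belief after each observation
attitudes = []  # emotional reaction to the most recent update

for n in range(1, 1001):
    # Each new piece of evidence is the truth plus some noise.
    observation = TRUE_SIZE + rng.gauss(0.0, 1.0)
    # Running mean: the overall belief after n observations.
    new_belief = belief + (observation - belief) / n
    # The attitude only tracks the direction of the latest update.
    if new_belief > belief:
        attitudes.append("oh, quite big")
    else:
        attitudes.append("nah, it's an island, bruh")
    belief = new_belief
    beliefs.append(belief)

print("final belief:", round(beliefs[-1], 3))
print("last five attitudes:", attitudes[-5:])
```

The attitude at any given moment tells you almost nothing about where the belief has ended up, which is the point of the Greenland effect.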

Wuschel Schulz Mar 17, 2020, 2:02 PM
0 points
on: Focusing
OK, I’m kind of new to the whole LessWrong business, so can someone please explain to me:
What is your thing with Jordan Peterson? I get that he is a psychologist and so on, but there are a lot of people out there who take not just his 101 life advice to heart, but also his political … ideas?
From the way he is quoted in this sequence, and the fact that there seems to be no discussion about this in the comments, you seem to see him as a legitimate expert on rationality? Or do you separate his psychology from his politics? Or does no one here know him except alkjash? I’d love to hear from you all!

Wuschel Schulz Sep 24, 2019, 9:29 PM
4 points
on: The Adventure: a new Utopia story

I laughed so hard at the “...and then, finally, he truly knew what it was like to be a bat...” part. Every time a philosophy course at my uni gets to the topic of qualia, someone brings up the exact same example of the difference between knowing how I would feel being a bat and how the bat feels… That reference came so unexpectedly.
Otherwise, also a nice story and an interesting universe. Thanks for posting it.
