Søren Elverlin

Karma: 558

Søren Elverlin May 17, 2024, 8:51 AM
2 points
0
in reply to: Allthepinecones’s comment on: AstralCodexTen / LessWrong Meetup
You are very welcome, and I think you’ll fit right in. It’s quite a coincidence that you’re interested in documentary productions, as a documentary producer is visiting us for the first hour.

There’s a symbolic “AI Box” to contain AI discussion. I’d like to talk about RUF and the transportation infrastructure of Dath Ilan with you, but I usually end up in the box no matter what I do. :)

Søren Elverlin May 15, 2024, 7:21 AM
1 point
0
in reply to: tailcalled’s comment on: AstralCodexTen / LessWrong Meetup
Great!

AstralCodexTen / LessWrong Meetup

Søren ElverlinMay 14, 2024, 5:39 PM

4 points

5 comments1 min readLW link

AstralCodexTen / LessWrong Meetup

Søren ElverlinMar 4, 2024, 7:01 AM

4 points

0 comments1 min readLW link

Søren Elverlin Feb 2, 2024, 7:24 AM
1 point
0
in reply to: Roko’s comment on: Without Fundamental Advances, Rebellion and Coup d’État are the Inevitable Outcomes of Dictators & Monarchs Trying to Control Large, Capable Countries
Just to be sure I’m following you: When you are talking about the AI oppressor, are you envisioning some kind of recursive oversight scheme?

I assume here that your spoof is arguing that since we observe stable dictatorships, we should increase our probability that we will also be stable in our positions as dictators of a largely AI-run economy. (I recognize that it can be interpreted in other ways).

We expect we will have the two advantages over the AIs: We will be able to read their parameters directly, and we will be able to read any communication we wish. This is clearly insufficient, so we will need to have “AI Opressors” to help us interpret the mountains of data.

Two obvious objections:
1. How do we ensure the alignment of the AI Opressors?
2. Proper oversight of an agent that is more capable than yourself seems to become dramatically harder as the capability gap increases.

Søren Elverlin Jan 31, 2024, 7:07 PM
7 points
0
on: Without Fundamental Advances, Rebellion and Coup d’État are the Inevitable Outcomes of Dictators & Monarchs Trying to Control Large, Capable Countries
This post clearly spoofs Without fundamental advances, misalignment and catastrophe are the default outcomes of training powerful AI, though it changes “default” to “inevitable”.

I think that coup d’États and rebellions are nearly common enough that they could be called the default, though they are certainly not inevitable.

I enjoyed this post. Upvoted.

AstralCodexTen / LessWrong Meetup

Søren ElverlinJan 8, 2024, 7:13 AM

4 points

0 comments1 min readLW link

Søren Elverlin Dec 28, 2023, 5:17 PM
9 points
0
on: Critical review of Christiano’s disagreements with Yudkowsky
On this subject, here is my 2 hours long presentation (in 3 parts), going over just about every paragraph in Paul Christiano’s “Where I agree and disagree with Eliezer”:

https://youtu.be/V8R0s8tesM0?si=qrSJP3V_WnoBptkL

https://youtu.be/a2qTNuD1Sn8?si=YHyCr8AC0HkEnN4J

https://youtu.be/8XWbPDvKgM0?si=SvLfL4bhHDO6zDBu

Søren Elverlin Dec 3, 2023, 8:44 AM
4 points
0
in reply to: Screwtape’s comment on: 2023 Unofficial LessWrong Census/Survey
I have now also taken the 2023 organizer census.

Søren Elverlin Dec 2, 2023, 7:22 PM
3 points
0
in reply to: tailcalled’s comment on: 2023 Unofficial LessWrong Census/Survey

The government knows well how to balance costs and benefits.

Consider this story (in Danish): The Danish Ministry of Finance are aware that the decisions they are making are short-sighted, but are making them anyway for political reasons.

If one believed this decision was representative of the government in general, would one agree with your statement or disagree with it?

Søren Elverlin Dec 2, 2023, 7:12 PM
28 points
0
on: 2023 Unofficial LessWrong Census/Survey
I took the survey, and enjoyed it. There was a suggestion to also fill out the Rationalist Organizer Census, 2023. I can’t remember if I have already filled it out, or I’m mixing it together with the 2022 Census. Is it new?

Søren Elverlin Nov 28, 2023, 11:16 AM
3 points
0
on: Could World War I have been prevented given the benefit of hindsight?
Tell the truth about the devastation caused, if possible also to the public.

Germany ought to be more reluctant to attack with the knowledge that they lost hard in another timeline.

Tell them how much better EU-style cooperation is.

Suggest a NATO-style alliance.

If a Great War is started, promise to help the defenders by telling them everything.

Søren Elverlin Oct 13, 2023, 11:10 AM
3 points
0
on: Solstice 2023 Roundup
Copenhagen, Denmark

6th of January, 15:00 local time

https://www.lesswrong.com/events/Wfu4KLg84ZrANFuWn/astralcodexten-lesswrong-meetup-7

AstralCodexTen / LessWrong Meetup

Søren ElverlinOct 8, 2023, 5:07 PM

5 points

0 comments1 min readLW link

AstralCodexTen / LessWrong Meetup

Søren ElverlinAug 7, 2023, 10:54 AM

4 points

0 comments1 min readLW link

Søren Elverlin May 30, 2023, 7:21 AM
2 points
0
on: A very non-technical explanation of the basics of infra-Bayesianism
We discussed this post in the AISafety.com Reading Group, and have a few questions about it and infra-bayesianism:
1. The image on top of the sequence on Infra-Bayesianism shows a tree, which we interpret as a game-tree, with Murphy and an agent alternating in taking actions. Can we say anything about such a tree? E.g. Complexity, Pruning, etc?
2. There was some discussion about if an infra-bayesian agent could be Dutch-booked. Is this possible?
3. Your introduction makes no attempt to explain “convexity”, which seems like a central part of Infra-Bayesianism. If it is central, what would be a good one-paragraph summary?
4. Will any sufficiently smart agent be infra-bayesian? To be precise, can you replace “Bayesian” with “Infra-Bayesian” in this article: https://arbital.com/p/optimized_agent_appears_coherent/ ?

AstralCodexTen / LessWrong Meetup

Søren ElverlinMay 28, 2023, 6:21 PM

5 points

0 comments1 min readLW link

Søren Elverlin May 26, 2023, 7:02 AM
3 points
0
in reply to: Chris_Leong’s comment on: Retrospective: Lessons from the Failed Alignment Startup AISafety.com
Yes, we were excited when we learned about ARC Evals. Some kind of evaluation was one of our possible paths to impact, though real-world data is much more messy than the carefully constructed evaluations I’ve seen ARC use. This has both advantages and disadvantages.

Søren Elverlin May 25, 2023, 10:35 AM
3 points
0
in reply to: Adam Zerner’s comment on: Retrospective: Lessons from the Failed Alignment Startup AISafety.com
I think a “Wizard of Oz”-style MVP may have been feasible, though a big part of our value proposition was speed. In retrospect, I could maybe have told the customer that the speed would be slower the first couple of months, and they likely would have accepted that. If I had done so, we plausibly could have failed faster, which is highly desirable.

Søren Elverlin May 25, 2023, 10:27 AM
1 point
0
in reply to: Chris_Leong’s comment on: Retrospective: Lessons from the Failed Alignment Startup AISafety.com
Back 18 months ago, my (now falsified) theory was that some of the limitations we were seeing in GPT-3 were symptoms of a general inability of LLMs to reason strategically. This would have significant implications for alignment, in particular for our estimates of when they would become dangerous.

We noticed some business processes required a big-picture out-of-the-box kind of thinking that was kinda strategic if you squint, and observed that GPT-3 seemed to consistently fail to perform them in the same way as humans. Our hope was that by implementing these processes (as well as simpler and adjacent processes) we would be able to more precisely delineate what the strategic limitations of GPT-3 were.

Søren Elverlin

As­tralCodexTen /​ LessWrong Meetup

As­tralCodexTen /​ LessWrong Meetup

As­tralCodexTen /​ LessWrong Meetup

As­tralCodexTen /​ LessWrong Meetup

As­tralCodexTen /​ LessWrong Meetup

As­tralCodexTen /​ LessWrong Meetup

AstralCodexTen / LessWrong Meetup

AstralCodexTen / LessWrong Meetup

AstralCodexTen / LessWrong Meetup

AstralCodexTen / LessWrong Meetup

AstralCodexTen / LessWrong Meetup

AstralCodexTen / LessWrong Meetup