I have signed no contracts or agreements whose existence I cannot mention.
you could engage with the Survival and Flourishing Fund
Yeah! The S-process is pretty neat, buying into that might be a great idea once you’re ready to donate more.
Oh, yup, thanks, fixed.
Consider reaching out to Rob Miles.
He tends to get far more emails than he can handle so a cold contact might not work, but I can bump this up his list if you’re interested.
Firstly: Nice, glad to have another competent and well-resourced person on board. Welcome to the effort.
I suggest: Take some time to form reasonably deep models of the landscape, first technical[1] and then the major actors and how they’re interfacing with the challenge.[2] This will inform your strategy going forward. Most people, even people who are full time in AI safety, seem to not have super deep models (so don’t let yourself be socially-memetically tugged by people who don’t have clear models).
Being independently wealthy in this field is awesome, as you’ll be able to work on whatever your inner compass points to as the best, rather than needing to track grantmaker wants and all of the accompanying stress. With that level of income you’d also be able to be one of the top handful of grantmakers in the field if you wanted; the AISafety.com donation guide has a bunch of relevant info (though it might need an update sweep; feel free to ping me with questions on this).
Things look pretty bad in many directions, but it’s not over yet and the space of possible actions is vast. Best of skill finding good ones!
[1] I recommend https://agentfoundations.study/, and much of https://www.aisafety.com/stay-informed, and chewing on the ideas until they’re clear enough in your mind that you can easily get them across to almost anyone. This is good practice internally as well as good for the world. The Sequences are also excellent grounding for the type of thinking needed in this field; that’s what they were designed for. Start with the highlights, and maybe go on to the rest if it feels valuable. AI Safety Fundamentals courses are also worth taking, but you’ll want a lot of additional reading and thinking on top of them. I’d also be up for a call or two if you like; I’ve been doing the self-funded (plus sometimes grantmaking) try-to-save-the-world thing for some time now.
[2] Technical first seems best, as it’s the grounding which underpins what would be needed in governance, and I suspect it will help you orient better than going straight to governance.
Eh, <5%? More that we might be able to get the AIs to do most of the heavy lifting of figuring this out, but that’s a sliding scale of how much oversight the automated research systems need in order to not end up in the wrong places.
My current guess as to Anthropic’s effect:
0-8 months shorter timelines[1]
Much better chances of a good end in worlds where superalignment doesn’t require strong technical philosophy[2] (but I put very low odds on being in this world)
Somewhat better chances of a good end in worlds where superalignment does require strong technical philosophy[3]
[1] Shorter due to:
There being a number of people who might otherwise not have been willing to work for a scaling lab, or not do so as enthusiastically/effectively (~55% weight)
Encouraging race dynamics (~30%)
Making it less likely that there’s a broad alliance against scaling labs (~15%)
Partly counterbalanced by their encouraging better infosec practices and being more supportive of regulation than the alternatives.
[2] They’re trying a bunch of the things which, if alignment is easy, might actually work, and no other org has the level of leadership buy-in needed to invest in them as hard.
[3] Probably via AI-assisted alignment schemes, but building org competence in doing this kind of research manually, so they can direct the automated systems at the right problems and sort slop from sound ideas, is going to need to be a priority.
By “discard”, do you mean removing specifically the fixed-ness in your ontology, such that your cognition as a whole can move fluidly and the aspects of those models which don’t integrate with your wider system can dissolve? Or the alternate interpretation, where “discard” means actively rooting out and trying to remove the concept itself (rather than the fixed-ness of it)?
(also 👋, long time no see, glad you’re doing well)
I had a similar experience a couple of years back when running bio anchors with numbers which seemed to me more reasonable (less consistently slanted towards longer timelines), getting:
before taking into account AI accelerating AI development, which I expected to bring it a few years earlier.
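For anyone wanting to rerun this kind of exercise, here’s a minimal sketch of the core crossover calculation in a bio-anchors-style model. It’s a deterministic simplification, and every parameter value below is an illustrative placeholder rather than an input I actually used:

```python
# Minimal, deterministic sketch of a bio-anchors-style crossover estimate:
# find the first year in which the largest affordable training run (measured
# in 2020-equivalent FLOP) exceeds an assumed compute requirement.
# All parameter values are illustrative placeholders.

ANCHOR_FLOP = 1e30             # assumed training compute requirement (FLOP)
FLOP_PER_DOLLAR_2020 = 1e17    # assumed hardware price-performance in 2020
HARDWARE_DOUBLING_YEARS = 2.5  # assumed doubling time of FLOP per dollar
ALGO_HALVING_YEARS = 3.0       # assumed halving time of required compute (algorithmic progress)
SPEND_2020 = 1e8               # assumed largest training-run budget in 2020 (USD)
SPEND_GROWTH = 1.2             # assumed yearly growth factor in willingness to spend


def effective_compute(year: int) -> float:
    """Largest affordable training run in a given year, in 2020-equivalent FLOP."""
    t = year - 2020
    flop_per_dollar = FLOP_PER_DOLLAR_2020 * 2 ** (t / HARDWARE_DOUBLING_YEARS)
    spend = SPEND_2020 * SPEND_GROWTH ** t
    algo_multiplier = 2 ** (t / ALGO_HALVING_YEARS)
    return flop_per_dollar * spend * algo_multiplier


year = 2020
while effective_compute(year) < ANCHOR_FLOP:
    year += 1
print("Crossover year under these placeholder inputs:", year)
```

Plugging in your own anchor distribution and growth assumptions (and then shaving a few years off for AI accelerating AI development) is the whole exercise.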
Also, I suggest that given the number of tags in each section, “load more” should be “load all”.
This is awesome! Three comments:
1. Please make an easy-to-find Recent Changes feed (maybe a thing on the home page which only appears if you’ve made wiki edits). If you want an editor community, that will be their home: the thing they’re keeping up with, and how they know to positively reinforce each other.
2. The concepts portal is now a slightly awkward mix of articles and tags, with potentially very high-use tags being quite buried because no one’s written a good article for them (e.g. Rationality Quotes has 136 pages tagged, but zero karma, so it takes many clicks to reach). I’m especially thinking about the use case of wanting to know what types of articles there are to browse around. I’m not sure exactly what to do about this; maybe have the sorting be a mix of karma and number of tagged posts rather than karma alone? Like (k+10)*(t+10) or something (rough sketch after this list)? The disadvantage is that this is opaque and moves much further from alphabetical ordering.
3. A bunch of the uncategorized ones could be categorized, but I’m not seeing a way to do this with normal permissions.
Adjusting 2 would make it much cleaner to categorize the many ones in 3 without that clogging up the normal lists.
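A rough sketch of the hybrid sort from point 2, with the field names, the example pages, and the +10 smoothing constants as placeholders:

```python
# Rough sketch of the hybrid sort from point 2: rank concept pages by a mix
# of article karma and number of tagged posts rather than karma alone.
# Field names, the example pages, and the +10 smoothing constants are placeholders.

def sort_score(karma: int, tagged_posts: int) -> int:
    return (karma + 10) * (tagged_posts + 10)

pages = [
    {"name": "Rationality Quotes", "karma": 0, "tagged_posts": 136},
    {"name": "Hypothetical well-written tag page", "karma": 80, "tagged_posts": 4},
]
pages.sort(key=lambda p: sort_score(p["karma"], p["tagged_posts"]), reverse=True)
for p in pages:
    print(p["name"], sort_score(p["karma"], p["tagged_posts"]))
```

Under this kind of scoring a zero-karma tag with lots of tagged posts still surfaces near the top, which is the behaviour I’m after.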
Nice! I’ll watch through these then probably add a lot of them to the aisafety.video playlist.
I’ve heard from people I trust that:
They can be pretty great, if you know what you want and set the prompt up right
They won’t be as skilled as a human therapist, and might throw you in at the deep end or not be tracking things a human would
Using them can be very worth it, as they’re always available and cheap, but they require a little intentionality. I suggest asking your human therapist for a few suggestions of kinds of work you might do with a peer or LLM assistant, and monitoring how it affects you while exploring, if you feel safe enough doing that. Maybe do it the day before a human session the first few times so you have a good safety net. Maybe ask some LWers what their system prompts are, or find some well-tested prompts elsewhere.
Looks like Tantrix:
oh yup, sorry, I meant mid 2026, like ~6 months before the primary proper starts. But could be earlier.
Yeah, this seems worth a shot. If we do this, we should do our own pre-primary in like mid 2027 to select who to run in each party, so that we don’t split the vote and also so that we select the best candidate.
Someone I know was involved in a DIY pre-primary in the UK which unseated an extremely safe politician, and we’d get a bunch of extra press while doing this.
Humans without scaffolding can do a very finite number of sequential reasoning steps without mistakes. That’s why thinking aids like paper, whiteboards, and other people to bounce ideas off and keep the cache fresh are so useful.
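As a toy illustration (the 98% per-step reliability figure is made up), if each unaided step is independently correct with probability p, an n-step chain only survives with probability p^n:

```python
# Toy illustration: if each unaided reasoning step is independently correct
# with probability p, an n-step chain is error-free with probability p**n.
# The 0.98 figure is made up for illustration.
p = 0.98
for n in (5, 20, 50, 100):
    print(f"{n:>3} steps: {p ** n:.0%} chance of no mistakes")
```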
With a large enough decisive strategic advantage, a system can afford to run safety checks on any future versions of itself and anything else it’s interacting with sufficient to stabilize values for extremely long periods of time.
Multipolar worlds, though? Yeah, they’re going to get eaten by evolution/Moloch/power-seeking/pythia.
More cynical take, based on the Musk/Altman emails: Altman was expecting Musk to be CEO. He set up a governance structure which would effectively be able to dethrone Musk, with himself as the obvious successor, and was happy to staff the board with ideological people who might well take issue with something Musk did down the line, giving Altman a shot at the throne.
Musk walked away, and it would have been too weird to then change the governance structure. Altman judged that, at any point before the trap actually fired, the probability of it firing wasn’t high enough to be worth disarming it.
I don’t know whether the dates line up to disconfirm this, but I could see this kind of 5D chess move happening. Though maybe normal power-and-incentive psychology is sufficient as an explanation.
Looks fun!
I could also remove Oil Seeker’s protection from Pollution; they don’t need it for making Black Chips to be worthwhile, and removing it would make that less of an amazing deal than it currently is.
Maybe halve the pollution cost for Black if removing the protection turns out to leave them too weak?
Yup, DMing for context!
Hmmm, I’m wondering whether you’re pointing at the thing in this space which I intuitively expect is good, just using words that sound more extreme than I’d use, or whether you’re pointing at a different thing. I’ll take a shot at describing the thing of this type that I’d be happy with, and you can let me know whether it feels like what you’re trying to point to: