Purplehermann

Karma: 165

Purplehermann Dec 27, 2024, 11:05 AM
2 points
0
on: Hire (or become) a Thinking Assistant / Body Double
I do things like this at times with my teams.

Important things:
- Don’t think you need to solve the actual problem for them
- Do solve ‘friction’ for them as much as possible
- Do feel free to look up other sources so you can offer more perspective and to take off the load of having to find relevant info
- positive energy, attentive etc
- if they’re functioning well just watch and listen while being interested and unobtrusive, at most very minor inputs if you’re pretty sure it’ll be helpful
- If stuck at a crossroads, ask them how long they think each path will take or how hard it’ll be, and give them feedback if you think they’re wrong. Help them start working on one; people can get stuck for longer than it would take to actually do one option.
- if lost, methodically go through the different areas where the issue could be and methodically go through all the directions they could take for each area and in general. You don’t need to think these up, but keep track of them and help guide towards picking apart the problem and solution spaces. This takes some mental load off.

Purplehermann Dec 27, 2024, 10:47 AM
4 points
0
on: Purplehermann’s Shortform
A message from Claude:

‴This has been a fascinating and clarifying discussion. A few key insights I’ll take away:

The distinction between bounded and unbounded optimization is more fundamental than specific value differences between AIs. The real existential threat comes from unbounded optimizers. The immune system/cancer metaphor provides a useful framework—it’s about maintaining a stable system that can identify and prevent destructive unbounded growth, not about enforcing a single value set. The timing challenge is critical but more specific than I initially thought—we don’t necessarily need the “first” AGI to be perfect, but we need bounded optimizers to establish themselves before any unbounded ones emerge.

Some questions this raises for further exploration:

What makes a Schelling fence truly stable under recursive self-improvement? Could bounded optimizers coordinate even with different base values, united by shared meta-level constraints? Are there ways to detect early if an AI system will maintain bounds during capability gain?

The framing of “cancer prevention” versus “value enforcement” feels like an important shift in how we think about AI governance and safety. Instead of trying to perfectly specify values, perhaps we should focus more on creating robust self-limiting mechanisms that can persist through capability gains.‴

Purplehermann Dec 26, 2024, 11:22 PM
8 points
0
on: The Field of AI Alignment: A Postmortem, and What To Do About It
A few thoughts.
1. Have you checked what happens when you throw physics postdocs at the core issues: do they actually get traction, or just stare at the sheer cliff for longer while thinking? Did anything come out of the ILIAD meeting half a year later? Is there a reason that more standard STEM people aren’t given an intro to some of the routes currently thought possibly workable, so they can feel some traction? I think any of these could be true (intelligence and skills aren’t actually useful right now, the problem is not tractable, or better onboarding could let the current talent pool get traction), and in any case it might not be very cost effective to get physics postdocs involved.
2. Humans are generally better at doing things when they have more tools available. While the ‘hard bits’ might be intractable now, they could well be easier to deal with in a few years after other technical and conceptual advances in AI, and even other fields. (Something something about prompt engineering and Anthropic’s mechanistic interpretability from inside the field and practical quantum computing outside).
This would mean squeezing every drop of usefulness out of AI at each level of capability, to improve general understanding and to leverage it into breakthroughs in other fields before capabilities increase further. In fact, it might be best to sabotage semiconductor/chip production once the models are one gen before super-intelligence/extinction/whatever, giving maximum time to leverage maximum capabilities and tackle alignment before the AIs get too smart.
3. How close is mechanistic interpretability to the hard problems, and what makes it not good enough?

Purplehermann Dec 26, 2024, 10:33 PM
3 points
0
in reply to: Carl Feynman’s comment on: Purplehermann’s Shortform
The point was more that creating your own data is easy: just generate code, then check it by running it. Save this code, and later use it for training.
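
A rough sketch of that generate-run-save step, in Python. The candidate strings are hypothetical stand-ins for model output, and `runs_cleanly` and `collect_training_samples` are made-up helper names. "Works" here only means "exits without an error", which is a much weaker check than meeting requirements.

```python
import subprocess
import sys


def runs_cleanly(source: str, timeout_s: float = 5.0) -> bool:
    """Run a snippet in a fresh interpreter; True if it exits with code 0."""
    try:
        result = subprocess.run(
            [sys.executable, "-c", source],
            capture_output=True,
            timeout=timeout_s,
        )
    except subprocess.TimeoutExpired:
        return False  # treat hangs the same as errors
    return result.returncode == 0


def collect_training_samples(candidates):
    """Split candidates into (kept, rejected) by whether they run without error."""
    kept, rejected = [], []
    for source in candidates:
        (kept if runs_cleanly(source) else rejected).append(source)
    return kept, rejected


# Hypothetical stand-ins for model-generated code.
candidates = ["print(sum(range(10)))", "print(1 / 0)", "print(undefined_name)"]
kept, rejected = collect_training_samples(candidates)
```

The rejected list is kept rather than thrown away, since failing examples could also be useful as negative training signal.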

If we wanted to go the way of AlphaZero it doesn’t seem crazy.

Negatively reinforce commands, functions, and programs which output errors, for a start.

I didn’t think of the PM as being trained by these games, that’s interesting. Maybe have two instances competing to get closer on some test cases the PM can prepare to go with the task, and have them compete on time, compute, memory, and accuracy. You can negatively reinforce the less accurate one, and if both are fully accurate they can compete on time, memory, and CPU.
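
One way the two-instance competition could be scored, as a sketch only. `evaluate` and `pick_winner` are hypothetical names, the candidates are plain Python callables standing in for generated programs, and memory and compute are not measured. For simplicity, time breaks any tie in accuracy, not just ties at full accuracy.

```python
import time
from typing import Callable, List, Tuple

TestCase = Tuple[tuple, object]  # (arguments, expected result)


def evaluate(candidate: Callable, tests: List[TestCase]) -> Tuple[float, float]:
    """Return (accuracy, elapsed seconds) for a candidate over the test cases."""
    passed = 0
    start = time.perf_counter()
    for args, expected in tests:
        try:
            if candidate(*args) == expected:
                passed += 1
        except Exception:
            pass  # a crash counts as a failed case
    elapsed = time.perf_counter() - start
    return passed / len(tests), elapsed


def pick_winner(a: Callable, b: Callable, tests: List[TestCase]) -> Callable:
    """Accuracy decides first; elapsed time only breaks ties in accuracy."""
    acc_a, time_a = evaluate(a, tests)
    acc_b, time_b = evaluate(b, tests)
    if acc_a != acc_b:
        return a if acc_a > acc_b else b
    return a if time_a <= time_b else b


def square(x):
    return x * x


def double(x):
    return x + x  # only agrees with squaring when x is 0 or 2


tests = [((2,), 4), ((3,), 9), ((-1,), 1)]
winner = pick_winner(square, double, tests)
```

The loser of each pairing would be the one that gets the negative signal.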

I’m not sure “hard but possible” is the bar: you want lots of examples of what doesn’t work along with what does, and you want it for easy problems and hard ones so the model learns everything.

Purplehermann Dec 26, 2024, 4:06 PM
3 points
0
in reply to: Carl Feynman’s comment on: Purplehermann’s Shortform
Product manager, non-technical counterpart to a team lead in a development team

Purplehermann Dec 26, 2024, 2:03 PM
4 points
2
on: Purplehermann’s Shortform
I notice that I’m confused.

Google made an amazing AI for playing chess, by allowing it to make its own data.

Why hasn’t the same thing happened for programming? Have it generate a bunch of pictures with functionality expectations (a PM basically), have it write and run code, then check the output against the requirements it created, then try again when it doesn’t come out right.
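
A minimal sketch of that write-run-check-retry loop, assuming the requirements can be written down as input/output pairs. `meets` and `solve_with_retries` are hypothetical names, and the list of proposals stands in for successive attempts by a model.

```python
from typing import Callable, Iterable, List, Optional, Tuple

Requirement = Tuple[tuple, object]  # (arguments, expected output)


def meets(candidate: Callable, requirements: List[Requirement]) -> bool:
    """True only if the candidate matches every requirement without crashing."""
    for args, expected in requirements:
        try:
            if candidate(*args) != expected:
                return False
        except Exception:
            return False
    return True


def solve_with_retries(
    proposals: Iterable[Callable],
    requirements: List[Requirement],
    max_attempts: int = 5,
) -> Tuple[Optional[Callable], List[Callable]]:
    """Try proposals in order; return (accepted or None, failed attempts)."""
    failures: List[Callable] = []
    for attempt, candidate in enumerate(proposals):
        if attempt >= max_attempts:
            break
        if meets(candidate, requirements):
            return candidate, failures
        failures.append(candidate)
    return None, failures


# Requirements for "absolute value", written as input/output pairs.
requirements = [((3,), 3), ((-3,), 3), ((0,), 0)]
# Stand-ins for three successive attempts; only the last one is correct.
proposals = [lambda x: x, lambda x: -x, abs]
accepted, failures = solve_with_retries(proposals, requirements)
```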

This is even easier where the PM is unnecessary: LeetCode, Codewars, Project Euler...

You could also pay PMs to work with the AI developers, instead of the code tutors xAI is hiring.

There seems to be a preference for having LLMs memorize code instead of figuring things out themselves.

If you run out of things like that you could have it run random programs in different languages, only learning from those that work.

I haven’t used Genesis, but that also seems like a mostly-built validator for programs that AIs can use to create and train on their own data.

With the amount of compute going into training, it should be easy to create huge amounts of data?

Purplehermann Dec 19, 2024, 6:37 PM
2 points
1
on: A Matter of Taste
There’s a certain breadth of taste in reading you can only acquire by reading (and enjoying!) low quality internet novels after you’ve already developed sophisticated taste.

Purplehermann Dec 15, 2024, 10:22 PM
3 points
0
in reply to: Dagon’s comment on: Purplehermann’s Shortform
So unbundle it?

Purplehermann Dec 14, 2024, 9:56 PM
1 point
0
in reply to: Viliam’s comment on: Purplehermann’s Shortform
There is a beautiful thing called unilateral action.

I believe most employers mostly don’t care about conformity as such.

The inner circle stuff is only true of elite schools AFAIK. You can outcompete the rest of the universities.

Purplehermann Dec 14, 2024, 5:19 PM
1 point
0
on: Purplehermann’s Shortform
University education can be made free pretty cheaply.

The cost at scale is in the credentials: you need to make tests, test students, and check those tests.

The classes can be filmed once, and updated every few years if necessary. Each course can have a forum board for discussion and meeting up for studying in groups.

See course credentials for things like AWS.

Purplehermann Dec 13, 2024, 1:49 PM
−2 points
0
on: Biological risk from the mirror world
This implies that we should stop life from developing independently, and that if contact is made with aliens then the human making contact and any environment that’s been in the chain of proximity should be spaced.

Purplehermann Dec 6, 2024, 12:05 PM
2 points
0
in reply to: AnthonyC’s comment on: Should there be just one western AGI project?
Start small. Once you have an attractive umbrella working for a few projects, you can take in the rest of the US, then the world.

Purplehermann Dec 6, 2024, 12:04 PM
1 point
0
on: Should there be just one western AGI project?
In my work I aggregate multiple other systems’ work as well as doing my own.

I think a similar approach may be useful. Create standardized outputs each project has to send to the overarching org, and allow each project to develop its own capabilities and, to a degree, decide how to do what is required to make those outputs meaningfully reflect the capabilities and R&D of the project.

This lays the groundwork for self-regulation, keeps most of the power with the org (assuming it is itself good at actual research and creation) conditional on the org playing nice and being upstanding with the contributing members, and avoids limiting any project before it is necessary.

Purplehermann Dec 5, 2024, 9:51 PM
5 points
1
on: Purplehermann’s Shortform
DOGE.

This is an opportunity to work with the levers of real power. If there are 5 people here who work on this for two years, that’s an in with Senators, Congressmen, bureaucrats and possibly Musk.

Just showing up and making connections while doing hard work is the most efficient way to get power right now, in the time before AI gets dangerous and power will be very relevant.

I do not believe that this should be taken as an opportunity to evangelize. People, not ideology.

This seems like something worth funding if someone would like to but can’t afford it.

Purplehermann Nov 3, 2024, 5:30 PM
0 points
−3
in reply to: J Bostock’s comment on: Purplehermann’s Shortform
The first issue seems minor—even if true, a 40 year old man could have a new arm by 60

Purplehermann Nov 2, 2024, 9:37 PM
9 points
0
on: Purplehermann’s Shortform
What happened to regrowing limbs? From what little I understand, with pluripotent stem cells we could do a lot, except cancer.

Why don’t we use stem cells instead of drilling for cavities? While there are a few types of tissue, tumors are fairly rare in teeth, likely due to minimal blood flow.

Purplehermann Nov 2, 2024, 9:34 PM
4 points
0
on: Purplehermann’s Shortform
Why aren’t research companies set up in Africa/Middle East/China for human research? Cut out most of the bureaucracy and find out fast if something works; if it does, set up a company in a 1st world country to go through the steps.

Purplehermann Oct 18, 2024, 11:08 AM
1 point
0
on: “Slow” takeoff is a terrible term for “maybe even faster takeoff, actually”
Something like iterative/cliff, with fast and slow expressing time scales

Purplehermann Oct 18, 2024, 10:44 AM
1 point
0
on: “Slow” takeoff is a terrible term for “maybe even faster takeoff, actually”
Can you sort the poll options by popularity?

Purplehermann Oct 18, 2024, 10:44 AM
1 point
2
in reply to: Raemon’s comment on: “Slow” takeoff is a terrible term for “maybe even faster takeoff, actually”
Iterative/Sudden