Richard Hollerith. 15 miles north of San Francisco. hruvulum@gmail.com
My probability that AI research will end all human life is .92. It went up drastically when Eliezer started going public with his pessimistic assessment in April 2022. Until then, my confidence in MIRI (and my knowledge that MIRI has enough funding to employ many researchers) kept my probability down to about .4. (I am glad I found out about Eliezer’s assessment.)
Currently I am willing to meet with almost anyone on the subject of AI extinction risk.
Last updated 26 Sep 2023.
You do realize that by “alignment”, the OP (John) is not talking about techniques that prevent an AI that is less generally capable than a capable person from insulting the user or expressing racist sentiments?
We seek a methodology for constructing an AI that either ensures the AI turns out not to be able to easily outsmart us or, if it does turn out to be able to easily outsmart us, ensures (or at least makes it likely) that it won’t kill us all or do some other terrible thing. (The former is not researched much compared to the latter, but I felt the need to include it for completeness.)
As things stand, it is not even clear whether you and the OP (John) are talking about the same thing (because “alignment” has come to have a broad meaning).
If you want to continue the conversation, it would help to know whether you see a pressing need for a methodology of the type I describe above. (Many AI researchers do not: they think that outcomes like human extinction are quite unlikely or at least easy to avoid.)