Daniel Kokotajlo comments on AI Regulation is Unsafe

Daniel Kokotajlo 22 Apr 2024 19:36 UTC
62 points
64
I think this post doesn’t engage with the arguments for anti-x-risk AI regulation, to a degree that I am frustrated by. Surely you know that the people you are talking to—the proponents of anti-x-risk AI regulation—are well aware of the flaws of governments generally and the way in which regulation often stifles progress or results in regulatory capture that is actively harmful to whatever cause it was supposed to support. Surely you know that e.g. Yudkowsky is well aware of all this… right?

The core argument these regulation proponents are making and have been making is “Yeah, regulation often sucks or misfires, but we have to try, because worlds that don’t regulate AGI are even more doomed.”

What do you think will happen, if there is no regulation of AGI? Seriously how do you think it will go? Maybe you just don’t think AGI will happen for another decade or so?
- Richard_Ngo 24 Apr 2024 22:51 UTC
  28 points
  10
  Parent
  I don’t actually think proponents of anti-x-risk AI regulation have thought very much about the ways in which regulatory capture might in fact be harmful to reducing AI x-risk. At least, I haven’t seen much writing about this, nor has it come up in many of the discussions I’ve had (except insofar as I brought it up).
  In general I am against arguments of the form “X is terrible but we have to try it because worlds that don’t do it are even more doomed”. I’ll steal Scott Garrabrant’s quote from here:
  “If you think everything is doomed, you should try not to mess anything up. If your worldview is right, we probably lose, so our best out is the one where your your worldview is somehow wrong. In that world, we don’t want mistaken people to take big unilateral risk-seeking actions.
  Until recently, people with P(doom) of, say, 10%, have been natural allies of people with P(doom) of >80%. But the regulation that the latter group thinks is sufficient to avoid xrisk with high confidence has, on my worldview, a significant chance of either causing x-risk from totalitarianism, or else causing x-risk via governments being worse at alignment than companies would have been. How high? Not sure, but plausibly enough to make these two groups no longer natural allies.
  - Akash 25 Apr 2024 0:58 UTC
    15 points
    9
    Parent
    I’m not sure who you’ve spoken to, but at least among the AI policy people who I talk to regularly (which admittedly is a subset of people who I think are doing the most thoughtful/serious work), I think nearly all of them have thought about ways in which regulation + regulatory capture could be net negative. At least to the point of being able to name the relatively “easy” ways (e.g., governments being worse at alignment than companies).
    I continue to think people should be forming alliances with those who share similar policy objectives, rather than simply those who belong in the “I believe xrisk is a big deal” camp. I’ve seen many instances in which the “everyone who believes xrisk is a big deal belongs to the same camp” mentality has been used to dissuade people from communicating their beliefs, communicating with policymakers, brainstorming ideas that involve coordination with other groups in the world, disagreeing with the mainline views held by a few AIS leaders, etc.
    The cultural pressures against policy advocacy have been so strong that it’s not surprising to see folks say things like “perhaps our groups are no longer natural allies” now that some of the xrisk-concerned people are beginning to say things like “perhaps the government should have more of a say in how AGI development goes than in status quo, where the government has played ~0 role and ~all decisions have been made by private companies.”
    Perhaps there’s a multiverse out there in which the AGI community ended up attracting govt natsec folks instead of Bay Area libertarians, and the cultural pressures are flipped. Perhaps in that world, the default cultural incentives pushed people heavily brainstorming ways that markets and companies could contribute meaningfully to the AGI discourse, and the default position for the “AI risk is a big deal” camp was “well obviously the government should be able to decide what happens and it would be ridiculous to get companies involved– don’t be unilateralist by going and telling VCs about this stuff.”
    I bring up this (admittedly kinda weird) hypothetical to point out just how skewed the status quo is. One might generally be wary of government overinvolvement in regulating emerging technologies yet still recognize that some degree of regulation is useful, and that position would likely still push them to be in the “we need more regulation than we currently have” camp.
    As a final note, I’ll point out to readers less familiar with the AI policy world that serious people are proposing lots of regulation that is in between “status quo with virtually no regulation” and “full-on pause.” Some of my personal favorite examples include: emergency preparedness (akin to the OPPR), licensing (see Romney), reporting requirements, mandatory technical standards enforced via regulators, and public-private partnerships.
    - Richard_Ngo 25 Apr 2024 1:02 UTC
      4 points
      0
      Parent
      I’m not sure who you’ve spoken to, but at least among the people who I talk to regularly who I consider to be doing “serious AI policy work” (which admittedly is not everyone who claims to be doing AI policy work), I think nearly all of them have thought about ways in which regulation + regulatory capture could be net negative. At least to the point of being able to name the relatively “easy” ways (e.g., governments being worse at alignment than companies).
      I don’t disagree with this; when I say “thought very much” I mean e.g. to the point of writing papers about it, or even blog posts, or analyzing it in talks, or basically anything more than cursory brainstorming. Maybe I just haven’t seen that stuff, idk.
  - Daniel Kokotajlo 25 Apr 2024 18:10 UTC
    10 points
    3
    Parent
    Hmm, I’m a bit surprised to hear you say that. I feel like I myself brought up regulatory capture a bunch of times in our conversations over the last two years. I think I even said it was the most likely scenario, in fact, and that it was making me seriously question whether what we were doing was helpful. Is this not how you remember it? Wanna hop on a call to discuss?
    
    As for arguments of that form… I didn’t say X is terrible, I said it often goes badly. If you round that off to “X is terrible” and fit it into the argument-form you are generally against, then I think to be consistent you’d have to give a similar treatment to a lot of common sense good things. Like e.g. doing surgery on a patient who seems likely to die absent treatment.
    
    I also think we might be talking past each other re regulation. As I said elsewhere in this discussion (on the OP’s blog) I am not in favor of generically increasing the amount of AI regulation in the world. I would instead advocate for something more targeted—regulation that I actually think would really work, if implemented well. And I’m very concerned about the “if implemented well” part and have a lot to say about that too.
    
    What are the regulations that you are concerned about, that (a) are being seriously advocated by people with P(doom) >80%, and (b) that have a significant chance of causing x-risk via totalitarianism or technical-alignment-incompetence?
  - Matthew Barnett 25 Apr 2024 0:28 UTC
    3 points
    −22
    Parent
    Until recently, people with P(doom) of, say, 10%, have been natural allies of people with P(doom) of >80%. But the regulation that the latter group thinks is sufficient to avoid xrisk with high confidence has, on my worldview, a significant chance of either causing x-risk from totalitarianism, or else causing x-risk via governments being worse at alignment than companies would have been.
    I agree. Moreover, a p(doom) of 10% vs. 80% means a lot for people like me who think the current generation of humans have substantial moral value (i.e., people who aren’t fully committed to longtermism).
    In the p(doom)=10% case, burdensome regulations that appreciably delay AI, or greatly reduce the impact of AI, have a large chance of causing the premature deaths of people who currently exist, including our family and friends. This is really bad if you care significantly about people who currently exist.
    This consideration is sometimes neglected in these discussions, perhaps because it’s seen as a form of selfish partiality that we should toss aside. But in my opinion, morality is allowed to be partial. Morality is whatever we want it to be. And I don’t have a strong urge to sacrifice everyone I know and love for the sake of slightly increasing (in my view) the chance of the human species being preserved.
    (The additional considerations of potential totalitarianism, public choice arguments, and the fact that I think unaligned AIs will probably have moral value, make me quite averse to very strong regulatory controls on AI.)
    - Daniel Kokotajlo 26 Apr 2024 14:51 UTC
      3 points
      0
      Parent
      So, it sounds like you’d be in favor of a 1-year pause or slowdown then, but not a 10-year?
      
      (Also, I object to your side-swipe at longtermism. Longtermism according to wikipedia is “Longtermism is the ethical view that positively influencing the long-term future is a key moral priority of our time.” “A key moral priority” doesn’t mean “the only thing that has substantial moral value.” If you had instead dunked on classic utilitarianism, I would have agreed.)
      - Matthew Barnett 26 Apr 2024 17:53 UTC
        2 points
        0
        Parent
        
        So, it sounds like you’d be in favor of a 1-year pause or slowdown then, but not a 10-year?
        
        That depends on the benefits that we get from a 1-year pause. I’d be open to the policy, but I’m not currently convinced that the benefits would be large enough to justify the costs.
        
        Also, I object to your side-swipe at longtermism
        
        I didn’t side-swipe at longtermism, or try to dunk on it. I think longtermism is a decent philosophy, and I consider myself a longtermist in the dictionary sense as you quoted. I was simply talking about people who aren’t “fully committed” to the (strong) version of the philosophy.
        Daniel Kokotajlo 26 Apr 2024 23:07 UTC
        2 points
        0
        Parent
        OK, thanks for clarifying.
        
        Personally I think a 1-year pause right around the time of AGI would give us something like 50% of the benefits of a 10-year pause. That’s just an initial guess, not super stable. And quantitatively I think it would improve overall chances of AGI going well by double-digit percentage points at least. Such that it makes sense to do a 1-year pause even for the sake of an elderly relative avoiding death from cancer, not to mention all the younger people alive today.
        Matthew Barnett 26 Apr 2024 23:34 UTC
        4 points
        0
        Parent
        And quantitatively I think it would improve overall chances of AGI going well by double-digit percentage points at least.
        Makes sense. By comparison, my own unconditional estimate of p(doom) is not much higher than 10%, and so it’s hard on my view for any intervention to have a double-digit percentage point effect.
        The crude mortality rate before the pandemic was about 0.7%. If we use that number to estimate the direct cost of a 1-year pause, then this is the bar that we’d need to clear for a pause to be justified. I find it plausible that this bar could be met, but at the same time, I am also pretty skeptical of the mechanisms various people have given for how a pause will help with AI safety.
        Daniel Kokotajlo 27 Apr 2024 0:19 UTC
        5 points
        3
        Parent
        I agree that 0.7% is the number to beat for people who mostly focus on helping present humans and who don’t take acausal or simulation argument stuff or cryonics seriously. I think that even if I was much more optimistic about AI alignment, I’d still think that number would be fairly plausibly beaten by a 1-year pause that begins right around the time of AGI.
        
        What are the mechanisms people have given and why are you skeptical of them?
        ryan_greenblatt 27 Apr 2024 0:42 UTC
        4 points
        0
        Parent
        (Surely cryonics doesn’t matter given a realistic action space? Usage of cryonics is extremely rare and I don’t think there are plausible (cheap) mechanisms to increase uptake to >1% of population. I agree that simulation arguments and similar considerations maybe imply that “helping current humans” is either incoherant or unimportant.)
        Daniel Kokotajlo 27 Apr 2024 15:00 UTC
        4 points
        2
        Parent
        Good point, I guess I was thinking in that case about people who care a bunch about a smaller group of humans e.g. their family and friends.
        Amalthea 27 Apr 2024 0:11 UTC
        4 points
        4
        Parent
        Somewhat of a nitpick, but the relevant number would be p(doom | strong AGI being built) (maybe contrasted with p(utopia | strong AGI)) , not overall p(doom).
    - quetzal_rainbow 26 Apr 2024 15:53 UTC
      −12 points
      −5
      Parent
      May I strongly recommend that you try to become a Dark Lord instead?
      I mean, literally. Stage some small bloody civil war with expected body count of several millions, become dictator, provide everyone free insurance coverage for cryonics, it will be sure more ethical than 10% of chance of killing literally everyone from the perspective of most of ethical systems I know.
      - Matthew Barnett 26 Apr 2024 20:34 UTC
        5 points
        3
        Parent
        I don’t think staging a civil war is generally a good way of saving lives. Moreover, ordinary aging has about a 100% chance of “killing literally everyone” prematurely, so it’s unclear to me what moral distinction you’re trying to make in your comment. It’s possible you think that:
        Death from aging is not as bad as death from AI because aging is natural whereas AI is artificial
        Death from aging is not as bad as death from AI because human civilization would continue if everyone dies from aging, whereas it would not continue if AI kills everyone
        In the case of (1) I’m not sure I share the intuition. Being forced to die from old age seems, if anything, worse than being forced to die from AI, since it is long and drawn-out, and presumably more painful than death from AI. You might also think about this dilemma in terms of act vs. omission, but I am not convinced there’s a clear asymmetry here.
        In the case of (2), whether AI takeover is worse depends on how bad you think an “AI civilization” would be in the absence of humans. I recently wrote a post about some reasons to think that it wouldn’t be much worse than a human civilization.
        In any case, I think this is simply a comparison between “everyone literally dies” vs. “everyone might literally die but in a different way”. So I don’t think it’s clear that pushing for one over the other makes someone a “Dark Lord”, in the morally relevant sense, compared to the alternative.
        Amalthea 27 Apr 2024 2:32 UTC
        1 point
        0
        Parent
        I think the perspective that you’re missing regarding 2. is that by building AGI one is taking the chance of non-consensually killing vast amounts of people and their children for some chance of improving one’s own longevity.
        
        Even if one thinks it’s a better deal for them, a key point is that you are making the decision for them by unilaterally building AGI. So in that sense it is quite reasonable to see it as an “evil” action to work towards that outcome.
        Matthew Barnett 27 Apr 2024 3:02 UTC
        4 points
        2
        Parent
        non-consensually killing vast amounts of people and their children for some chance of improving one’s own longevity.
        I think this misrepresents the scenario since AGI presumably won’t just improve my own longevity: it will presumably improve most people’s longevity (assuming it does that at all), in addition to all the other benefits that AGI would provide the world. Also, both potential decisions are “unilateral”: if some group forcibly stops AGI development, they’re causing everyone else to non-consensually die from old age, by assumption.
        I understand you have the intuition that there’s an important asymmetry here. However, even if that’s true, I think it’s important to strive to be accurate when describing the moral choice here.
        Amalthea 27 Apr 2024 4:14 UTC
        3 points
        0
        Parent
        I agree that potentially the benefits can go to everyone. The point is that as the person pursuing AGI you are making the choice for everyone else.
        The asymmetry is that if you do something that creates risk for everyone else, I believe that does single you out as an aggressor? While conversely, enforcing norms that prevent such risky behavior seems justified. The fact that by default people are mortal is tragic, but doesn’t have much bearing here. (You’d still be free to pursue life-extension technology in other ways, perhaps including limited AI tools).
        Ideally, of course, there’d be some sort of democratic process here that let’s people in aggregate make informed (!) choices. In the real world, it’s unclear what a good solution here would be. What we have right now is the big labs creating facts that society has trouble catching up with, which I think many people are reasonably uncomfortable with.
  - quetzal_rainbow 25 Apr 2024 19:33 UTC
    1 point
    0
    Parent
    governments being worse at alignment than companies would have been
    How exactly absence of regulation prevents governments from working on AI? Thanks to OpenAI/DeepMind/Anthropic, possibility of not attracting government attention at all is already lost. If you want government to not do bad work on alignment, you should prohibit government to work on AI using, yes, government regulations.
- Maxwell Tabarrok 22 Apr 2024 20:03 UTC
  11 points
  9
  Parent
  Daniel and I continue the comment thread here
  https://open.substack.com/pub/maximumprogress/p/ai-regulation-is-unsafe?r=awlwu&utm_campaign=comment-list-share-cta&utm_medium=web&comments=true&commentId=54561569
  - quila 23 Apr 2024 0:52 UTC
    30 points
    4
    Parent
    (crossposting here to avoid trivial inconveniences)
- Quadratic Reciprocity 23 Apr 2024 1:40 UTC
  9 points
  0
  Parent
  From the comment thread:
  
  I’m not a fan of *generic* regulation-boosting. Like, if I just had a megaphone to shout to the world, “More regulation of AI!” I would not use it. I want to do more targeted advocacy of regulation that I think is more likely to be good and less likely to result in regulatory-capture
  What are specific regulations / existing proposals that you think are likely to be good? When people are protesting to pause AI, what do you want them to be speaking into a megaphone (if you think those kinds of protests could be helpful at all right now)?
  - Daniel Kokotajlo 23 Apr 2024 3:31 UTC
    21 points
    15
    Parent
    Reporting requirements, especially requirements to report to the public what your internal system capabilities are, so that it’s impossible to have a secret AGI project. Also reporting requirements of the form “write a document explaining what capabilities, goals/values, constraints, etc. your AIs are supposed to have, and justifying those claims, and submit it to public scrutiny. So e.g. if your argument is ‘we RLHF’d it to have those goals and constraints, and that probably works because there’s No Evidence of deceptive alignment or other speculative failure modes’ then at least the world can see that no, you don’t have any better arguments than that.
    
    That would be my minimal proposal. My maximal proposal would be something like “AGI research must be conducted in one place: the United Nations AGI Project, with a diverse group of nations able to see what’s happening in the project and vote on each new major training run and have their own experts argue about the safety case etc.”
    
    There’s a bunch of options in between.
    
    I’d be quite happy with an AGI Pause if it happened, I just don’t think it’s going to happen, the corporations are too powerful. I also think that some of the other proposals are strictly better while also being more politically feasible. (They are more complicated and easily corrupted though, which to me is the appeal of calling for a pause. Harder to get regulatory-captured than something more nuanced.)
- Rudi C 23 Apr 2024 17:14 UTC
  2 points
  0
  Parent
  A core disagreement is over “more doomed.” Human extinction is preferable to a totalitarian stagnant state. I believe that people pushing for totalitarianism have never lived under it.
  - Wei Dai 25 Apr 2024 3:54 UTC
    15 points
    18
    Parent
    I’ve arguably lived under totalitarianism (depending on how you define it), and my parents definitely have and told me many stories about it. I think AGI increases risk of totalitarianism, and support a pause in part to have more time to figure out how to make the AI transition go well in that regard.
    - Rudi C 26 Apr 2024 19:30 UTC
      5 points
      4
      Parent
      AGI might increase the risk of totalitarianism. OTOH, a shift in the attack-defense balance could potentially boost the veto power of individuals, so it might also work as a deterrent or a force for anarchy.
      
      This is not the crux of my argument, however. The current regulatory Overton window seems to heavily favor a selective pause of AGI, such that centralized powers will continue ahead, even if slower due to their inherent inefficiencies. Nuclear development provides further historical evidence for this. Closed AGI development will almost surely lead to a dystopic totalitarian regime. The track record of Lesswrong is not rosy here; the “Pivotal Act” still seems to be in popular favor, and OpenAI has significantly accelerated closed AGI development while lobbying to close off open research and pioneering the new “AI Safety” that has been nothing but censorship and double-think as of 2024.
      - Daniel Kokotajlo 27 Apr 2024 15:34 UTC
        14 points
        4
        Parent
        I think most people pushing for a pause are trying to push against a ‘selective pause’ and for an actual pause that would apply to the big labs who are at the forefront of progress. I agree with you, however, that the current overton window seems unfortunately centered around some combination of evals-and-mitigations that is at IMO high risk of regulatory capture (i.e. resulting in a selective pause that doesn’t apply to the big corporations that most need to pause!) My disillusionment about this is part of why I left OpenAI.
    - Daniel Kokotajlo 26 Apr 2024 14:52 UTC
      4 points
      2
      Parent
      Big +1 to that. Part of why I support (some kinds of) AI regulation is that I think they’ll reduce the risk of totalitarianism, not increase it.
  - Daniel Kokotajlo 23 Apr 2024 18:41 UTC
    12 points
    6
    Parent
    Who is pushing for totalitarianism? I dispute that AI safety people are pushing for totalitarianism.
    - RussellThor 24 Apr 2024 10:05 UTC
      8 points
      7
      Parent
      There is a belief among some people that our current tech level will lead to totalitarianism by default. The argument is that with 1970′s tech the soviet union collapsed, however with 2020 computer tech (not needing GenAI) it would not. If a democracy goes bad, unlike before there is no coming back. For example Xinjiang—Stalin would have liked to do something like that but couldn’t. When you add LLM AI on everyone’s phone + Video/Speech recognition, organized protest is impossible.
      Not sure if Rudi C is making this exact argument. Anyway if we get mass centralization/totalitarianism worldwide, then S risk is pretty reasonable. AI will be developed under such circumstances to oppress 99% of the population—then goes to 100% with extinction being better.
      I find it hard to know how likely this is. Is clear to me that tech has enabled totalitarianism but hard to give odds etc.
      - Wei Dai 25 Apr 2024 4:02 UTC
        5 points
        1
        Parent
        
        The argument is that with 1970′s tech the soviet union collapsed, however with 2020 computer tech (not needing GenAI) it would not.
        
        I note that China is still doing market economics, and nobody is trying (or even advocating, AFAIK) some very ambitious centrally planned economy using modern computers, so this seems like pure speculation? Has someone actually made a detailed argument about this, or at least has the agreement of some people with reasonable economics intuitions?
        RussellThor 25 Apr 2024 5:54 UTC
        1 point
        0
        Parent
        No I have not seen a detailed argument about this, just the claim that once centralization goes past a certain point there is no coming back. I would like to see such an argument/investigation as I think it is quite important. “Yuval Harari” does say something similar in “Sapiens”
  - MondSemmel 23 Apr 2024 18:09 UTC
    4 points
    6
    Parent
    Flippant response: people pushing for human extinction have never been dead under it, either.