Ebenezer Dukakis comments on MIRI’s June 2024 Newsletter

Ebenezer Dukakis 16 Jun 2024 13:31 UTC
3 points
−4
So what’s the path by which our “general understanding of the situation” is supposed to improve? There’s little point in delaying timelines by a year, if no useful alignment research is done in that year. The overall goal should be to maximize the product of timeline delay and rate of alignment insights.

Also, I think you may be underestimating the ability of newcomers to notice that MIRI tends to ignore its strongest critics. See also previously linked comment.
- Orpheus16 16 Jun 2024 13:49 UTC
  15 points
  5
  Parent
  I think if MIRI engages with “curious newcomers” those newcomers will have their own questions/confusions/objections and engaging with those will improve general understanding.
  
  Based on my experience so far, I don’t expect their questions/confusions/objections to overlap a lot with the questions/confusions/objections that tech-oriented active LW users have.
  
  I also think it’s not accurate to say that MIRI tends to ignore its strongest critics; there’s perhaps more public writing/dialogues between MIRI and its critics than for pretty much any other organization in the space.
  
  My claim is not that MIRI should ignore its critics but moreso that it should focus on replying to criticisms or confusions from “curious and important newcomers”. My fear is that MIRI might engage too much with criticisms from LW users and other ingroup members and not focus enough on engaging with policy folks, whose cruxes and opinions often differ substantially than EG the median LW commentator.
  - Ebenezer Dukakis 16 Jun 2024 14:38 UTC
    2 points
    −10
    Parent
    
    I think if MIRI engages with “curious newcomers” those newcomers will have their own questions/confusions/objections and engaging with those will improve general understanding.
    
    You think policymakers will ask the sort of questions that lead to a solution for alignment?
    
    In my mind, the most plausible way “improve general understanding” can advance the research frontier for alignment is if you’re improving the general understanding of people fairly near that frontier.
    
    Based on my experience so far, I don’t expect their questions/confusions/objections to overlap a lot with the questions/confusions/objections that tech-oriented active LW users have.
    
    I expect MIRI is not the only tech-oriented group policymakers are talking to. So in the long run, it’s valuable for MIRI to either (a) convince other tech-oriented groups of its views, or (b) provide arguments that will stand up against those from other tech-oriented groups.
    
    there’s perhaps more public writing/dialogues between MIRI and its critics than for pretty much any other organization in the space.
    
    I believe they are also the only organization in the space that says its main focus is on communications. I’m puzzled that multiple full-time paid staff are getting out-argued by folks like Alex Turner who are posting for free in their spare time.
    
    If MIRI wants us to make use of any added timeline in a way that’s useful, or make arguments that outsiders will consider robust, I think they should consider a technical communications strategy in addition to a general-public communications strategy. The wave-rock model could help for technical communications as well. Right now their wave game for technical communications seems somewhat nonexistent. E.g. compare Eliezer’s posting frequency on LW vs X.
    
    You depict a tradeoff between focusing on “ingroup members” vs “policy folks”, but I suspect there are other factors which are causing their overall output to be low, given their budget and staffing levels. E.g. perhaps it’s an excessive concern with org reputation that leads them to be overly guarded in their public statements. In which case they could hire an intern to argue online for 40 hours a week, and if the intern says something dumb, MIRI can say “they were just an intern—and now we fired them.” (Just spitballing here.)
    
    It’s puzzling to me that MIRI originally created LW for the purpose of improving humanity’s thinking about AI, and now Rob says that’s their “main focus”, yet they don’t seem to use LW that much? Nate hasn’t said anything about alignment here in the past ~6 months. I don’t exactly see them arguing with the ingroup too much.
    - Orpheus16 16 Jun 2024 17:36 UTC
      4 points
      1
      Parent
      Don’t have time to respond in detail but a few quick clarifications/responses:
      
      — I expect policymakers to have the most relevant/important questions about policy and to be the target audience most relevant for enacting policies. Not solving technical alignment. (Though I do suspect that by MIRI’s lights, getting policymakers to understand alignment issues would be more likely to result in alignment progress than having more conversations with people in the technical alignment space.)
      
      — There are lots of groups focused on comms/governance. MIRI is unique only insofar as it started off as a “technical research org” and has recently pivoted more toward comms/governance.
      
      — I do agree that MIRI has had relatively low output for a group of its size/resources/intellectual caliber. I would love to see more output from MIRI in general. Insofar as it is constrained, I think they should be prioritizing “curious policy newcomers” over people like Matthew and Alex. — Minor but I don’t think MIRI is getting “outargued” by those individuals and I think that frame is a bit too zero-sum.
      
      — Controlling for overall level of output, I suspect I’m more excited than you about MIRI spending less time on LW and more time on comms/policy work with policy communities (EG Malo contributing to the Schumer insight forums, MIRI responding to government RFCs). — My guess is we both agree that MIRI could be doing more on both fronts and just generally having higher output. My impression is they are working on this and have been focusing on hiring; I think if their output stayed relatively the same 3-6 months from now I will be fairly disappointed.
      - Ebenezer Dukakis 17 Jun 2024 0:52 UTC
        1 point
        0
        Parent
        
        Don’t have time to respond in detail but a few quick clarifications/responses:
        
        Sure, don’t feel obligated to respond, and I invite the people disagree-voting my comments to hop in as well.
        
        — There are lots of groups focused on comms/governance. MIRI is unique only insofar as it started off as a “technical research org” and has recently pivoted more toward comms/governance.
        
        That’s fair, when you said “pretty much any other organization in the space” I was thinking of technical orgs.
        
        MIRI’s uniqueness does seem to suggest it has a comparative advantage for technical comms. Are there any organizations focused on that?
        
        by MIRI’s lights, getting policymakers to understand alignment issues would be more likely to result in alignment progress than having more conversations with people in the technical alignment space
        
        By ‘alignment progress’ do you mean an increased rate of insights per year? Due to increased alignment funding?
        
        Anyway, I don’t think you’re going to get “shut it all down” without either a warning shot or a congressional hearing.
        
        If you just extrapolate trends, it wouldn’t particularly surprise me to see Alex Turner at a congressional hearing arguing against “shut it all down”. Big AI has an incentive to find the best witnesses it can, and Alex Turner seems to be getting steadily more annoyed. (As am I, fwiw.)
        
        Again, extrapolating trends, I expect MIRI’s critics like Nora Belrose will increasingly shift from the “inside game” of trying to engage w/ MIRI directly to a more “outside game” strategy of explaining to outsiders why they don’t think MIRI is credible. After the US “shuts it down”, countries like the UAE (accused of sponsoring genocide in Sudan) will likely try to quietly scoop up US AI talent. If MIRI is considered discredited in the technical community, I expect many AI researchers to accept that offer instead of retooling their career. Remember, a key mistake the board made in the OpenAI drama was underestimating the amount of leverage that individual AI researchers have, and not trying to gain mindshare with them.
        
        Pause maximalism (by which I mean focusing 100% on getting a pause and not trying to speed alignment progress) only makes sense to me if we’re getting a ~complete ~indefinite pause. I’m not seeing a clear story for how that actually happens, absent a much broader doomer consensus. And if you’re not able to persuade your friends, you shouldn’t expect to persuade your enemies.
        
        Right now I think MIRI only gets their stated objective in a world where we get a warning shot which creates a broader doom consensus. In that world it’s not clear advocacy makes a difference on the margin.