Lots of disagree votes, but no discussion. So annoying when that happens.
Propose a bet! Ask for my sources! Point out a flaw in my reasoning! Don’t just disagree and walk away!
Feeding this norm creates friction and filters the evidence elicited by agreement-voting. If there is a sense that a vote needs to be explained, it often won’t be cast.
Agree. I do think it is annoying, but allowing people to do that is quite crucial for the integrity of the voting system.
I’m not interested in modifying the site to prevent this, or even really in modifying the underlying norm (much). I guess I’m just complaining on my own behalf, making an ask of the community. And the result was that the squeaky wheel got the grease: I got the thoughtful engagement I was hoping for.
So, in conclusion, I’ll try to frame my future pleas for engagement in a way which doesn’t imply I think the norm of voting-without-commenting is bad.
I feel like a bet is fundamentally unfair here because, in the cases where I’m wrong, there’s a high chance that I’ll be dead anyway and won’t have to pay. The combination of long timelines but high P(doom|AGI soon) means I’m not really risking my reputation/money in the way I’m supposed to with a bet. Are you optimistic about alignment, or does this asymmetry not bother you for other reasons? (And I don’t have the money to make a big bet regardless.)
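To put toy numbers on that asymmetry (hypothetical figures, purely for illustration, not anything either of us has stated): if I’d only be around to pay with, say, 20% probability conditional on losing, then a nominal stake $S$ only costs me

$$\Pr(\text{alive}\mid\text{lose}) \cdot S = 0.2\,S$$

in expectation, which is far less skin in the game than the bet nominally represents.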
Great question! Short answer: I’m optimistic about muddling through with partial alignment combined with AI control and AI governance (limiting peak AI capabilities, global enforcement of anti-rogue-AI, anti-self-improving-AI, and anti-self-replicating-weapons laws). See my post “A Path to Human Autonomy” for more details.
I also don’t have money for big bets. I’m more interested in mostly-reputation-wagers about the very near future, so that I might get my reputational returns in time for them to pay off in respectful-attention-from-powerful-decisionmakers, which in turn I would hope might pay off in better outcomes for me, my loved ones, and humanity.
If I am incorrect, then I want to not be given the ear of decision makers, and I want them to instead pay more attention to someone with better models than me. Thus, it seems to me like a fairly win-win situation to make short-term reputational bets.
Gotcha. I’m happy to offer 600 of my reputation points vs. 200 of yours on your description of 2026-2028 not panning out. (In general, if it becomes obvious[1] that we’re racing toward ASI in the next few years, then people should probably not take me seriously anymore.)
[1] Well, so obvious that I agree, anyway; apparently it’s already obvious to some people.
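Spelling out the implied odds, on a simple expected-value reading of the stakes: risking 600 points to win 200 only breaks even if I put roughly 75% or more on your 2026-2028 description not panning out,

$$p \cdot 200 - (1-p)\cdot 600 \ge 0 \;\iff\; p \ge \frac{600}{600+200} = 0.75,$$

where $p$ is my probability that the description does not pan out.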
I’ll happily accept that bet, but maybe we could also come up with something more specific about the next 12 months?
Example: https://manifold.markets/MaxHarms/will-ai-be-recursively-self-improvi
Not that one; I would not be shocked if this market resolves Yes. I don’t have an alternative operationalization on hand; it would have to be about AI doing serious intellectual work on real problems without any human input. (My model permits AI to be very useful in assisting humans.)
Hmm, yes. I agree that there’s something about self-guiding/self-correcting on complex, lengthy, open-ended tasks where current AIs seem to be at near-zero performance.
I do expect this to improve dramatically in the next 12 months. I think this current lack is due more to limitations in the training regimes so far than to limitations in the algorithms/architectures.
Contrast this with the difficulty of ARC-AGI, which seems like it might be an architecture weakness?
Can we bet karma?
Edit: sarcasm