If in, let’s say, two years it turns out that we have not faced serious danger from hostile AI, will Eliezer then agree that his reasoning went down the wrong path and AI danger is substantially less than he had claimed? (Without explanations such as “well, we’re still in danger but you have to wait a little longer to see it”)
Because the higher your confidence in X, the more you are wrong if not-X turns out to be the case.
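(To put a rough number on that claim: under the standard logarithmic scoring rule, the penalty for a failed prediction grows without bound as the stated confidence approaches certainty. This is just an illustrative sketch of that general point, not anything Eliezer or Jiro actually computed.)

```python
import math

def log_score_penalty(prob_assigned_to_actual_outcome: float) -> float:
    """Penalty under the logarithmic scoring rule:
    -log of the probability you assigned to what actually happened."""
    return -math.log(prob_assigned_to_actual_outcome)

# If you assert X with confidence p and not-X happens, you are scored on the
# probability 1 - p you left for not-X; the penalty grows without bound as
# p approaches 1.
for p in (0.6, 0.9, 0.99, 0.999):
    print(f"confidence in X = {p}  ->  penalty if not-X happens: {log_score_penalty(1 - p):.2f}")
```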
My timelines are mostly (substantially over 50%) not that short. Can you tell me where you got the impression they were?
Fair enough. Whatever the actual timeline is, name a number of years before the inevitable AI catastrophe that humans are not going to solve. Your post here suggests it’s at least less than thirty. If that time passes and there is no AI catastrophe, your confidence that you understand AI risk should drastically go down. (And no “well, wait a few more years after the number I originally gave you” either.)
And if you can’t name a number of years, that should heavily discount how urgently everyone should treat it, because you are in a situation where exaggerating its urgency has no risk to you.
If twenty years from now you and I are both still alive and free, I will happily say “You were right, Jiro; I was wrong.”
Eliezer is the person who made the prediction to an audience and who is being taken as an authority, not you.
Hmm… I definitely think people should update if a long time passes and what Eliezer says doesn’t come to pass.
On the other hand, I don’t know how I feel about “reputational staking”. Part of the problem is that in order to have a bet you need a counterparty. If Eliezer does stake his reputation, perhaps a few people will become marginally more concerned about safety, but if he were later to recant, that would probably affect a lot more people. So it’s unclear whether there’s a viable bet here.
That’s why I didn’t call this a bet. (I also didn’t demand he put any money on it, something which rationalists sometimes like and which has its own problems).
The thing about having a counterparty is that this is already asymmetrical. Eliezer is making a dramatic, catastrophic, prediction. If he turns out to be correct, then of course I’ll be proven wrong. I won’t have any other choice but to admit that I’m wrong, as we’re all herded into shredders so our bodies can be used to make paperclips.
But can Eliezer be proven wrong? No, not if he stays vague about exactly how long we need to wait, and leaves open the possibility of “oh, I said 5 years? I meant 7. No, I meant 9....”
And if he can’t be proven wrong, he has no incentive not to exaggerate the danger. The way it should work is that the more catastrophic your prediction is, the worse you look when you fail, so you’re not going to exaggerate the danger just to get people to listen to you.
Agreed. The current statements are so vague as to be impossible to falsify, and hence, they are not scientific statements.