I don’t care about whether the AI is open-sourced (I don’t expect anyone to publish the weights even if they describe their method) and I’m not that worried about our ability to arbitrate overfitting.
Ajeya suggested that I clarify: I’m significantly more impressed by an AI getting a gold medal than getting a bronze, and my 4% probability is for getting a gold in particular (as described in the IMO grand challenge). There are some categories of problems that can be solved with easy automation (I’d guess about 5-10% could be done with no deep learning and modest effort). Together with modest progress in deep-learning-based methods, and a somewhat serious effort, I wouldn’t be surprised by people getting up to 20-40% of problems. The bronze cutoff is usually 3/6 problems, and the gold cutoff is usually 5/6 (assuming the AI doesn’t get partial credit). The difficulty of the problems also increases very rapidly for humans; there are often 3 problems that a human can do more-or-less mechanically.
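To make the cutoff arithmetic concrete, here’s a minimal sketch of how a 20-40% per-problem solve rate translates into medal chances, under the simplifying (and plainly wrong) assumption that each of the 6 problems is solved independently with a single probability p:

```python
from math import comb

def p_at_least(k, n, p):
    """P(solving >= k of n problems) when each problem is solved
    independently with probability p. This is a toy model; real IMO
    problems vary hugely in difficulty."""
    return sum(comb(n, j) * p**j * (1 - p)**(n - j) for j in range(k, n + 1))

# Per-problem solve rates in the 20-40% range from the estimate above.
for p in (0.2, 0.4):
    print(f"p={p}: bronze (3+/6) ~ {p_at_least(3, 6, p):.3f}, "
          f"gold (5+/6) ~ {p_at_least(5, 6, p):.4f}")
```

At p=0.2 this gives roughly a 10% chance at the bronze cutoff but well under 1% at the gold cutoff; at p=0.4 it’s about 46% and 4% respectively. The non-uniform difficulty of real problems breaks the model, but the size of the bronze/gold gap is the point.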
I could tighten any of these estimates by looking at the distribution more carefully rather than going off of my recollections from 2008, and if this was going to be one of a handful of things we’d bet about I’d probably spend a few hours doing that and some other basic digging.
I looked at a few recent IMOs to get better calibrated. I think the main update is that I significantly underestimated how many years you can get a gold with only 4/6 problems.
For example I don’t have the same “this is impossible” reaction about IMO 2012 or IMO 2015 as about most years. That said, I feel like they’d have to get reasonably lucky with the IMO content, and someone would have to make a serious and mostly-successful effort; but I’m at least a bit scared by that. There’s also quite often a geometry problem as 3 or 6.
Might be good to make some side bets:
Conditioned on winning gold, I think it’s only maybe a 20% probability that it gets all 6 problems (whereas I think you might have a higher probability on jumping right past human level, or at least put 50% on 6 vs 5?); see the toy calculation after these bets.
Conditioned on a model getting 3+ problems I feel like we have a pretty good guess about what algorithm will be SOTA on this problem (e.g. I’d give 50% to a pretty narrow class of algorithms with some uncertain bells and whistles, with no inside knowledge). Whereas I’d guess you have a much broader distribution.
But more useful to get other categories of bets. (Maybe in programming, investment in AI, economic impact from robotics, economic impact from chatbots, translation?)
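On the first side bet: under the same toy independence model as above, the chance of a clean sweep given a gold has a simple closed form, which gives a feel for where a 20% conditional lands. The per-problem solve rates here are made-up numbers for illustration:

```python
# P(6/6 | at least 5/6) = p^6 / (p^6 + 6*p^5*(1-p)) = p / (6 - 5*p)
for p in (0.4, 0.5, 0.6):  # illustrative per-problem solve rates (assumptions)
    print(f"p={p}: P(all 6 | gold) ~ {p / (6 - 5 * p):.2f}")
```

This prints roughly 0.10, 0.14, and 0.20: a 20% conditional corresponds to a fairly high per-problem rate under independence, and lower rates push it toward 10%.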
Going through the previous ten IMOs, and imagining a very impressive automated theorem prover, I think:
2020 - unlikely, need 5/6 and probably can’t get problems 3 or 6. Also good chance to mess up at 4 or 5
2019 - tough but possible, 3 seems hard but even that is not unimaginable, 5 might be hard but might be straightforward, and it can afford to get one wrong
2018 - tough but possible, 3 is easier for machine than human but probably still hard, 5 may be hard, can afford to miss one
2017 - tough but possible, 3 looks out of reach, 6 looks hard but not sure about that, 5 looks maybe hard, 1 is probably easy. But it can miss 2, which could happen.
2016 - probably not possible, 3 and 6 again look hard, and good chance to fail on 2 and 5, only allowed to miss 1
2015 - seems possible, 3 might be hard but like 50-50 it’s simple for machine, 6 is probably hard, but you can miss 2
2014 - probably not possible, can only miss 1, probably miss one of 2 or 5 and 6
2013 - probably not possible, 6 seems hard, 2 seems very hard, can only miss 1
2012 - tough but possible, 6 and 3 look hard but you can miss 2
2011 - seems possible, allowed to miss two and both 3 and 6 look brute-forceable
Overall this was much easier than I expected: 4/10 seem unlikely, 4/10 seem tough but possible, and 2/10 I can imagine a machine doing. There are a lot of problems that look really hard, but there are a fair number of tests where you can just skip those.
That said, even to get the possible ones you’d need something surprisingly impressive, and that chance gets cut down further because only something like 25-50% of tests are solvable at all. On the other hand, they get to keep trying (assuming they get promising results in early years) and eventually they will hit one of the easier years.
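To see how that repeated-trials effect compounds, here’s a toy aggregation of the tally above. The per-bucket gold probabilities (2%, 15%, 40%) are numbers I’m inventing purely for illustration, and years are treated as independent draws:

```python
# (count of years, assumed P(gold | serious effort)) per bucket -- made-up odds
buckets = {
    "unlikely":           (4, 0.02),
    "tough but possible": (4, 0.15),
    "seems possible":     (2, 0.40),
}

total_years = sum(n for n, _ in buckets.values())
per_year = sum(n * p for n, p in buckets.values()) / total_years
print(f"chance in a random year ~ {per_year:.2f}")  # ~0.15

# Keep entering for k consecutive years; succeed if any single year works out.
for k in (1, 3, 5):
    print(f"within {k} year(s): {1 - (1 - per_year) ** k:.2f}")
```

Even modest per-year odds compound to roughly even odds over five years of sustained effort, which is the dynamic driving the paragraph above.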
It also looks fairly likely to me that if one of DeepMind or OpenAI tries seriously, they’ll be able to get an honorable mention with a quite reasonable chance at bronze. That’s maybe enough of a PR coup to motivate the work, and then it’s more likely there will be a large effort subsequently to finish the job, or to opportunistically take advantage of an easy test.
Overall I’m feeling bad about my 4%. I deserve to lose some points regardless, but I might think about what my real probability is after looking at the tests (though I was also probably moved by other folks in EA systematically giving higher estimates than I did).
What do you think of DeepMind’s new whoop-de-doo about doing research-level math assisted by GNNs?
Not surprising in any of the ways that good IMO performance would be surprising.