It seems to me that being open about what you are working on, and having a proven record of publishing/sharing critical information, including weights, is a very good way to fight the arms race.
If you don’t know where your competitors are, it is much harder to stop and think about alignment instead of rushing toward capability first. If you know where your competitors are, and if you know that at worst you will be a couple of weeks or months behind because they always publish and you will thus be able to catch up, you have much more slack to pursue alignment (or speculative research in general).
For the strategic arms reduction treaties signed between Russia and the USA, verification tools were a crucial part of the process, because you need to know what the other side is doing in order to disarm:
https://en.wikipedia.org/wiki/START_I#Verification_tools
https://en.wikipedia.org/wiki/New_START#Monitoring_and_verification
Yes, when we are getting really close to AGI it will be good for the leading contenders to share info with each other. Even then it won’t be a good idea for the leading contenders to publish publicly, because then there’ll be way more contenders! And now, when we are not really close to AGI, public publication accelerates research in general and thus shortens timelines, while also bringing more actors into the race.
Trust between partners does not happen overnight. You don’t suddenly begin sharing information with competitors when the prize is in sight. We need a history of shared information to build upon, and now—when, as you said, AGI is not really close—is the right time to build it. Because if you don’t trust someone with GPT-3, you are certainly not going to trust them with an AGI.
Because if you don’t trust someone with GPT-3, you are certainly not going to trust them with an AGI.
Choosing to not release GPT-3's weights to the whole world doesn’t imply that you don’t trust DeepMind or Anthropic or whoever. It just implies that there exists at least one person in the world you don’t trust.
I agree that releasing everything publicly would make it easier/more likely to release crucial things to key competitors when the time comes. Alas, the harms are big enough to outweigh this benefit, I think.