AI Research Considerations for Human Existential Safety (ARCHES)

habryka 9 Jul 2020 2:49 UTC

LW: 60 AF: 24

Andrew Critch’s (Academian) and David Krueger’s review of 29 AI (existential) safety research directions, each with an illustrative analogy, examples of current work and potential synergies between research directions, and discussion of ways the research approach might lower (or raise) existential risk.

What links here?

High Reliability Orgs, and AI Companies by Raemon (4 Aug 2022 5:45 UTC; 86 points)


AI World Optimization

Ben Pace 9 Jul 2020 4:03 UTC
LW: 5 AF: 2
Wow, this is long, and seems pretty detailed and interesting. I’d love to see someone write a selection of key quotes or a summary.
- Rohin Shah 9 Jul 2020 6:41 UTC
  LW: 16 AF: 9
  Highlighted in AN #103 with a summary, though it didn’t go into the research directions (because it would have become too long, and I thought the intro + categorization was more important on average).
  - Ben Pace 9 Jul 2020 7:39 UTC
    LW: 2 AF: 1
    Thank you!
- David Scott Krueger (formerly: capybaralet) 19 Sep 2020 4:54 UTC
  LW: 4 AF: 3
  There is now also an interview with Critch here: https://futureoflife.org/2020/09/15/andrew-critch-on-ai-research-considerations-for-human-existential-safety/
  - Ben Pace 19 Sep 2020 23:31 UTC
    LW: 2 AF: 1
    I listened to this yesterday! Was quite interesting, I’m glad I listened to it.
MaxRa 30 Oct 2020 10:15 UTC
1 point
Really enjoyed reading this. The section on “AI pollution” leading to a loss of control over the development of prepotent AI really interested me.
Avoiding [the risk of uncoordinated development of Misaligned Prepotent AI] calls for well-deliberated and respected assessments of the capabilities of publicly available algorithms and hardware, accounting for whether those capabilities have the potential to be combined to yield MPAI technology. Otherwise, the world could essentially accrue “AI-pollution” that might eventually precipitate or constitute MPAI.
- I wonder how realistic it is to predict this. For example, would you basically need the knowledge to build it to have a good sense of that potential?
- I also thought the idea of AI orgs dropping all their work once the potential for this concentrates in another org is relevant here. Are there concrete plans for when this happens?
- Are there discussions about when AI orgs might want to stop publishing things? I only know of MIRI, but would they advise others like OpenAI or DeepMind to follow their example?
FactorialCode 9 Jul 2020 16:59 UTC
1 point
Nitpick: is there a reason why the margins are so large?