Hi Koen,
Glad to hear you liked section 4.3.3. And thanks for pointing to these posts—I certainly haven't reviewed all the literature here, so there may well be reasons for optimism that aren't sufficiently salient to me.
Re: black boxes, I do think that black-box systems that emerge from some kind of evolution/search process are more dangerous; but as I discuss in 4.4.1, I also think that the bare fact that the systems are much more cognitively sophisticated than humans creates significant and safety-relevant barriers to understanding, even if such systems have been designed/mechanistically understood at a different level.
Re: “there is a whole body of work which shows that evolved systems are often power-seeking”—anything in particular you have in mind here?
For AI-specific work, the work by Alex Turner mentioned elsewhere in this comment section comes to mind, as backing up a much larger body of reasoning-by-analogy work, like Omohundro (2008). But the main thing I had in mind when making that comment, frankly, was the extensive literature on kings and empires. In broader biology, many genomes/organisms (bacteria, plants, etc.) will also tend to expand to consume all available resources, if you put them in an environment where they can, e.g., without balancing predators.
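To give a concrete flavor of the AI-specific point, here is a minimal toy sketch (my own illustration, not Turner's code or his exact POWER definition): in a small made-up deterministic MDP, averaging optimal value over randomly sampled reward functions scores the state that keeps more options open above the dead end, which is the basic intuition behind the power-seeking results. The state layout, discount factor, and uniform reward distribution below are all assumptions chosen purely for illustration.

```python
import numpy as np

# Toy deterministic MDP: from state 0 you can move to state 1 (a dead end
# that only loops back to itself) or to state 2, which keeps states 3 and 4
# reachable. All states and transitions here are made up for illustration.
successors = {
    0: [1, 2],
    1: [1],
    2: [2, 3, 4],
    3: [3],
    4: [4],
}
gamma = 0.9     # discount factor (assumed)
horizon = 100   # enough iterations for values to converge at this discount

def optimal_value(state, reward):
    """Optimal discounted value of `state` via finite-horizon value iteration."""
    v = {s: 0.0 for s in successors}
    for _ in range(horizon):
        v = {s: reward[s] + gamma * max(v[s2] for s2 in successors[s])
             for s in successors}
    return v[state]

# Average optimal value over many randomly drawn reward functions: a rough,
# informal proxy for Turner-style "POWER" of each of the two choices at state 0.
rng = np.random.default_rng(0)
n_samples = 2000
avg_value = {1: 0.0, 2: 0.0}
for _ in range(n_samples):
    reward = {s: rng.uniform(0.0, 1.0) for s in successors}
    for s in avg_value:
        avg_value[s] += optimal_value(s, reward) / n_samples

# The option-preserving state (2) scores higher than the dead end (1),
# so a reward-maximizing agent at state 0 will usually prefer it.
print(avg_value)
```

The actual theorems are stated in terms of distributions over reward functions and a formally defined POWER quantity; this sketch only tries to show the qualitative effect (keeping options open tends to be instrumentally favored across many possible goals).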