How much are you thinking about stability under optimization? Most objective catastrophes are also human catastrophes. But if a powerful agent is trying to achieve some goal while avoiding objective catastrophes, it seems like it’s still incentivized to dethrone humans—to cause basically the most human-catastrophic thing that’s not objective-catastrophic.
I’m not thinking of optimizing for “not an objective catastrophe” directly—it’s just a useful concept. The next post covers this.