ETA: I am still more concerned about "not enough samples to learn human preferences" than about ELK or inner-optimization-type failures. This seems to be a fairly unpopular view, and I haven't scrutinized it too closely (but I would be interested in discussing it cooperatively).
This is a crux for me; it's why I don't think slow takeoff is good by default. I think deceptive alignment is the default outcome barring interpretability efforts strong enough to actually detect mesa-optimizers or verify myopia. Yes, foom is probably not going to happen, but in my view that doesn't change much about the total risk.
TBC, "more concerned" doesn't mean I'm not concerned about the other ones… and I just noticed that I make this mistake all the time when reading people say they are more concerned about present-day issues than x-risk… hmmm…