Basic question: why would the AI system optimize for X-ness?
I thought Katja’s argument was something like:
Suppose we train a system to generate (say) plans for increasing the profits of your paperclip factory, much as we train GANs to generate faces
Then we would expect those paperclip-factory planners to make errors analogous to face-generator errors
I.e., their failures will not be "eldritch"
The fact that you could repurpose the GAN discriminator in this terrifying way doesn't really seem relevant if no one is actually doing that in practice? (See the sketch below for what "repurposing" means here.)
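To make the distinction concrete, here's a minimal sketch (PyTorch, with toy stand-in networks; all sizes and module definitions are illustrative assumptions, not from any real GAN) of the two uses being contrasted: sampling from a trained generator versus gradient-ascending an input directly against the discriminator's score.

```python
# Illustrative sketch only: toy stand-in networks, not a real trained GAN.
import torch
import torch.nn as nn

Z_DIM, IMG_DIM = 16, 64  # hypothetical toy sizes

generator = nn.Sequential(nn.Linear(Z_DIM, IMG_DIM), nn.Tanh())
discriminator = nn.Sequential(nn.Linear(IMG_DIM, 1), nn.Sigmoid())

# (a) The benign use: sample from the generator. Failures here look like
# ordinary GAN artifacts -- the "analogous errors" in the argument above.
z = torch.randn(1, Z_DIM)
sample = generator(z)

# (b) The repurposed use: gradient-ascend an input directly against the
# discriminator. This is an adversarial search for whatever the
# discriminator happens to score highly, which is where "eldritch"
# outputs would come from.
x = torch.randn(1, IMG_DIM, requires_grad=True)
opt = torch.optim.Adam([x], lr=0.1)
for _ in range(100):
    opt.zero_grad()
    loss = -discriminator(x).mean()  # maximize the discriminator's score
    loss.backward()
    opt.step()
```

The point of the argument, as I read it, is that (a) is what people actually deploy, while the worry applies to (b).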