Why do you think AGI is possible to align? It is known that an AGI will prioritize self-preservation, and it is also known that unknown threats may exist (black swan theory). Why should an AGI care about human values? From the standpoint of threat minimisation, that seems like a waste of time.
Some possible AI architectures are structured as goal-function optimization, and under the assumption that the human brain contains one or more expected-utility maximizers, there is a human utility function that could serve as a possible AI goal. I'm not saying it's likely.
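For concreteness, here is a minimal sketch of the standard decision-theoretic picture being gestured at (the notation is mine, not part of the original reply): an expected-utility maximizer with utility function $U$ selects

$$a^{*} = \arg\max_{a} \; \mathbb{E}\big[\,U(o) \mid a\,\big],$$

where $o$ ranges over outcomes. On this framing, the alignment hope in the reply is that the $U$ plugged into the AI's optimization could be identified with, or extrapolated from, whatever utility function(s) the human brain implements, rather than the AI having no reason to care about human values at all.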