TAG comments on AGI Safety FAQ / all-dumb-questions-allowed thread

TAG 11 Jun 2022 18:41 UTC
3 points
0
Why would even a superintelligent AI want to modify its utility function?
1. For whatever reasons humans do.
2. To achieve some kind of logical consistency (cf. CEV).
3. It can’t help it (for instance, Löbian obstacles prevent it from ensuring goal stability over self-improvement).
- lc 12 Jun 2022 1:17 UTC
  2 points
  −3
  Parent
  Humans don’t “modify their utility function”. They lack one in the first place, because they’re mostly adaptation-executors. You can’t expect an AI with a utility function to be contradictory the way a human would be. There are some utility functions humans would find acceptable in practice, but that’s a different matter, and it seems to be the source of a bit of confusion.
  - TAG 15 Jun 2022 14:48 UTC
    1 point
    0
    Parent
    I don’t have strong reasons to believe all AIs have UFs in the formal sense, so the ones that don’t would be covered by “for whatever reasons humans do”. The idea that any AI is necessarily consistent is pretty naive too. You can get a GPT to say nonsensical things, for instance, because its training data includes a lot of inconsistencies.