There are two important inputs to any mathematical AI alignment framework that need to be provided by humans or taken as unstated assumptions: who counts as a “human”, and what “human values” are – or at least the normatively important part of them. An AI can guess these, but that requires AI Psychology and AI Sociology, which could themselves be biased.
Also see my short-form today about the reflexive stability of AI alignment.
It’s somewhere between the lines here:
“Scale-free, physical” ethics (notably, interacting with naturalistic theories of consciousness rather than phenomenological, hermeneutic, and other “continental” studies and accounts of consciousness)
Clearly, technical AI alignment cannot take a “human” under some formulaic definition as the only or primary moral subject and/or the subject to be aligned with. That would itself be unscientific. Rather, alignment should be based on some naturalistic theory of ethics, e.g., one saying that moral subjectivity is proportional to the integrated information Φ in the agent’s consciousness. The “values” are likewise determined scientifically, from the game-theoretic/evolutionary setup.
So a cyborg would also be a subject of alignment. But this also extends moral/alignment subjectivity to animals, of course.
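To make the Φ-proportional idea concrete, here is a minimal Python sketch, assuming made-up placeholder Φ values (actually computing Φ for a real system is an open and computationally hard problem in IIT). It just normalizes Φ across agents into moral weights and aggregates utilities accordingly; all names and numbers are hypothetical illustrations, not an implementation of any existing proposal.

```python
# Illustrative sketch: moral/alignment weight proportional to integrated
# information Φ. The phi values are made-up placeholders; computing Φ for
# real systems is an open problem in IIT.

from dataclasses import dataclass

@dataclass
class Agent:
    name: str
    phi: float      # hypothetical integrated information of the agent
    utility: float  # the agent's utility under some candidate AI policy

def alignment_weights(agents: list[Agent]) -> dict[str, float]:
    """Normalize Φ across agents so the moral weights sum to 1."""
    total_phi = sum(a.phi for a in agents)
    return {a.name: a.phi / total_phi for a in agents}

def weighted_social_utility(agents: list[Agent]) -> float:
    """Aggregate utility, weighting each agent by its share of Φ."""
    weights = alignment_weights(agents)
    return sum(weights[a.name] * a.utility for a in agents)

# Humans, cyborgs, and animals all get alignment subjectivity on the same
# scale; no formulaic definition of "human" is needed.
agents = [
    Agent("human",  phi=10.0, utility=0.9),
    Agent("cyborg", phi=12.0, utility=0.7),
    Agent("dog",    phi=3.0,  utility=0.5),
]

print(alignment_weights(agents))
print(weighted_social_utility(agents))
```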
I don’t think the term alignment subjectivity has been used before; it looks like a useful term, so let’s coin it :)