Ethical truths are probably different in kind from empirical truths. An advanced AI may learn empirical truths on its own from enough data, but it seems unlikely that it would automatically converge on ethical truth. Instead, it seems that any degree of intelligence can be combined with any kind of goal (the Orthogonality Thesis).
I think the main point of the orthogonality thesis is less that an advanced AI couldn't figure out the true ethics and more that the AI wouldn't be motivated to act ethically even if it figured out the correct theory. If there is a true moral theory and the orthogonality thesis is true, then the thesis of moral internalism (that true moral beliefs are intrinsically motivating) is false. See the section “Unrescuability of moral internalism” here: https://arbital.com/p/normative_extrapolated_volition/
Good point, I see what you mean. I think we could have 2 distinct concepts of “ethics” and 2 corresponding orthogonality theses:
Concept “ethics1” requires ethics to be motivational: a set of rules can only be the true ethics if, necessarily, everyone who knows the rules is motivated to follow them. (I think moral internalists probably use this concept?)
Concept “ethics2” doesn’t require a set of rules to be motivational in order to be the correct ethics.
The orthogonality thesis for ethics1 is what I mentioned: since there are (probably) no rules that necessarily motivate everyone who knows them, there is (probably) no true ethical theory in this sense, so the AI could not find it.
The orthogonality thesis for ethics2 is what you mention: even if the AI finds the true ethical theory, it is not necessarily motivated by it.
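To make the relation to internalism explicit, here is a minimal formal sketch of the ethics2 case in Lean (the names Agent, KnowsTrueEthics and Motivated are just illustrative, not taken from the linked Arbital page):

```lean
-- Minimal sketch (my own formalization, not from the linked Arbital page):
-- `Agent`, `KnowsTrueEthics`, and `Motivated` are illustrative names.
-- Orthogonality thesis for ethics2: some possible agent knows the true
-- ethics yet is not motivated to follow it. Given that, moral internalism
-- ("whoever knows the true ethics is motivated to follow it") cannot hold.
theorem orthogonality2_refutes_internalism
    {Agent : Type} {KnowsTrueEthics Motivated : Agent → Prop}
    (orth2 : ∃ a, KnowsTrueEthics a ∧ ¬ Motivated a) :
    ¬ ∀ a, KnowsTrueEthics a → Motivated a :=
  fun internalism =>
    match orth2 with
    -- The counterexample agent from the orthogonality thesis directly
    -- contradicts the internalist claim applied to that agent.
    | ⟨a, hKnows, hNotMotivated⟩ => hNotMotivated (internalism a hKnows)
```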
Exactly!