I don’t see why the search algorithm would need to be self-modifying.
I don’t see why you would be searching for stability rather than friendliness. Human testers can judge friendliness directly.
It’s how you draw your system box. Evolutionary search is equivalent to a self-modifying program, if you think of the whole search process as the program. The same issues apply.
I think the Sequences do a good job of demolishing the idea that human testers can possibly judge friendliness directly, so long as the AI operates as a black box. If you have a debug view into the operation of the AI, that is a different story, but then you don’t need friendliness anyway.
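To make the box-drawing point concrete, here’s a minimal sketch in Python (the toy linear-function representation, the target behaviour, and names like `evolve` and `fitness` are all illustrative assumptions, not anyone’s actual proposal). The `evolve` loop itself is static code, so a box drawn around just the selector contains nothing self-modifying; but the system as a whole rewrites the program it carries every generation, and the `fitness` function, like a human tester, only ever sees input/output behaviour on a finite set of probes.

```python
import random

def target(x):
    # Behaviour we're selecting for (purely illustrative).
    return 3 * x + 7

def run(candidate, x):
    # A "candidate program" is just a pair (a, b) encoding x -> a*x + b.
    # Any program representation would do; this one keeps the sketch tiny.
    a, b = candidate
    return a * x + b

PROBES = range(-5, 6)  # the finite test set the "tester" gets to see

def fitness(candidate):
    # Black-box evaluation: sees only input/output behaviour, never the
    # candidate's internals. Candidates that agree on all the probes are
    # indistinguishable to it, however they behave off the test set.
    return -sum(abs(run(candidate, x) - target(x)) for x in PROBES)

def mutate(candidate):
    a, b = candidate
    return (a + random.randint(-1, 1), b + random.randint(-1, 1))

def evolve(generations=200, pop_size=20):
    # This loop is static code: a box drawn around it alone contains no
    # self-modification. A box drawn around loop + population is a system
    # that rewrites the program it runs every generation.
    population = [(random.randint(-10, 10), random.randint(-10, 10))
                  for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        survivors = population[: pop_size // 2]
        population = survivors + [mutate(random.choice(survivors))
                                  for _ in range(pop_size - len(survivors))]
    return max(population, key=fitness)

if __name__ == "__main__":
    print(evolve())  # typically lands on (3, 7)
```

Note that two candidates agreeing on every probe are indistinguishable to `fitness` no matter how differently they behave elsewhere, which is the black-box testing problem in miniature.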
If I draw a box around the selection algorithm and find there is nothing self-modifying inside, where’s the circularity?