I’m personally against nearly all discussion of “what should a Friendly AI do?” because friendliness is a very poorly understood concept
What would be a good way to advance in our understanding of that concept, then?
I don’t know. Discuss decision theory? Or ethics? Or something else? …I don’t think “what would friendly AI do?” (WWFAD) is a particularly useful line of thought, but I can’t think of something sufficiently analogous yet useful to replace it with.
What would be a good way to advance in our understanding of that concept, then?
I don’t know. Discuss decision theory? Or ethics? Or something else? …I don’t think “what would friendly AI do?” (WWFAD) is a particularly useful line of thought, but I can’t think of something sufficiently analogous yet useful to replace it with.