How do its beliefs evolve? In particular, what priors and anthropic principles does it use? (epistemology)
I just want to note that there is an underlying assumption there, namely that pragmatics (how to act in the world?) and morals (what should be done?) require getting our map of the world, and the tools for building that map (epistemology), right.
This is a common assumption on LessWrong because of the map-territory jargon. But keep in mind that, as seen from the inside, an algorithm can be extremely different from the world and still function perfectly well in it.
Put differently: there are many surviving and thriving biological entities out there that have epistemically very poor models of the world and still get away with it.
There have also been some very good humans throughout history whose confused or wrong model of how the world works caused them to act in a moral way.