Why assume AGI doesn’t have problems analogous to agency problems? It will have parts of itself that it doesn’t understand well, and which might go rogue.
I think that is mainly the point jedharris argues at greater length. It would be really valuable to understand that mechanism in more detail.