You are right that there is a potential “mind crime” ethical problem above.
One could argue that, to build an advanced AGI that avoids “mind crime”, we could equip the AGI with a highly accurate predictor that is implemented in such a way that it is not actually a highly accurate simulator. I am not exactly sure how one could formally define the constraint of ‘not actually being a simulator’. Also, such an implementation constraint might fundamentally limit the level of predictive accuracy (and therefore intelligence) that can be achieved, but that might be a price we should be willing to pay.
Mathematically speaking, if I want to construct correctness proofs for an AGI safety framework, I think it is valid to model the ‘highly accurate predictor that is not a simulator’ box inside the above AGI as a box whose input-output behavior is equivalent to that of a highly accurate simulator. This is a very useful shortcut when making proofs. But it also means that I am not sure how one should define ‘not actually being a simulator’.
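To make this input-output equivalence concrete, here is a minimal toy sketch in Python. The process being predicted (simple exponential growth) and all function names are hypothetical, invented purely for illustration; the point is only that the two boxes below are indistinguishable from the outside, even though just one of them instantiates the intermediate states.

```python
# A minimal toy sketch (all names hypothetical): two 'predictor boxes'
# with identical input-output behavior, only one of which actually
# simulates the process it is predicting.

def predict_by_simulating(x0: float, rate: float, n_steps: int) -> float:
    """Predictor that IS a simulator: it runs the process step by step,
    so every intermediate state is explicitly instantiated. This is the
    kind of implementation the 'mind crime' worry applies to."""
    x = x0
    for _ in range(n_steps):
        x = rate * x  # each intermediate state exists during computation
    return x

def predict_in_closed_form(x0: float, rate: float, n_steps: int) -> float:
    """Predictor that is NOT a simulator: it jumps straight to the final
    answer via a closed-form expression, never instantiating any of the
    intermediate states."""
    return x0 * rate ** n_steps

# From the outside, i.e. at the level where a correctness proof operates,
# the two boxes are behaviorally equivalent:
assert abs(predict_by_simulating(1.0, 1.01, 50)
           - predict_in_closed_form(1.0, 1.01, 50)) < 1e-9
```

Of course, nothing in this toy example tells us how to verify that a given opaque box is of the second kind, which is exactly the definitional difficulty mentioned above.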
Do we need it to predict people with high accuracy? Humans seem to do well enough with our own level of predictive ability.
In the context of my problem statement, a PAL with high predictive accuracy is in scope to consider. This does not mean that we should or must design a real PAL in this way.
An AGI that exceeds humans in its ability to predict human responses might be a useful tool, e.g. for a politician who wants to make proposals to resolve long-standing human conflicts. But such a tool could definitely also be used for less ethical purposes.