For purposes of the linked submission, we get the correct utility function after the hypercomputer is done running. Ideally, we would save each utility function’s preferred AI that then select the one that was preferred by the correct one, but we only have 1TB of space. Therefore we get almost all of them to agree on a parametrized solution.
AUP over all utility functions might therefore make sense for a limited environment, such as the interior of a box?
Do you mean that utility functions that do not want the AI to seize power at much cost are rare enough that the terabyte that has the highest approval rating will not be approved by a human utility function?
For purposes of the linked submission, we get the correct utility function after the hypercomputer is done running. Ideally, we would save each utility function’s preferred AI that then select the one that was preferred by the correct one, but we only have 1TB of space. Therefore we get almost all of them to agree on a parametrized solution.
AUP over all utility functions might therefore make sense for a limited environment, such as the interior of a box?
Do you mean that utility functions that do not want the AI to seize power at much cost are rare enough that the terabyte that has the highest approval rating will not be approved by a human utility function?