I think this is unfair, since ultimately what “would turn out well” is going to be grounded in human preferences (e.g. CEV). The preference-based determinations they are doing are far from CEV, but they could be seen as a tech-tree prerequisite to CEV, and they are something that can be done now, which “what would turn out well” is not.
While I agree that on factual questions of policy, human approval will diverge from giving them knowledge (it’s already happening with social media), moral questions have no ground truth, contra LessWrong and EA. That means there can only be approval and persuasion in perpetual memetic warfare against other cultures.
There is no ground truth to something as ambiguous as “moral questions” in general, but there is ground truth to, e.g., “do humans on average prefer policy A or policy B when this choice is presented to them?” There is also ground truth to things like “do humans typically think A or B is more morally correct when this choice is presented to them?”, and even “would this typical view be stable under a particular program of intelligence enhancement/reflection Z?” (though “is Z the best way to extrapolate humans?” does not have a ground truth).
“What would more humans vote for?” does have a ground truth, and predicting it seems like the kind of practice in human-modeling that could help develop something CEV-like in the future. Whereas “just do what’s right” does not, as you say, have a ground truth.
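To make that contrast concrete, here is a minimal sketch (in Python, with entirely made-up policy features and simulated survey labels, none of which come from the discussion above) of how “what would more humans vote for?” can be treated as an ordinary prediction problem that gets scored against held-out polling data, which is exactly the kind of checkable ground truth that “just do what’s right” lacks:

```python
# Toy sketch: "which policy would more humans vote for?" as a
# supervised prediction task with a checkable ground truth.
# All features and labels below are hypothetical/simulated.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)

# Hypothetical feature vectors describing policy pairs (A, B),
# e.g. differences in cost, perceived fairness, familiarity.
n_pairs = 500
pair_features = rng.normal(size=(n_pairs, 3))

# Simulated survey outcomes: True if a majority preferred A.
# In practice this label would come from actual polling data.
hidden_weights = np.array([1.5, -0.7, 0.3])
majority_prefers_a = (
    pair_features @ hidden_weights + rng.normal(scale=0.5, size=n_pairs)
) > 0

X_train, X_test, y_train, y_test = train_test_split(
    pair_features, majority_prefers_a, test_size=0.2, random_state=0
)

# The predictor can be scored against held-out survey results --
# that held-out accuracy is the "ground truth" being pointed at.
model = LogisticRegression().fit(X_train, y_train)
print("held-out accuracy:", model.score(X_test, y_test))
```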
That means there can only be approval and persuasion in perpetual memetic warfare against other cultures.
If you mean people are going to continue to argue for different value systems, that seems fine to me? And you can still make a decision on what an AI is going to do (e.g. something CEV-like), even if there is no unambiguously correct choice.