If you don’t fully trust that agent, though, then it seems very tricky to reason about how much you should defer to them, because they may be manipulating you heavily. In such cases the most robust approach seems to be diversifying worldviews via a meta-rationality strategy that includes some strong principles.
This doesn’t seem to follow. Why wouldn’t the ‘strong principles’ also be a product of heavy manipulation?
Strong principles tend to be harder to manipulate, because:
a) Strong principles tend to be simple and clear; there’s not much room for cherrypicking them to produce certain outcomes.
b) Principle-driven actions are less dependent on your specific beliefs, so manipulating those beliefs shifts your behaviour less.
Regardless of how much harder they may be to manipulate, they can never be invulnerable, which implies that, given enough time, all principles, even the strongest, are subject to change.