Imagine an AI system which wipes out humans in order to secure its own power, and later, on reflection, wishes it hadn’t; a wiser system might have avoided taking that action in the first place.
I’m not convinced; this could swing just as easily (if not more so) in the opposite direction: a wiser system with unaligned goals would be more dangerous, not less. I feel moderately confident that wisdom and human-centered ethics are orthogonal categories, and that being wiser therefore does not necessitate greater alignment.
On the topic of the competition itself, are contestants allowed to submit multiple entries?
Multiple entries are very welcome!
[With some kind of anti-munchkin caveat. Submitting your analyses of several different disjoint questions seems great; submitting two versions of largely the same basic content in different styles not so great. I’m not sure exactly how we’d handle it if someone did the latter, but we’d aim for something sensible that didn’t incentivise people to have been silly about it.]
It’s a fair point that wisdom might not be straightforwardly safety-increasing. If someone wanted to explore e.g. assumptions/circumstances under which it is vs isn’t, that would certainly be within scope for the competition.