Do you think that the scalable oversight/iterative alignment proposal that we discussed can get us to the necessary amount of niceness to make humans survive with AGI?
My answer is basically yes.
I was only addressing the question “If we basically failed at alignment, or didn’t align the AI at all, but had a very small amount of niceness, would that lead to good outcomes?”