I don’t think this is the plan? The hope is that, as capabilities grow, so does alignment, whatever this “alignment” thing is. The reality is different, of course.
Edited the post to rename “intrinsically aligned AI” to “intrinsically kind AI” for clarity. As I understand it, the hope is to develop capability techniques and control techniques in parallel. But I know of no major plan for developing capabilities that are hard-linked to control/kindness/whatever in a way that can't easily be removed. (I have heard an idea or two, though, and am planning to write a post about it soon.)