Ok. But don’t you think “reverse engineering human instincts” is a necessary part of the solution?
My intuition is that value is fragile, so we need to specify it. If we want to specify it correctly, either we learn it or we reverse engineer it, no?
I don’t know; I don’t have a coherent idea for a solution. Here’s one of my best ideas (it’s not great).
Yudkowsky split up the solutions in his post; see point 24. The first sub-bullet there is about inferring human values.
Maybe someone else will have different opinions.