I don’t think most people are trying to explicitly write down all human values and then tell them to an AI. Here are some more promising alternatives:
Tell an AI to “consult a human if you aren’t sure what to do”
Instead of explicitly trying to write down human values, have the AI learn them by example (by watching human actions, or reading books, or…)