Vladimir_Nesov comments on Bay area LW meet-up

Vladimir_Nesov 11 Nov 2009 14:29 UTC
0 points

OK, so you’re saying that FAI is not hard because you have to formalize human morality, it’s hard because you have to have a system for formalizing things in general?

This also seems to be the only way out. If human values are too complex to reimplement manually (which seems to be the case), you have to create a tool with the capability to do that automatically. And once you have that tool, cutting angles on the content of human values would just be useless: the tool will work on the whole thing. And you can’t cut corners on the tool itself, like you can’t have a computer with only randomly sampled 50% of circuitry.
- Nick_Tarleton 12 Nov 2009 2:20 UTC
  1 point
  Parent
  
  If human values are too complex to reimplement manually (which seems to be the case), you have to create a tool with the capability to do that automatically. And once you have that tool
  
  You’re right, of course, but the point at hand is what to do before you have that tool.
  - Vladimir_Nesov 12 Nov 2009 2:29 UTC
    0 points
    Parent
    Work towards developing it?
- John_Maxwell 12 Nov 2009 2:14 UTC
  1 point
  Parent
  
  If human values are too complex to reimplement manually (which seems to be the case), you have to create a tool with the capability to do that automatically.
  
  You can’t prove it works before running it in that case. Human values are not some kind of fractal pattern, where something complicated can be generated according to simple rules. In your proposal, the AI would have to learn human values somehow, which means it will have some indicator or another that it’s getting closer to human values (e.g. smiling humans), which will then be susceptible to wire-heading. Having the AI make inferences from a large corpus of human writing might work.