But if we allow something this large to be called “low impact”, then we’re basically allowing anything, as long as there’s some kind of clean-up afterward.
Well, the clean-up afterward is pretty important and valuable! But I feel like you’re misunderstanding me—obviously I think that the initial ‘turning the Earth into diamond mines’ plan is pretty high-impact and shouldn’t be allowed absent detailed consultation with humans. I’m just saying that conditional on that plan being executed, the correct ‘low-impact’ thinking is in fact to implement the clean-up plan, and that therefore impact measures that discourage the clean-up plan are conceptually flawed.
If you’re saying we should actually have it be programmed relative to these variables (or variables like those), I disagree.
I’m not sure whether it should be programmed relative to intuitively natural-seeming variables (e.g. atmospheric nitrogen concentration and number of humans), but I think that as a result of its programming it should be conservative with respect to those variables.
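A minimal toy sketch of what an impact penalty “programmed relative to” a hand-picked set of variables might cash out as; the variable names, baseline values, and weights below are hypothetical illustrations, not anything proposed in this conversation.

```python
# Toy sketch (not any particular proposal): an impact penalty defined
# relative to a hand-picked set of intuitively natural variables.
# All names, baselines, and weights here are hypothetical.

BASELINE = {
    "atmospheric_nitrogen_fraction": 0.78,  # rough pre-action value
    "human_population_billions": 8.0,
}

WEIGHTS = {
    "atmospheric_nitrogen_fraction": 100.0,
    "human_population_billions": 1000.0,
}

def impact_penalty(world_state: dict) -> float:
    """Penalize deviation of the tracked variables from their baseline.

    An agent trained to keep this term small will be conservative with
    respect to exactly these variables, and indifferent to anything the
    designer forgot to list.
    """
    return sum(
        WEIGHTS[name] * abs(world_state[name] - BASELINE[name])
        for name in BASELINE
    )

# Example: a plan that leaves nitrogen alone but halves the population
# is penalized, while a plan that scrambles some untracked variable is not.
print(impact_penalty({"atmospheric_nitrogen_fraction": 0.78,
                      "human_population_billions": 4.0}))  # 4000.0
```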
I assert that the low impact choice should be basically invariant of when you’re instantiated, and that the low impact thing to do is to make a few diamond mines without much of a fuss. You shouldn’t need any clean-up, because there shouldn’t be a mess.
The way I view it, the purpose of designing low-impact desiderata is to give us an idea of how to create a safety measure that doesn’t rely on any value-laden concepts.
The issue with saying that the AI should offset certain variables, such as nitrogen concentration, is that the choice of which variables need offsetting seems arbitrary. If you say, “Well, the AI should offset the nitrogen, but not offset our neurons that now know about the AI’s existence,” then you are introducing values into the discussion of low impact, which kind of defeats the purpose.
Of course, the AI *should* offset the nitrogen, but whether it ought to be part of a low-impact measure is a separate question.
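A companion toy sketch of the whitelist worry above: to decide which post-action deviations get undone, the designer has to supply a list, and choosing that list (nitrogen yes, human memories no) is itself a value judgment. The variable names and whitelist contents are hypothetical illustrations.

```python
# Toy sketch of the offsetting worry: the whitelist of "things to restore"
# is exactly where value judgments sneak into a supposedly value-free
# low-impact measure. Names and contents are hypothetical.

OFFSET_WHITELIST = {
    "atmospheric_nitrogen_fraction",   # "yes, restore this"
    # "human_memories_of_the_AI",      # "no, don't wipe people's minds"
}

def variables_to_offset(pre_action: dict, post_action: dict) -> dict:
    """Return the whitelisted variables the agent should restore.

    Deciding that nitrogen belongs on the whitelist but human memories
    do not is itself a judgment about what matters.
    """
    return {
        name: pre_action[name]
        for name in OFFSET_WHITELIST
        if post_action.get(name) != pre_action.get(name)
    }

pre = {"atmospheric_nitrogen_fraction": 0.78, "human_memories_of_the_AI": 0.0}
post = {"atmospheric_nitrogen_fraction": 0.60, "human_memories_of_the_AI": 1.0}
print(variables_to_offset(pre, post))
# {'atmospheric_nitrogen_fraction': 0.78} -- only the whitelisted change is undone
```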