VCM
Thanks, it is useful to bring these out, though we do mention them in passing. Just to be sure: we are looking at the XRisk thesis, not at some thesis that AI can be “dangerous”, as most technologies will be. The Omohundro-style escalation is precisely what is at issue in our point that instrumental intelligence is not sufficient for XRisk.
… we aren’t trying to prove the absence of XRisk; we are probing the best argument for it?
We tried to find the strongest argument in the literature. This is how we came up with our version:
“
Premise 1: Superintelligent AI is a realistic prospect, and it would be out of human control. (Singularity claim)
Premise 2: Any level of intelligence can go with any goals. (Orthogonality thesis)
Conclusion: Superintelligent AI poses an existential risk for humanity.
”
====
A more formal version with the same propositions might be this:
1. IF there is a realistic prospect that there will be a superintelligent AI system that a) is out of human control and b) can have any goals, THEN there is existential risk for humanity from AI
2. There is a realistic prospect that there will be a superintelligent AI system that a) is out of human control and b) can have any goals
->
3. There is existential risk for humanity from AI
====
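Just as an illustration (our own sketch, not from the paper): the shape of this more formal version is plain modus ponens, so its validity is not really in doubt; all the weight falls on premise 2. A minimal rendering in Lean, with the proposition names ‘Prospect’ and ‘XRisk’ chosen by us for this illustration:
====
-- Minimal sketch of the 'more formal version' above.
-- Proposition names are ours, picked for this illustration only:
-- Prospect : there is a realistic prospect of a superintelligent AI system
--            that a) is out of human control and b) can have any goals
-- XRisk    : there is existential risk for humanity from AI
variable (Prospect XRisk : Prop)

-- Premise 1: Prospect → XRisk; Premise 2: Prospect; Conclusion: XRisk.
-- The proof is just modus ponens, so the form is valid; the open question
-- is whether premise 2 is true, i.e. whether a) and b) can hold together.
example (p1 : Prospect → XRisk) (p2 : Prospect) : XRisk :=
  p1 p2
====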
And now our concern is whether a superintelligence can be both a) and b) - given that a) must be understood in a way that is strong enough to generate existential risk, including “widening the frame”, and b) must be understood as strong enough to exclude reflection on goals. Perhaps that will work only if “intelligent” is understood in two different ways? Thus Premise 2 is doubtful.
Even if that is true, you would still a) get a lot of sickness & suffering, and b) infect a lot of other people (who infect further). So some people would be seriously ill and some would die as a result of this experiment.
Can one be a moral realist and subscribe to the orthogonality thesis? In which version of it? (In other words, does one have to reject moral realism in order to accept the standard argument for XRisk from AI? We had better be told! See section 4.1)
But reasoning about morality? Is that a space governed by logic, or is it a space where anything goes?
Thanks. We are actually more modest. We would like to see a sound argument for XRisk from AI and we investigate what we call ‘the standard argument’; we find it wanting and try to strengthen it, but we fail. So there is something amiss. In the conclusion we admit “we could well be wrong somewhere and the classical argument for existential risk from AI is actually sound, or there is another argument that we have not considered.”
I would say the challenge is to present a sound argument (valid + true premises) or at least a valid argument with decent inductive support for the premises. Oddly, we do not seem to have that.
… plus we say that in the paper :)
Maximal overall utility is better than minimal overall utility. Not sure what that means. The NPCs in this simulation don’t have “utility”. The real humans in the secret prison do.
This should have been clearer. We meant this in Bentham’s good old way: minimal pain and maximal pleasure. Intuitively: a world with a lot of pleasure (in the long run) is better than a world with a lot of pain. - You don’t need to agree; you just need to agree that this is worth considering, but on our interpretation the orthogonality thesis says that one cannot consider this.
Thanks for this. Indeed, we have no theory of goals here and of how they relate; maybe they must be in a hierarchy, as you suggest. And there is a question, then, whether there must be some immovable goal or goals that would have to remain in place in order to judge anything at all. This would constitute a theory of normative judgment … which we don’t have up our sleeves :)
We suggest that such instrumental intelligence would be very limited.
In fact, generality comes in degrees here, and it seems one needs a fairly high degree to get to XRisk, but that high degree would then exclude orthogonality.
Yes, that means “this argument”.
Thanks for the ‘minor’ point, which is important: yes, we meant definitely out of human control. And perhaps that is not required, so the argument has a different shape.
Our struggle was to write down a ‘standard argument’ in such a way that it is clear and its assumptions come out—and your point adds to this.
Here we get to a crucial issue, thanks! If we do assume that reflection on goals does occur, do we assume that the results bear any resemblance to human reflection on morality? Perhaps there is an assumption about the nature of morality or moral reasoning in the ‘standard argument’ that we have not discussed?
We do not say that there is no XRisk or no XRisk from AI.
… well, one might say we assume that if there is ‘reflection on goals’, the results are not random.
apologies, I don’t recognise the paper here :)
We tried to frame the discussion internally, i.e. without making additional assumptions that people may or may not agree with (e.g. moral realism). If we did the job right, the assumptions made in the argument are in the ‘singularity claim’ and the ‘orthogonality thesis’ - and there the dilemma is that we need an assumption in the one (general intelligence in the singularity claim) that we must reject in the other (the orthogonality thesis).
What we do say (see figure 1) is that two combinations are inconsistent:
a) general intelligence + orthogonality
b) instrumental intelligence + existential risk
So if one wants to keep the ‘standard argument’, one would have to argue that one of these two combinations, a) or b), is in fact fine; a rough sketch of the dilemma follows below.
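For what it is worth, here is a rough formal rendering of that dilemma (our own sketch, not the paper’s formalisation, and it reads ‘merely instrumental intelligence’ simply as ‘not general intelligence’, which is a simplification): if both inconsistencies hold, then orthogonality and existential risk cannot both hold of one and the same system.
====
-- Rough sketch of the dilemma behind figure 1 (ours, not the paper's).
-- Simplifying assumption: 'merely instrumental intelligence' is read as
-- 'not general intelligence' (¬G).
-- G : the system has general intelligence
-- O : the orthogonality thesis holds of the system
-- X : the system poses existential risk for humanity
variable (G O X : Prop)

-- a) general intelligence + orthogonality is inconsistent:        ¬(G ∧ O)
-- b) instrumental intelligence + existential risk is inconsistent: ¬(¬G ∧ X)
-- Then orthogonality and existential risk cannot hold together.
example (a : ¬(G ∧ O)) (b : ¬(¬G ∧ X)) : ¬(O ∧ X) :=
  fun ⟨hO, hX⟩ =>
    (Classical.em G).elim
      (fun hG  => a ⟨hG, hO⟩)
      (fun hnG => b ⟨hnG, hX⟩)
====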
Is this ‘standard argument’ valid? We only argue that it is problematic.
If this argument is invalid, what would a valid argument look like? Perhaps with a ‘sufficient probability’ of high risk from instrumental intelligence?
One more consideration about “instrumental intelligence”: we left that somewhat under-defined, more like “if I had that utility function, what would I do?” … but it is not clear that this image of “me in the machine” captures what a current or future machine would do. In other words, people who use instrumental intelligence as their image of AI owe us a more detailed explanation of what that would be, given the machines we are creating, not just given the standard theory of rational choice.