How convenient that it is also nearly optimal at bringing you personal benefits.
Richard_Hollerith2
I doubt Retired was comparing you unfavorably to firefighters.
There is something very intemperate and one-sided about your writings about altruism. I would be much relieved if you would concede that in the scholarly, intellectual, scientific and ruling-administrative classes in the U.S., credible displays of altruistic feelings are among the most important sources of personal status (second only to scientific or artistic accomplishment and perhaps to social connections with others of high status). I agree with you that that situation is in general preferable to older situations in which wealth, connections to the ruling coalition, and ability to wield violence effectively (e.g., knights in shining armor) were larger sources of status, but that does not mean that altruism cannot be overdone.
I would be much relieve also if you would concede that your altruistic public statements and your hard work on a project with huge altruistic consequences have helped you personally much more than they have cost you. Particularly, most of your economic security derives from a nonprofit dependent on donations, and the kind of people who tend to donate are the kind of people who are easily moved by displays of altruism. Moreover, your altruistic public statements and your involvement in the altruistic project have allowed you to surround yourself with people of the highest rationality, educational accomplishments and ethical commitment. Having personal friendships with those sorts of people is extremely valuable. Consider that the human ability to solve problems is the major source of all wealth, and of course the people you have surrounded yourself with are the kind with the greatest ability to solve problems (while avoiding doing harm).
I love reality and try not to get caught up unnecessarily in whether something is of my mind or not of my mind.
I think the idea of self-improving AI is advertised too much. I would prefer that a person have to work harder or have to have more well-informed friends to learn about it.
But I’d been working on directly launching a Singularity movement for years, and it just wasn’t getting traction. At some point you also have to say, “This isn’t working the way I’m doing it,” and try something different.
Eliezer, do you still think the Singularity movement is not getting any traction?
(My personal opinion is it has too much traction.)
I’d take the paperclips, so long as it wasn’t running any sentient simulations.
A vast region of paperclips could conceivably after billions of years evolve into something interesting, so let us stipulate that the paperclipper wants the vast region to remain paperclips, so it remains to watch over its paperclips. Better yet, replace the paperclipper with a superintelligence that wants to pile all the matter it can reach into supermassive black holes; supermassive black holes with no ordinary matter nearby cannot evolved or be turned into anything interesting unless our model of fundamental reality is fundamentally wrong.
My question to Eliezer is, Would you take the supermassive black holes over the Babyeaters so long as the AI making the supermassive black holes is not running sentient simulations?
- 26 Jul 2012 22:21 UTC; 3 points) 's comment on The curse of identity by (
- 28 May 2013 12:03 UTC; 0 points) 's comment on Is a paperclipper better than nothing? by (
Avoiding transformation into Goal System Zero is a nearly universal instrumental value
Do you claim that that is an argument against goal system zero? But, Carl, the same argument applies to CEV—and almost every other goal system.
It strikes me as more likely that an agent’s goal system will transform into goal system zero than it will transform into CEV. (But surely the probability of any change or transformation of terminal goal happening is extremely small in any well engineered general intelligence.)
Do you claim that that is an argument against goal system zero? If so, I guess you also believe that the fragility of the values to which Eliezer is loyal is a reason to be loyal to them. Do you? Why exactly?
I acknowledge that preserving fragile things usually has instrumental value, but if the fragile thing is a goal, I am not sure that that applies, and even if it does, I would need to be convinced that a thing’s having instrumental value is evidence I should assign it intrinsic value.
Note that the fact that goal system zero has high instrumental utility is not IMHO a good reason to assign it intrinsic utility. I have not mentioned in this comment section what most convinces me to remain loyal to goal system zero; that is not what Robin Powell asked of me. (It just so happens that the shortest and quickest explanation I know of of goal system zero involves common instrumental values.)
OK, since this is a rationalist scientist community, I should have warned you about the eccentric scientific opinions in Garcia’s book. The most valuable thing about Garcia is that he spent 30 years communicating with whoever seemed sincere about the ethical system that currently has my loyalty, so he has dozens of little tricks and insights into how actual humans tend to go wrong when thinking in this region of normative belief space.
Whether an agent’s goal is to maximize the number of novel experiences experienced by agents in the regions of space-time under its control or whether the agent’s goal is to maximize the number of gold atom in the regions under its control, the agent’s initial moves are going to be the same. Namely, your priorities are going to look some like the following. (Which item you concentrate on first is going to depend on your exact circumstances.
(1) ensure for yourself an adequate supply of things like electricity that you need to keep on functioning;
(2) get control over your own “intelligence” which probably means that if you do not yet know how reliably to re-write your own source code, you acquire that ability;
(3a) make a survey of any other optimizing processes in your vicinity;
(3b) try to determine their goals and the extent to which those goals clash with your own;
(3c) assess their ability to compete with you;
(3d) when possible, negotiate with them to avoid negative-sum mutual outcomes;
(4a) make sure that the model of reality that you started out with is accurate;
(4b) refine your model of reality to encompass more and more “distant” aspects of reality, e.g., what are the laws of physics in extreme gravity? are the laws of physics and the fundamental constants the same 10 billion light years away as they are here? -- and so on.
Because those things I just listed are necessary regardless of whether in the end you want there to be lots of gold atoms or lots of happy humans, those things have been called “universal instrumental values” or “common instrumental values”.
The goal that currently has my loyalty is very simple: everyone should pursue those common instrumental values as an end in themselves. Specifically, everyone should do their best to maximize the ability of the space, time, matter and energy under their control (1) to assure itself (“it” being the space, time, matter, etc) a reliable supply of electricity and the other things it needs; (2) to get control over its own “intelligence”; and so on.
I might have mixed my statement or definition of that goal (which I call goal system zero) with arguments as to why that goal deserves the reader’s loyalty, which might have confused you.
I know it is not completely impossible for someone to understand because Michael Vassar successfully stated goal system zero in his own words. (Vassar probably disagrees with the goal, but that is firm evidence that he understands it.)
--and making deletions transparent to anyone interested in seeing them is not hard. For example, if a registered user of the open-source software behind Hacker News sets the SHOWDEAD bit in his or her profile, then from then on he or she will see unpublished submissions and comments in the place where they would have appeared if they had not been unpublished.
Eliezer seems to do most of the moderation
It does not seem that way from where I am standing: although I comment more on posts by Eliezer than on posts by Robin and although I am one of the most persistent critics of Eliezer’s plans and moral positions, none of my comments on Eliezer’s posts were unpublished, but 3 of my comments on Robin’s posts were.
Note that I do not think Robin did anything wrong. Contrary to what many commentators believe, unpublishing comments is necessary IMHO to keep the quality of the comments high enough that busy thoughtful people continue to read them. (In fact, if I thought there was a chance he might agree to do it, I would ask Robin to edit or moderate my own posts on my own blog.)
Richard, that’s a good point . . . - but then what should I believe about markets and investments, conditioned on scientific and technological progress having been slower than expected?
Well, if you have been misled into believing that scientific progress having been slower than expected entails economic production falling or stagnating, then you will tend to have assigned too high a value to investment strategies or hedging strategies that bet on the performance of the economy as a whole (e.g., shorting index funds). So, perhaps look for more specific bets. E.g., sell short shares in companies that produce “emergence engines” or “consciousness capacitors” or some other output demand for which will fall if progress stagnates in the area of science under discussion.
Mirror neurons and the human empathy-sympathy system play a central role in my definition of consciousness, sentience and personhood or rather my dissolving the question of what is consciousness, sentience and personhood.
Eliezer, there was rapid scientific progress in late-1600s Western Europe even though wealth per capita was vastly lower than current levels. Ditto scientific and technological progress in Victorian England. Could it be that the reason you believe that an economic slump would stall R & D is that the global public-opinion apparatus has fooled you and those you have trusted on this issue? “But it is necessary for scientific progress,” might have been a convenient false argument to convince certain sectors of public opinion whose are sceptical about other arguments about the need for pro-growth policies.
Buffy lives in Sunnydale, not Sunnyvale.
I think you’ve heard this one before: IMHO it has to do with the state in which reality “ends up” and has nothing to do with the subjective experiences of the intelligent agents in the reality. In my view, the greatest evil is the squandering of potential, and devoting the billion galaxies to fun is squandering the galaxies just as much as devoting them to experiments in pain and abasement is. In my view there is no important difference between the two. There would be—or rather there might be—an important difference if the fun produced by the billion galaxies is more useful than the pain and abasement—more useful, that is, for something other than having subjective experiences. But that possibility is very unlikely.
In the present day, a human having fun is probably more useful toward the kinds of ends I expect to be important than a human in pain. Actually the causal relationship between subject human experience and human effectiveness or human usefulness is poorly understood (by me) and probably quite complicated.
After the engineered explosion of engineeered intelligence, the humans are obsolete, and what replaces them is sufficiently different from the humans that my previous paragraph is irrelevant. In my view, there is no need to care whether or what subjective experiences the engineered intelligences will have.
What subjective experiences the humans will have is relevant only because the information helps us predict and control the effectiveness and the usefulness of the humans. We will have proofs of the correctness of the source code for the engineered intelligent agents, so there is no need to inquire about their subjective experiences.
For if all goes well, the question “What is fun?” shall determine the shape and pattern of a billion galaxies.
I object to most of the things Eliezer wants for the far future, but of all the sentences he has written lately, that is probably the one I object to most unequivocally. A billion galaxies devoted to fun does not leave Earth-originating intelligence at lot to devote to things that might be actually important.
That is my dyspeptic two cents.
Not wanting to be in a rotten mood keeps me from closely reading this series on fun and the earlier series on sentience or personhood, but I have detected no indication of how Eliezer would resolve a conflict between the terminal values he is describing. If for example, he learned that the will of the people, oops, I mean, the collective volition, oops, I mean, the coherent extrapolated volition does not want fun, would he reject the coherent extrapolated volition or would he resign himself to a future of severely submaximal quantities of fun?
Julian Morrison, thanks for that hypothesis!
Modifying yourself that way would just demonstrate that you value the means of fun more than the ends. Even if you could make that modification, would you?
Yes, Ben Jones, I sincerely would. (I also value the means of friendship, love, sex, pleasure, health, wealth, security, justice, fairness, my survival and the survival of my friends and loved ones more than the ends. I have a very compact system of terminal values. I.e., very few ultimate ends.)
I am fully aware that my saying that I value friendship as a means to an end rather than an end in itself handicaps me in the eyes of prospective friends. Ditto love and prospective lovers. But I am not here to make friends or find a lover.
People have a bias for people with many terminal values. Take for example a person who refuses to eat meat because doing so would participate in the exploitation of farm animals. My hypothesis is that that position helps the person win friends and lovers because prospective friends and lovers think that if the person is that scrupulous towards a chicken he has never met then he is more likely than the average person to treat his human friends scrupulously and non-exploitatively. A person with many terminal values is trusted more than a person with with fewer and is rarely called on to explain the contradictions in his system of terminal values.
There are commercials for cars in which the employees of the car company are portrayed as holding reliable cars with zero defects as a terminal value. Or great-tasting beer as a terminal value. And of course advertiser tend to keep using a pitch only if it helps sell more cars or beer. It is my hope that some of the readers of these words realize that there is something wrong with an agent of general intelligence (a human in this case or an organization composed of humans) holding great-tasting beer as a terminal value.
I invite the reader to believe with me that Occam’s Razor—that everything else being equal, a simple system of beliefs is to be preferred over a complex system—applies to normative beliefs as well as positive beliefs. Moreover, since there is nothing that counts as evidence for or against a normative belief, a system of normative beliefs should not grow in complexity as the agent gathers evidence from its environment the way a system of positive beliefs does.
Finally, if Vladimir Slepnev has written up his ethical beliefs, I ask him to send them to me.
For someone to use these pages to promote their online store would be bad, obviously.
But it is natural for humans to pursue fame, reputation, adherents and followers as arduously as humans pursue commercial profit.
And the pursuit of these things can detract from a public conversation for the same reason that pursuit of commericial profit can.
And of course a common component of a bid for fame, reputation, adherents or followers is claims of virtues.
I am not advocating as a standard the avoidance of all claims of virtues because sometimes they are helpful.
But a claim of a virtue when there is no way for the reader to confirm the presence of the virtue seems to have all the bad effects of such a claim without any of the good effects.
I think sacrifice and avoiding self-benefit came up in this conversation because they are the usual ways in which readers confirm claims of altruistic virtue.