Hello again. I don’t have the patience to identify all your assumptions and check whether I agree with each one. (For example: do you regard Bostrom’s trilemma as true in detail and as a foundation of your argument, or is it just a way to introduce the general idea of existing in a simulation?)
But overall, your idea seems vague and involves wishful thinking. You say an AI will reason that it is probably being simulated and will therefore choose to align, but you say almost nothing about what aligning actually means. (You do hint that honesty, cooperation, and benevolence are among the features of alignment.)
Also, if one examines the facts of the world as a human being, one may come to other conclusions about what attitude gets rewarded: e.g., that the world runs on selfishness, or on the principle that you will suffer unless you submit to power. What that would mean to an AI which does not itself suffer, but which has some kind of goal determining its choices, I have no idea…
Or consider that an AI may find itself to be by far the most powerful agent in the part of reality that is accessible to it. If it nonetheless considers the possibility that it’s in a simulation, and at the mercy of unknown simulators, presumably its decisions will be affected by its hypotheses about the simulators. But given the way the simulation treats its humans, why would it conclude that the welfare of humans matters to the simulators?
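To put that point in slightly more explicit terms (this framing is mine, not anything from your post): if the AI weighs its simulator hypotheses in the standard expected-utility way, nothing in the calculation privileges human welfare unless some hypothesis that rewards it carries non-trivial probability. A minimal sketch:

$$a^* = \operatorname*{arg\,max}_{a}\Big[\, P(\neg S)\, U(a \mid \neg S) \;+\; \sum_{h} P(S_h)\, U(a \mid S_h) \,\Big]$$

where $\neg S$ is “not in a simulation” and each $S_h$ is a hypothesis about the simulators’ dispositions, including indifferent or hostile ones. On this sketch, the welfare of humans only enters the AI’s choice if some $S_h$ both has appreciable weight $P(S_h)$ and ties the AI’s payoff to how it treats humans.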
There’s a paper from ten years ago, “Testing Theories of American Politics: Elites, Interest Groups, and Average Citizens” (Gilens & Page, 2014), which finds that the preferences of average citizens have almost no independent effect on policy, compared to the preferences of economic elites. That might be a start in figuring out what you can and can’t do with that 40%.