“6. f6” should be “6. h3”.
Microsoft is the sort of corporate bureaucracy where dynamic orgs/founders/researchers go to die. My median expectation is that whatever former OpenAI group ends up there will be far less productive than they were at OpenAI.
I’m a bit sceptical of that. You gave some reasonable arguments, but all of this should be known to Sam Altman, and he still chose to accept Microsoft’s offer instead of founding his own org (I’m assuming he would easily be able to raise a lot of money). So, given that “how productive are the former OpenAI folks at Microsoft?” is the crux of the argument, it seems that recent events are good news iff Sam Altman made a big mistake with that decision.
I’m confused by this statement. Are you assuming that AGI will definitely be built after the research time is over, using the most-plausible-sounding solution?
Or do you believe that you understand NOW that a wide variety of approaches to alignment, including most of those that can be thought of by a community of non-upgraded alignment researchers (CNUAR) in a hundred years, will kill everyone and that in a hundred years the CNUAR will not understand this?
If so, is this because you think you personally know better or do you predict the CNUAR will predictably update in the wrong direction? Would it matter if you got to choose the composition of the CNUAR?
Another big source of potential volunteers: People who are going to be dead soon anyway. I’d probably volunteer if I knew that I was dying from cancer in a few weeks.
Typo: This should be .
after 17… dxc6 or 17. c6
This should probably be “after 17… cxd6 or 17… c6”.
I suspect Wave refers to this company: https://www.wave.com/en/ (they are connected to EA)
Planecrash is a glowfic co-written by Yudkowsky: https://glowficwiki.noblejury.com/books/planecrash
Seconding the recommendation of the "Rest in Motion" post; it has helped me with a maybe-similar feeling.
AISC team report: Soft-optimization, Bayes and Goodhart
I don’t believe these “practical” problems (“can’t try long enough”) generalize enough to support your much more general initial statement. This doesn’t feel like a true rejection to me, but maybe I’m misunderstanding your point.
I think I mostly agree with this, but from my perspective it hints that you’re framing the problem slightly wrong. Roughly, the problem with the outsourcing approaches is our inability to specify/verify solutions to the alignment problem, not that specifying a solution is in general no easier than solving the problem yourself.
(Because of the difficulty of specifying the alignment problem, I restricted myself to speculating about pivotal acts in the post linked above.)
But you don’t need to be able to code to recognize that a piece of software is slow and buggy!?
About the terrible UI part, I agree a bit more, but even there one can think of relatively objective measures to check usability without being able to speak Python.
In cases where outsourcing succeeds (to various degrees), I think the primary load-bearing mechanism of success in practice is usually not “it is easier to be confident that work has been done correctly than to actually do the work”, at least for non-experts.
I find this statement very surprising. Isn’t almost all of software development like this?
E.g., the client asks the developer for a certain feature and then clicks around the UI to check if it’s implemented / works as expected.
“This is what it looks like in practice, by default, when someone tries to outsource some cognitive labor which they could not themselves perform.”
This proves way too much.
I agree, I think this even proves P=NP.
Maybe a more reasonable statement would be: You cannot outsource cognitive labor if you don’t know how to verify the solution. But I think that’s still not completely true, given that interactive proofs are a thing. (Plug: I wrote a post exploring the idea of applying interactive proofs to AI safety.)
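As a toy illustration of what interaction buys you (my own example, not from that post): a verifier who cannot distinguish two balls by looking can still become confident that they really have different colors, by secretly swapping them (or not) and asking the prover whether a swap happened, over many rounds.

```python
import random

def run_protocol(balls_really_differ: bool, rounds: int = 40) -> bool:
    """Toy interactive proof: returns True iff the verifier accepts the claim."""
    for _ in range(rounds):
        swapped = random.random() < 0.5          # verifier's secret coin flip
        if balls_really_differ:
            answer = swapped                     # an honest prover can see the colors
        else:
            answer = random.random() < 0.5       # a cheating prover must guess
        if answer != swapped:
            return False                         # one wrong answer -> reject
    return True                                  # a cheater survives with prob. ~2**-rounds

print(run_protocol(True), run_protocol(False))   # almost always: True False
```

The verifier never learns to tell the balls apart itself; the interaction alone makes cheating detectable.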
No, that’s not quite right. What you are describing is the NP-Oracle.
On the other hand, with the IP-Oracle we can (in principle, limited by the power of the prover/AI) solve all problems in the PSPACE complexity class.
Of course, PSPACE is again a class of decision problems, but using binary search it’s straightforward to extract complete answers like the designs mentioned later in the article.
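To make that extraction step concrete, here is a minimal Python sketch (my own illustration; `decision_oracle` is a hypothetical interface, not something from the article). Given a yes/no oracle that answers “does a valid complete answer exist that starts with this bit-prefix?”, one can recover the full answer one bit per query instead of guessing it wholesale; the binary-search phrasing above amounts to the same kind of self-reduction.

```python
def extract_answer(decision_oracle, n_bits):
    """Recover a full n-bit answer from a yes/no decision oracle.

    decision_oracle(prefix) is assumed to answer: "does some valid
    complete answer start with this bit-prefix?" Fixing one bit per
    query, n_bits queries suffice instead of searching 2**n_bits candidates.
    """
    prefix = []
    for _ in range(n_bits):
        # If no valid answer extends the prefix with a 0, the next bit must be a 1.
        if decision_oracle(prefix + [0]):
            prefix.append(0)
        else:
            prefix.append(1)
    return prefix


# Toy check: the "oracle" only answers yes/no questions about a hidden design.
secret = [1, 0, 1, 1, 0]
oracle = lambda prefix: secret[:len(prefix)] == prefix
assert extract_answer(oracle, len(secret)) == secret
```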
Pivotal acts using an unaligned AGI?
Your reasoning here relies on the assumption that the learning mostly takes place during the individual organism’s lifetime. But I think it’s widely accepted that brains are not “blank slates” at birth, but contain a significant amount of information, akin to a pre-trained neural network. Thus, if we consider evolution as the training process, we might reach the opposite conclusion: data quantity and training compute are extremely high, while parameter count (~brain size) and brain compute are restricted and selected against.
Thank you for writing about this! A minor point: I don’t think aerosolizing monkeypox suspensions using a nebulizer can be counted as gain-of-function research, not even “at least kind of”. (Or do I lack reading comprehension and have misunderstood something?)
Hypothesis: If a part of the computation that you want your trained system to compute “factorizes”, it might be easier to evolve a modular system for this computation. By factorization I just mean that (part of) the computation can be performed using mostly independent parts / modules.
Reasoning: Training independent parts to each perform some specific sub-calculation should be easier than training the whole system at once. E.g., training n neural networks of size N/n should be easier (in terms of compute or data needed) than training one of size N, given the exponential size of the parameter space. This hypothesis might explain the appearance of modularity if the necessary initial conditions for this selective advantage to be used are regularly present.
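A toy back-of-the-envelope comparison of the search-space sizes (my own numbers, assuming binary parameters purely for illustration):

```python
N, n = 100, 10                       # total parameters, number of modules

monolithic = 2 ** N                  # configurations of one N-parameter network
modular = n * 2 ** (N // n)          # searching each N/n-parameter module separately

print(f"monolithic: 2^{N} ~ {monolithic:.2e}")      # ~1.27e+30
print(f"modular:    {n} * 2^{N // n} = {modular}")  # 10240
# The modular search space is astronomically smaller, which is the intuition
# behind the claimed selective advantage of factorizable computations.
```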
(I’ve talked about this idea with Lblack already but wanted to spell it out a bit more and post it here for reference.)
I guess it’s hard to keep “they are experimenting with / building huge numbers of tanks” and “they are conducting combined arms exercises” secret from France and Russia, so they would have a lot of advance warning and could then also develop tanks.
But if you have a lot more than a layman’s understanding of tank design / combined arms doctrine, you could still come out ahead in this.