Poll For Topics of Discussion and Disagreement
Use this thread to (a) upvote topics you’re interested in reading about, (b) agree/disagree with positions, and (c) add new positions for people to vote on.
Note: Hit cmd-f or ctrl-f (whatever normally opens search) to automatically expand all of the poll options below.
Prosaic Alignment is currently more important to work on than Agent Foundations work.
LLMs as currently trained run ~0 risk of catastrophic instrumental convergence even if scaled up with 1000x more compute
Academia is sufficiently dysfunctional that if you want to make a great scientific discovery you should basically do it outside of academia.
Pursuing plans that cognitively enhance humans while delaying AGI should be our top strategy for avoiding AGI risk
Current progress in AI governance will translate with greater than 50% probability into more than a 2 year counterfactual delay of dangerous AI systems
Ambitious mechanistic interpretability is quite unlikely[1] to be able to confidently assess[2] whether AIs[3] are deceptively aligned (or otherwise have dangerous propensities) in the next 10 years.
[1] greater than 90% failure
[2] likelihood ratio of 10
[3] I’m referring to whichever AIs are pivotal or cruxy for things to go well prior to human obsolescence.
It is very unlikely AI causes an existential catastrophe (Bostrom or Ord definition) but doesn’t result in human extinction. (That is, non-extinction AI x-risk scenarios are unlikely)
Things will basically be fine regarding job loss and unemployment due to AI in the next several years, and those worries are overstated.
The current AI x-risk grantmaking ecosystem is bad and could be improved substantially.
People aged 12 to 18 should basically be treated like adults rather than basically treated like children.
It is critically important for US/EU companies to build AGI before Chinese companies.
EAs and rationalists should strongly consider having lots more children than they currently are
Meaningness’s “Geeks Mops and Sociopaths” model is an accurate model of the dynamics that underlie most social movements
Irrefutable evidence of extraterrestrial life would be a good thing.
It was a mistake to increase salaries in the broader EA/Rationality/AI-Alignment ecosystem between 2019 and 2022
Good AGI-notkilleveryoneism-conscious researchers should in general prioritize working at big AGI labs (over working independently, at alignment-focused labs, or in academia) marginally more than they currently do.
The ratio of good alignment work done at labs vs independently mostly skews toward labs
Good meaning something different from impactful here. Obviously AGI labs will pay more attention to their researchers or researchers from respectable institutions than independent researchers. Your answer should factor out such considerations.
Edit: Also normalize for quantity of researchers.
Someone in the AI safety community (e.g. Yud, Critch, Salamon, you) can currently, within 6 months’ effort, write a 20,000-word document that would pass a threshold for a coordination takeoff on Earth, given that 1 million smart Americans and Europeans would read all of it and intend to try out much of the advice (i.e. the doc succeeds given 1m serious reads, it doesn’t need to cause 1m serious reads). Copy-pasting already-written documents/posts would count.
There is a greater than 20% chance that the Effective Altruism movement has been net negative for the world.
Empirical agent foundations is currently a good idea for a research direction.
A basic deontological and straightforward morality (such as that exemplified by Hermione in HPMOR) is basically right; this is in contrast with counterintuitive moralities which suggest that evil-tinted people (like Quirrell in HPMOR) also represent valid ways of being moral.
Just as the last 12 months were the time of the chatbots, the next 12 months will be the time of agent-like AI product releases.
The work of agency-adjacent research communities such as artificial life, complexity science and active inference is at least as relevant to AI alignment as LessWrong-style agent foundations research is.
American intelligence agencies consider AI safety to be substantially more worth watching than most social movements
It is possible to make meaningful progress on deceptive alignment using experiments on current models
Having another $1 billion to prevent AGI x-risk would be useful because we could spend it on large compute budgets for safety research teams.
Moloch is winning.
“Polyamory-as-a-default-option” would be a better social standard than “Monogamy-as-a-default-option”.
The rationality community will noticeably spill over into other parts of society in the next ten years. Examples: entertainment, politics, media, art, sports, education etc.
At least one American intelligence agency is concerned about the AI safety movement potentially decelerating the American AI industry, against the administration/natsec community’s wishes
I broadly agree with the claim that “most people don’t do anything and the world is very boring”.
On the current margin most people would be better off involving more text-based communication in their lives than in-person communication.
Agent foundations research should become more academic on the margin (for example by increasing the paper to blogpost ratio, and by putting more effort into relating new work to existing literature).
Current progress in AI governance will translate with greater than 20% probability into more than a 2 year counterfactual delay of dangerous AI systems
Rationality should be practiced for Rationality’s sake (rather than for the sake of x-risk).
It is possible to make meaningful progress on ELK using empirical experiments on current models
Language model agents are likely (>20%) to produce AGI (including the generalization to foundation model-based cognitive architectures)
Current AI safety university groups are overall a good idea and helpful, in expectation, for reducing AI existential risk
Having another $1 billion to prevent AGI x-risk would be useful because we could spend it on large-scale lobbying efforts in DC.
Immersion into phenomena is better for understanding them than trying to think them through at the gears level, on the margin for most people who read LessWrong.
Public mechanistic interpretability research is net positive in expectation.
Among existing alignment research agendas/projects, Superalignment has the highest expected value
Most LWers should rely less on norms of their own (or the LW community’s) design, and instead defer to regular societal norms more.
Rationalists would be better off if they were more spiritual/religious
Effective altruism can be well modeled by cynically thinking of it as just another social movement, in the sense that those who are part of it are mainly jockeying for in-group status, and making costly demonstrations to their in-group & friends that they care about other sentiences more than others in the in-group. It’s just that EA has more cerebral standards than other movements.
“Agent” is an incoherent concept.
“An open-source LLM-based agent with hacking abilities starts spreading itself over the Internet because some user asked it to do so, or to do something like conquer the world” is a quite probable point-of-no-return regarding AGI risk.
Investing in early-stage AGI companies helps with reducing x-risk (via mission hedging, having board seats, shareholder activism)
Great art is rarely original and mostly copied.
The younger generation of rationalists is less interesting than the older generation was when that older generation had the same amount of experience as the younger generation currently does.
At least one of {Anthropic, OpenAI, Deepmind} is net-positive compared to the counterfactual where, just before founding the company, its founders were all discreetly paid $10B by a time-travelling PauseAI activist not to found the company and to exit the industry for 30 years, and this worked.
One should basically not invest into having “charisma”.
The most valuable new people joining AI safety will usually take ~1-3 years of effort to begin to be adequately sorted and acknowledged for their worth, unless they are unusually good at self-promotion (e.g. gift of gab, networking experience, and a stellar resume).
Poll feature on LW: Yay or Nay?
There is a greater than 80% chance that effective altruism has been net-negative for the world.
If rationality took off in China, it would yield higher EV from potentially spreading to the rest of the world than from potentially accelerating China.
When people try to discuss philosophy, math, or science, especially pre-paradigmatic fields such as AI safety, they use a lot of metaphorical thinking to extend from familiar concepts to new concepts. It would be very helpful, and people would stop talking past each other so much, if they practiced being explicitly aware of these mental representations and directly shared them, rather than pretending that something more rigorous is happening. This is part of Alfred Korzybski’s original rationality project, something he called ‘consciousness of abstraction.’
Language model agents are very likely (>80%) to produce AGI (including the generalization to foundation model-based cognitive architectures)
Research into getting a mechanistic understanding of the brain for the purposes of at least one of: understanding how values/empathy work in people, brain uploading, or improving cryonics/plastination, is net positive and currently greatly underfunded.
Most persistent disagreements can more usefully be thought of as a difference in priors rather than a difference in evidence or rationality.
A Secular Solstice variation designed to work weekly (akin to Sunday Service or Shabbat) would be positive for rationalists, both for community and for the thought processes of the members.
There is a greater than 50% chance that the Effective Altruism movement has been net negative for the world.
Most LWers are prioritizing their slack too much.
“Intelligence” can be characterized with a similar level of theoretical precision as e.g., heat, motion, and information. (In other words: it’s less like a messy, ad-hoc phenomenon and more like a deep, general fact about our world).
You know of a technology that has at least a 10% chance of having a very big novel impact on the world (think the internet or ending malaria) that isn’t included in this list, very similar to something on it, or downstream from some element of it: AI, mind uploads, cryonics, human space travel, geo-engineering, gene drives, human intelligence augmentation, anti-aging, cancer cures, regenerative medicine, human genetic engineering, artificial pandemics, nuclear weapons, proper nanotech, very good lie detectors, prediction markets, other mind-altering drugs, cryptocurrency, better batteries, BCIs, nuclear fusion, better nuclear fission, better robots, AR, VR, room-temperature superconductors, quantum computers, polynomial time SAT solvers, cultured meat, solutions to antibiotic resistance, vaccines to some disease, optical computers, artificial wombs, de-extinction, and graphene.
Bad options are included just in case someone thinks they are good.
Xi Jinping thinks that economic failure in the US or China, e.g. similar to 2008, is one of the most likely things to change the global balance of power.
Having another $1 billion to prevent AGI x-risk would be pretty useful.
If we had access to a brain upload (and maybe a world simulator too) we could in principle extract something like a utility function, and the theory behind it relates more to agents in general than it does to humans in particular.
Any activity or action taken after drinking coffee in the morning will strongly reward/reinforce that action/activity
Humans are the dominant species on earth primarily because our individual intelligence surpassed the necessary threshold to sustain civilization and take control of our environment.
American intelligence agencies are actively planning to defend the American AI industry against foreign threats (e.g. Russia, China).
If you can write a prompt for GPT-2000 such that the completion of this prompt results in an aligned pivotal act, you can just use the knowledge necessary for writing this prompt to Just Build an aligned ASI, without needing to use GPT-2000.
Rationalist rituals like Petrov Day or the Secular Solstices should be marginally more emphasized within those collections of people who call themselves rationalists.
Rationality is likely to organically gain popularity in China (e.g. quickly reaching 10,000 people or reaching 100,000 by 2030, e.g. among scientists or engineers, etc).
It is wrong to protest AI labs.
The ratio of good alignment work done at labs vs in academia mostly skews toward labs
Good meaning something different from impactful here. Obviously AGI labs will pay more attention to their researchers or researchers from respectable institutions than academics. Your answer should factor out such considerations.
Edit: Also normalize for quantity of researchers.
The ratio of good alignment work done in academia vs independently mostly skews toward academia
Good meaning something different from impactful here. Possibly AGI labs will pay more attention to academics than independent researchers. Your answer should factor out such considerations.
Edit: Also normalize for quantity of researchers.
At least one mole, informant, or spy has been sent by a US government agency or natsec firm to infiltrate the AI safety community by posing as a new member (even if it’s just to ask questions in casual conversations at events about recent happenings or influential people’s priorities).
The government should build nuclear-driven helicopters, like nuclear subs.
An alignment technique that can fully align GPT-4 is likely (>50%) to also fully align the first existentially dangerous AGI
Most end-to-end “alignment plans” are bad because research will be incremental. For example, Superalignment’s impact will mostly come from adapting to the next ~3 years of AI discoveries and working on relevant subproblems like interp, rather than creating a superhuman alignment researcher.
Conceptual alignment work on concepts like “agency”, “optimization”, “terminal values”, “abstractions”, “boundaries” is mostly intractable at the moment.
Most LWers are not prioritizing their slack enough.
Those who call themselves rationalists or EAs should drink marginally more alcohol at social events.
In human interactions at any scale, it is net good to at least momentarily consider the Elephant and/or the Player, with very few exceptions.
Most of the time, power-seeking behavior in humans is morally good or morally neutral.
Computer science & ML will become lower in relevance/restricted in scope for the purposes of working with silicon-based minds, just as human-neurosurgery specifics are largely but not entirely irrelevant for most civilization-scale questions like economic policy, international relations, foundational research, etc.
Or IOW: Model neuroscience (and to some extent, model psychology) requires more in-depth CS/ML expertise than will the smorgasbord of incoming subfields of model sociology, model macroeconomics, model corporate law, etc.
The virtue of the void is indeed the virtue above all others (in rationality), and fundamentally unformalizable.
There is likely a deep compositional structure to be found for alignment, possibly to the extent that AGI alignment could come from “merely” stacking together “microalignment”, even if in non-trivial ways.
If you can’t write a program that produces aligned output (under whatever definition of alignment you use) when run on an unphysically large computer, then you can’t deduce from the training data or weights of a superintelligent neural network whether it produces aligned output.
There are arguments for convergent instrumental pressures towards catastrophe, but the required assumptions are too strong for the arguments to clearly go through.
Cultural values are something like preferences over pairs of social environments and things we actually care about. So it makes sense to talk about jointly optimizing them.
It’s good for EA orgs to pay well
The younger generation of EAs is less interesting than the older generation was when that older generation had the same amount of experience as the younger generation currently does.
In the context of the offense-defense balance, offense has a strong advantage.
Generative AI like LLMs or diffusion models will eventually be superseded by human AI researchers coming up with something autonomous.
EA has gotten a little more sympathetic to vibes-based reasoning recently, and will continue to incorporate more of it.
The mind (i.e. your mind), and how it is experienced from the inside, is potentially a very rich source of insights for keeping AI minds aligned on the inside.
All else equal, a unit of animal suffering should be accorded the same moral weight as an equivalent unit of human suffering. (i.e. equal consideration for equal interests)
‘Descriptive’ agent foundations research is currently more important to work on than ‘normative’ agent foundations research.
Autism is the extreme male version of a male-female difference in systemizing vs. empathizing thinking.
Developing a solid human intelligence/skill evaluation metric would be a high-EV project for AI safety, e.g. to make it easier to invest in moving valuable AI safety people to the Bay Area/London from other parts of the US/UK.
MadHatter is an original and funny satirist. The universally serious reaction to his jokeposts is a quintessential example of rationalist humorlessness.
People should pay an attractiveness tax to the government.