This essay seems to have lost the plot of where the problems with AI come from. I was historically happy that Conjecture focused on the parts of AI development that are really obviously bad, like having a decent chance of literally killing everyone or permanently disempowering humanity, but instead this seems like it’s a random rant against AI-generated art, and name-calling of obviously valuable tools like AI coding assistants.
I am not sure what happened. I hope you find the plot again.
Hi habryka, I don’t really know how best to respond to such a comment. First, I would like to say thank you for your well-wishes, assuming you did not mean them sarcastically. Maybe I have lost the plot, and if so, I do appreciate help in recovering it. Second, I feel confused as to why you would say such things in general.
Just last month, my coauthors and I released a 100+ page explanation/treatise on AI extinction risk that gives a detailed account of where AGI risk comes from and how it works, which was received warmly by LW and the general public alike, and which continues to be updated and actively publicised.
In parallel, our sister org ControlAI, a non-profit policy advocacy org focused solely on extinction risk prevention that I work with frequently, has published A Narrow Path, a similarly extensive writeup on principles of regulation to address xrisk from ASI, which ControlAI and I have pushed and discussed extensively with policymakers in multiple countries, and there are other regulation-promoting projects ongoing.
I have been on CNN, BBC, Fox News and other major news sources warning in unambiguous terms about the risks. There are literally dozens of hours of podcast material, including from just last month, where I explain in excruciating depth the existential risk posed by AGI systems, where it comes from, and how it differs from other forms of AI risk. If you think all my previous material has “lost the plot”, then, well, I guess in your eyes I never had it, and there’s not much I can do.
This post is a technical agenda that is not framed in the usual LW ideological ontology, and it has not been optimized to appeal to that audience; rather, it aims to identify an angle that is tractable, generalizes the problem without losing its core, and leads to solutions that address the hard core, which is Complexity. In the limit, if we had beautifully simple, legible designs for ASIs that we fully understand and can predict, technical xrisk (though not governance) would be effectively solved. If you disagree with this, I would have greatly appreciated your engagement on which object-level points you think are wrong, and it might have helped me write a better roadmap.
But it seems to me that you have not even tried to engage with the content of this post at all, and have instead merely asserted that it is a “random rant against AI-generated art” and “name-calling.” I see no effort beyond surface-level pattern matching, nor any curiosity about how it might fit with my previous writings and thinking that have been shared and discussed.
Do you truly think that’s the best effort at engaging in good faith you can make?
If so, I don’t know what I can say that would help. I hope we can both find the plot again, since neither of us seems to see it in the other person.
[epistemic status: way too ill to be posting important things]
hi fellow people-who-i-think-have-much-of-the-plot
you two seem, from my perspective as someone who has read a fair amount of content from both of you, to have a bunch of similar models and goals, but quite different strategies.
on top of both having a firm grip on the core x-risk arguments, you both call out similar dynamics of capabilities orgs capturing the will to save the world and turning it into more capabilities progress[1], you both take issue with somewhat different but i think related parts of openphil’s grantmaking process, you both have high p(doom) and not very comfortable timelines, etc.
i suspect that if connor explained why he is focusing on the things he is here, that would uncover the relevant difference. my current guess is that connor is doing a kind of political alliance-building which is colliding with some of habryka’s highly active integrity reflexes.
maybe this doesn’t change much, these strategies do seem at least somewhat collision-y as implemented so far, but i hope our kind can get along.
e.g. “Turning care into acceleration” from https://www.thecompendium.ai/the-ai-race#these-ideologies-shape-the-playing-field
e.g. https://www.lesswrong.com/posts/h4wXMXneTPDEjJ7nv/a-rocket-interpretability-analogy?commentId=md7QvniMyx3vYqeyD and lots of calling out Anthropic