My sense is you can combat this, but a lot of the reason this equivocation sticks is that x-risk safety people are actively trying to equivocate these things, because doing so gets them political capital with the left, which is generally anti-tech.
Some examples (not getting links for all of these because it’s too much work, but can get them if anyone is particularly interested):
- CSER trying to argue that near-term AI harms are the same as long-term AI harms
- AI Safety Fundamentals listing like a dozen random leftist “AI issues” in their article on risks from AI before going into any takeover stuff
- The executive order on AI being largely about discrimination and AI bias, mostly equivocating between catastrophic and random near-term harms
- Safety people at OpenAI equivocating between brand-safety and existential-safety because that got them more influence within the organization
In some sense one might boil this down to memetic selection pressure, but I think the causal history of this equivocation is more dependent on the choices of a relatively small set of people.
Definitely agree there’s some power-seeking equivocation going on, but I wanted to offer a less sinister explanation from my experience in AI research contexts. A lot of the equivocation and blurring of boundaries seems to come from people trying to work on concrete problems and obtain empirical information, via a thought process like:
1. Alignment seems maybe important?
2. OK, what experiment can I set up that lets me test some hypotheses?
3. I can’t really test the long-term harms directly, so let me test an analogue in a toy environment or on a small model and publish the results.
4. When talking about the experiments, I’ll often motivate them by talking about long-term harm.
Not too different from how research psychologists will start out trying to understand the Nature of Mind and then run an n=20 study on undergrads because that’s what they had the budget for. We can argue about how bad this equivocation is for academic research, but it’s a pretty universal pattern and well understood within academic communities.
The unusual thing in AI is that researchers hold most of the decision-making power in key organizations, so these research norms leak out into the business world, and no one bats an eye at a “long-term safety research” team that mostly works on toy, short-term problems.
This is one reason I’m more excited about building up “AI security” as a field and hiring infosec people instead of ML PhDs. My sense is that the infosec community actually has good norms for thinking about and working on things-shaped-like-existential-risks, and the AI x-risk community should inherit those norms, not the norms of academic AI research.
Amusingly, this post from yesterday praising BlueDot Impact for this was right below this one on my feed.
Yeah, to be clear, these are correlated. I looked into the content based on seeing the ad yesterday (and also sent over a complaint to the BlueDot people).