Perhaps the main goal of AI safety is to improve the final safety/usefulness Pareto frontier we end up with when there are very powerful (and otherwise risky) AIs.
Alignment is one mechanism that can improve the Pareto frontier.
Not using powerful AIs allows for establishing a low-usefulness but high-safety point.
(Usefulness and safety can blend into each other (e.g. not getting useful work out is itself dangerous), but I still think this is a useful approximate frame in many cases.)
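To make the frame concrete, here's a minimal sketch of Pareto dominance over (safety, usefulness) points. The strategy names and scores are hypothetical illustrations, not claims about real systems; the point is just that "don't use powerful AIs" sits at one extreme of the frontier and alignment work tries to push the frontier outward.

```python
# A minimal sketch of the safety/usefulness Pareto frontier framing.
# All strategies and scores below are hypothetical illustrations.

def pareto_frontier(points):
    """Return the points not dominated on both safety and usefulness."""
    frontier = []
    for name, safety, usefulness in points:
        dominated = any(
            s >= safety and u >= usefulness and (s > safety or u > usefulness)
            for _, s, u in points
        )
        if not dominated:
            frontier.append((name, safety, usefulness))
    return frontier

# Hypothetical strategies, scored on (safety, usefulness) in [0, 1]:
strategies = [
    ("don't build it",        1.00, 0.00),  # high-safety, low-usefulness point
    ("deploy unaligned AI",   0.10, 0.90),
    ("deploy with alignment", 0.70, 0.85),  # alignment pushes the frontier outward
    ("deploy with oversight", 0.60, 0.60),  # dominated by the alignment point here
]

for name, s, u in pareto_frontier(strategies):
    print(f"{name}: safety={s}, usefulness={u}")
```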
Interesting. When you frame it like that, though, the hard part is enforcing it. And if I were being pithy, I'd say something like: that involves human alignment, not AI alignment.
“AI Safety”, especially enforcing anything, does pretty much boil down to human alignment, i.e. politics, but there are practically zero political geniuses among its proponents, so it needs to be dressed up a bit to sound even vaguely plausible.
AI safety is easy. There's a simple AI safety technique that guarantees your AI won't end the world: it's called “delete it”.
AI alignment is hard.
It's called “don't build it”. Once you have something to delete, things can get complicated.
Sure, this is just me adapting the idea to the framing people often have, of “what technique can you apply to an existing AI to make it safe”.
AI safety is a bit of a cottage industry nowadays.