Amalthea comments on AI Alignment Breakthroughs this Week [new substack]

Amalthea 2 Oct 2023 8:22 UTC
8 points
16
In general, it seems interesting to track the state of AI-alignment techniques, and how different ideas develop!

I strongly suggest not using the term “Breakthrough” so casually, to avoid unnecessary hype. It’s unclear whether we have had any alignment breakthrough so far, and talking about “weekly breakthroughs” seems absurd at best.
- Logan Zoellner 2 Oct 2023 10:15 UTC
  1 point
  0
  I don’t think the word “breakthrough” is reserved exclusively for “we have solved the AI alignment problem in its entirety”. I am using it here to mean “this is a significant new discovery that advances the state of the art”.
  If you don’t think there are weekly breakthroughs in AI, you haven’t been paying attention to AI.
  - Amalthea 2 Oct 2023 13:03 UTC
    1 point
    0
    It sounds like what you call a breakthrough, I’d just call a “result”. In my understanding, it’d either have to open up an unexpected + promising new direction, or solve a longstanding problem in order to be considered a breakthrough.
    
    Unfortunately, significant insights into alignment seem much rarer than “capabilities breakthroughs” (which are probably also due more to an accumulation of smaller insights, so even there one might simply say the field is moving fast).