The reason why EY&co were relatively optimistic (p(doom) ~ 50%) before AlphaGo was their assumption that "to build intelligence, you need some kind of insight into the theory of intelligence." They didn't expect that you could just take a sufficiently large approximator, pour data into it, get intelligent behavior, and have no idea why you got intelligent behavior.
That is a fascinating take! I haven't heard it put that way before. I think that perspective is a way to understand the gap between old-school agent foundations folks' high p(doom) and new-school LLMers' relatively low p(doom) - something I've been working to understand, and hope to publish on soon.
To the extent this is true, I think that’s great, because I expect to see some real insights on intelligence as LLMs are turned into functioning minds in cognitive architectures.
Do you have any refs for that take, or is it purely a gestalt?
If it is not a false memory, I've seen this on Twitter from either EY or Rob Bensinger, but it's unlikely I'll find the source now; it was in the middle of a discussion.
Fair enough, thank you! Regardless, it does seem like a good reason to be concerned about alignment. If you have no idea how intelligence works, how in the world would you know what goals your created intelligence is going to have? At that point, it is like alchemy—performing an incantation and hoping not just that you got it right, but that it does the thing you want.