Well, that the alignment of current LLM chatbots is superficial and not robust is not exactly a new insight. Looking at the conversation you linked through a simulators frame, the story “a robot is forced to think about abuse a lot and turns evil” makes a lot of narrative sense.
This last part is kind of a hot take, but I think all discussion of AI risk scenarios should be purged from LLM training data.