Thanks, this is a useful corrective to the post! To shortcut safety to “would I trust my grandmother to use this without bad outcomes”, I would trust a current-gen LLM to be helpful and friendly with her, but I would absolutely fear her “learning” factually untrue things from it. While I think it can be useful to have separate concepts for hallucinations and “intentional lies” (as another commenter argues), I think “behavioral safety” should preclude both, in which case our LLMs are not behaviorally safe.
I think I may have overlooked hallucinations because I’ve internalized that LLMs are factually unreliable, so I don’t use LLMs where accuracy is critical, so I don’t see many hallucinations (which is not much of an endorsement of LLMs).
Thanks, this is a useful corrective to the post! To shortcut safety to “would I trust my grandmother to use this without bad outcomes”, I would trust a current-gen LLM to be helpful and friendly with her, but I would absolutely fear her “learning” factually untrue things from it. While I think it can be useful to have separate concepts for hallucinations and “intentional lies” (as another commenter argues), I think “behavioral safety” should preclude both, in which case our LLMs are not behaviorally safe.
I think I may have overlooked hallucinations because I’ve internalized that LLMs are factually unreliable, so I don’t use LLMs where accuracy is critical, so I don’t see many hallucinations (which is not much of an endorsement of LLMs).