Josh Levy
Karma: 42
Is This Lie Detector Really Just a Lie Detector? An Investigation of LLM Probe Specificity.
Josh Levy, 4 Jun 2024 15:45 UTC, 38 points, 0 comments, 17 min read
Open Source LLMs Can Now Actively Lie
Josh Levy, 1 Jun 2023 22:03 UTC, 6 points, 0 comments, 3 min read