hal2k
Karma: 16
Latent Adversarial Training (LAT) Improves the Representation of Refusal
alexandraabbas, nlpet and hal2k
6 Jan 2025 10:24 UTC
17 points, 5 comments, 10 min read