hugofry
Karma: 136
An X-Ray is Worth 15 Features: Sparse Autoencoders for Interpretable Radiology Report Generation
hugofry, Ahmed Abdulaal, NMontanaBrown and a-ijishakin
7 Oct 2024 8:53 UTC, 38 points, 0 comments, 5 min read (arxiv.org)
Towards Multimodal Interpretability: Learning Sparse Interpretable Features in Vision Transformers
hugofry
29 Apr 2024 20:57 UTC, 89 points, 8 comments, 11 min read
Robustness of Contrast-Consistent Search to Adversarial Prompting
Nandi, i, Jamie Wright, Seamus_F and hugofry
1 Nov 2023 12:46 UTC, 18 points, 1 comment, 7 min read