gasteigerjo
Karma: 171
Working on Alignment Science at Anthropic
Paper Highlights, November ’24
gasteigerjo, 7 Dec 2024 19:15 UTC, 7 points, 0 comments, 8 min read (aisafetyfrontier.substack.com)
AI Safety at the Frontier: Paper Highlights, October ’24
gasteigerjo, 31 Oct 2024 0:09 UTC, 3 points, 0 comments, 9 min read (aisafetyfrontier.substack.com)
AI Safety at the Frontier: Paper Highlights, September ’24
gasteigerjo, 2 Oct 2024 9:49 UTC, 13 points, 0 comments, 7 min read (aisafetyfrontier.substack.com)
AI Safety at the Frontier: Paper Highlights, August ’24
gasteigerjo, 3 Sep 2024 19:17 UTC, 28 points, 0 comments, 6 min read (aisafetyfrontier.substack.com)
AI Safety at the Frontier: Paper Highlights, July ’24
gasteigerjo, 5 Aug 2024 13:00 UTC, 8 points, 0 comments, 7 min read (aisafetyfrontier.substack.com)
Discussion: Challenges with Unsupervised LLM Knowledge Discovery
Seb Farquhar, Vikrant Varma, zac_kenton, gasteigerjo, Vlad Mikulik and Rohin Shah, 18 Dec 2023 11:58 UTC, 147 points, 21 comments, 10 min read