Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Kay Kozaronek
Karma:
184
All
Posts
Comments
New
Top
Old
Investing in Robust Safety Mechanisms is critical for reducing Systemic Risks
Tom DAVID
,
Pierre Peigné
,
Quentin FEUILLADE--MONTIXI
,
Kay Kozaronek
and
Miailhe Nicolas
11 Dec 2024 13:37 UTC
4
points
3
comments
2
min read
LW
link
Searching for a model’s concepts by their shape – a theoretical framework
Kaarel
,
gekaklam
,
Walter Laurito
,
Kay Kozaronek
,
AlexMennen
and
June Ku
23 Feb 2023 20:14 UTC
51
points
0
comments
19
min read
LW
link
[RFC] Possible ways to expand on “Discovering Latent Knowledge in Language Models Without Supervision”.
gekaklam
,
Walter Laurito
,
Kaarel
and
Kay Kozaronek
25 Jan 2023 19:03 UTC
48
points
6
comments
12
min read
LW
link
Reinforcement Learning Study Group
Kay Kozaronek
26 Dec 2021 23:11 UTC
20
points
8
comments
1
min read
LW
link
Back to top