Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Kay Kozaronek
Karma:
184
All
Posts
Comments
New
Top
Old
Investing in Robust Safety Mechanisms is critical for reducing Systemic Risks
Tom DAVID
,
Pierre Peigné
,
Quentin FEUILLADE--MONTIXI
,
Kay Kozaronek
and
Miailhe Nicolas
Dec 11, 2024, 1:37 PM
8
points
3
comments
2
min read
LW
link
Searching for a model’s concepts by their shape – a theoretical framework
Kaarel
,
gekaklam
,
Walter Laurito
,
Kay Kozaronek
,
AlexMennen
and
June Ku
Feb 23, 2023, 8:14 PM
51
points
0
comments
19
min read
LW
link
[RFC] Possible ways to expand on “Discovering Latent Knowledge in Language Models Without Supervision”.
gekaklam
,
Walter Laurito
,
Kaarel
and
Kay Kozaronek
Jan 25, 2023, 7:03 PM
48
points
6
comments
12
min read
LW
link
Reinforcement Learning Study Group
Kay Kozaronek
Dec 26, 2021, 11:11 PM
20
points
8
comments
1
min read
LW
link
Back to top
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel