Walter Laurito
Karma: 110
Finding the estimate of the value of a state in RL agents
Clément Dumas, Walter Laurito, KlaRo and Kaarel
3 Jun 2024 20:26 UTC, 7 points, 4 comments, 4 min read
Searching for a model’s concepts by their shape – a theoretical framework
Kaarel, gekaklam, Walter Laurito, Kay Kozaronek, AlexMennen and June Ku
23 Feb 2023 20:14 UTC, 51 points, 0 comments, 19 min read
[RFC] Possible ways to expand on “Discovering Latent Knowledge in Language Models Without Supervision”.
gekaklam, Walter Laurito, Kaarel and Kay Kozaronek
25 Jan 2023 19:03 UTC, 48 points, 6 comments, 12 min read