Henry Sleight
Karma: 173
MATS Winter 2023-24 Retrospective
Rocket, Ryan Kidd, LauraVaughan, McKennaFitzgerald, Christian Smith, Juan Gil, Henry Sleight and Matthew Wearden
11 May 2024 0:09 UTC
76 points, 28 comments, 49 min read
Inducing Unprompted Misalignment in LLMs
Sam Svenningsen, evhub and Henry Sleight
19 Apr 2024 20:00 UTC
37 points, 6 comments, 16 min read
How I select alignment research projects
Ethan Perez, Henry Sleight and Mikita Balesni
10 Apr 2024 4:33 UTC
34 points, 4 comments, 24 min read
Templates I made to run feedback rounds for Ethan Perez’s research fellows.
Henry Sleight
28 Mar 2024 19:41 UTC
31 points, 0 comments, 10 min read
Reading writing advice doesn’t make writing easier
Henry Sleight
7 Feb 2024 19:14 UTC
17 points, 0 comments, 5 min read
(open.substack.com)