Logan Riggs
Karma: 3,074
[Simulators seminar sequence] #1 Background & shared assumptions
Jan, Charlie Steiner, Logan Riggs, janus, jacquesthibs, metasemi, Michael Oesterle, Lucas Teixeira, peligrietzer and remember
Jan 2, 2023, 11:48 PM | 50 points | 4 comments | 3 min read
Results from a survey on tool use and workflows in alignment research
jacquesthibs, Jan, janus and Logan Riggs
Dec 19, 2022, 3:19 PM | 79 points | 2 comments | 19 min read
A descriptive, not prescriptive, overview of current AI Alignment Research
Jan, Logan Riggs, jacquesthibs and janus
Jun 6, 2022, 9:59 PM | 139 points | 21 comments | 7 min read
Frame for Take-Off Speeds to inform compute governance & scaling alignment
Logan Riggs
May 13, 2022, 10:23 PM | 15 points | 2 comments | 2 min read
Alignment as Constraints
Logan Riggs
May 13, 2022, 10:07 PM | 10 points | 0 comments | 2 min read
Make a Movie Showing Alignment Failures
Logan Riggs
Apr 13, 2022, 9:54 PM | 75 points | 11 comments | 2 min read
Convincing People of Alignment with Street Epistemology
Logan Riggs
Apr 12, 2022, 11:43 PM | 54 points | 4 comments | 3 min read
Roam Research Mobile is Out!
Logan Riggs
Apr 8, 2022, 7:05 PM | 12 points | 0 comments | 1 min read
Convincing All Capability Researchers
Logan Riggs
Apr 8, 2022, 5:40 PM | 120 points | 70 comments | 3 min read
Language Model Tools for Alignment Research
Logan Riggs
Apr 8, 2022, 5:32 PM | 28 points | 0 comments | 2 min read
5-Minute Advice for EA Global
Logan Riggs
Apr 5, 2022, 10:33 PM | 16 points | 2 comments | 2 min read
A survey of tool use and workflows in alignment research
Logan Riggs, Jan, janus and jacquesthibs
Mar 23, 2022, 11:44 PM | 45 points | 4 comments | 1 min read
Some (potentially) fundable AI Safety Ideas
Logan Riggs
Mar 16, 2022, 12:48 PM | 22 points | 5 comments | 5 min read
Solving Interpretability Week
Logan Riggs
Dec 13, 2021, 5:09 PM | 11 points | 5 comments | 1 min read
Solve Corrigibility Week
Logan Riggs
Nov 28, 2021, 5:00 PM | 39 points | 21 comments | 1 min read
[Question] What Heuristics Do You Use to Think About Alignment Topics?
Logan Riggs
Sep 29, 2021, 2:31 AM | 5 points | 3 comments | 1 min read
Wanting to Succeed on Every Metric Presented
Logan Riggs
12 Apr 2021 20:43 UTC | 72 points | 25 comments | 3 min read
Using GPT-N to Solve Interpretability of Neural Networks: A Research Agenda
Logan Riggs and Gurkenglas
3 Sep 2020 18:27 UTC | 68 points | 11 comments | 2 min read
[Question] What’s a Decomposable Alignment Topic?
Logan Riggs
21 Aug 2020 22:57 UTC | 26 points | 16 comments | 1 min read
Mapping Out Alignment
Logan Riggs, adamShimi, Gurkenglas, AlexMennen and Gyrodiot
15 Aug 2020 1:02 UTC | 43 points | 0 comments | 5 min read