Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
New
Hot
Active
Old
Page
1
Anthropic: Three Sketches of ASL-4 Safety Case Components
Zach Stein-Perlman
6 Nov 2024 16:00 UTC
70
points
13
comments
1
min read
LW
link
(alignment.anthropic.com)
Meme Talking Points
ymeskhout
6 Nov 2024 15:27 UTC
15
points
0
comments
3
min read
LW
link
LDT (and everything else) can be irrational
Christopher King
6 Nov 2024 4:05 UTC
4
points
3
comments
2
min read
LW
link
Graceful Degradation
Screwtape
5 Nov 2024 23:57 UTC
44
points
0
comments
4
min read
LW
link
An alternative approach to superbabies
Towards_Keeperhood
5 Nov 2024 22:56 UTC
45
points
6
comments
3
min read
LW
link
Going Beyond “immaturity”
moisentinel
5 Nov 2024 20:51 UTC
−3
points
1
comment
2
min read
LW
link
Intent alignment as a stepping-stone to value alignment
Seth Herd
5 Nov 2024 20:43 UTC
31
points
4
comments
3
min read
LW
link
Why Recursion Pharmaceuticals abandoned cell painting for brightfield imaging
Abhishaike Mahajan
5 Nov 2024 14:51 UTC
25
points
1
comment
18
min read
LW
link
(www.owlposting.com)
Winning isn’t enough
JesseClifton
and
Anthony DiGiovanni
5 Nov 2024 11:37 UTC
23
points
12
comments
9
min read
LW
link
Anthropic—The case for targeted regulation
anaguma
5 Nov 2024 7:07 UTC
11
points
0
comments
2
min read
LW
link
(www.anthropic.com)
The Shallow Bench
Karl Faulks
5 Nov 2024 5:07 UTC
43
points
5
comments
3
min read
LW
link
Using Narrative Prompting to Extract Policy Forecasts from LLMs
Max Ghenis
5 Nov 2024 4:37 UTC
5
points
0
comments
1
min read
LW
link
ML4Good (AI Safety Bootcamp) - Experience report
JanEbbing
5 Nov 2024 1:18 UTC
4
points
0
comments
3
min read
LW
link
[Question]
Could orcas be (trained to be) smarter than humans?
Towards_Keeperhood
4 Nov 2024 23:29 UTC
48
points
5
comments
1
min read
LW
link
Metastatic Cancer Treatment Since 2010: The Success Stories
sarahconstantin
4 Nov 2024 22:50 UTC
38
points
0
comments
6
min read
LW
link
(sarahconstantin.substack.com)
Empathy/Systemizing Quotient is a poor/biased model for the autism/sex link
tailcalled
4 Nov 2024 21:11 UTC
31
points
0
comments
7
min read
LW
link
What if muscle tension is sometimes signal jamming?
Chipmonk
4 Nov 2024 21:08 UTC
11
points
1
comment
1
min read
LW
link
(chrislakin.blog)
Distributed espionage
margetmagenta
4 Nov 2024 19:43 UTC
3
points
0
comments
1
min read
LW
link
We can survive
Oxidize
4 Nov 2024 19:33 UTC
−11
points
3
comments
2
min read
LW
link
GPT-8 may not be ASI
rvzlxax409
4 Nov 2024 19:31 UTC
−4
points
0
comments
3
min read
LW
link
Back to top
Next