RSS

An­thropic: Three Sketches of ASL-4 Safety Case Components

Zach Stein-Perlman6 Nov 2024 16:00 UTC
70 points
13 comments1 min readLW link
(alignment.anthropic.com)

Meme Talk­ing Points

ymeskhout6 Nov 2024 15:27 UTC
15 points
0 comments3 min readLW link

LDT (and ev­ery­thing else) can be irrational

Christopher King6 Nov 2024 4:05 UTC
4 points
3 comments2 min readLW link

Grace­ful Degradation

Screwtape5 Nov 2024 23:57 UTC
44 points
0 comments4 min readLW link

An al­ter­na­tive ap­proach to superbabies

Towards_Keeperhood5 Nov 2024 22:56 UTC
45 points
6 comments3 min readLW link

Go­ing Beyond “im­ma­tu­rity”

moisentinel5 Nov 2024 20:51 UTC
−3 points
1 comment2 min readLW link

In­tent al­ign­ment as a step­ping-stone to value alignment

Seth Herd5 Nov 2024 20:43 UTC
31 points
4 comments3 min readLW link

Why Re­cur­sion Phar­ma­ceu­ti­cals aban­doned cell paint­ing for bright­field imaging

Abhishaike Mahajan5 Nov 2024 14:51 UTC
25 points
1 comment18 min readLW link
(www.owlposting.com)

Win­ning isn’t enough

5 Nov 2024 11:37 UTC
23 points
12 comments9 min readLW link

An­thropic—The case for tar­geted regulation

anaguma5 Nov 2024 7:07 UTC
11 points
0 comments2 min readLW link
(www.anthropic.com)

The Shal­low Bench

Karl Faulks5 Nov 2024 5:07 UTC
43 points
5 comments3 min readLW link

Us­ing Nar­ra­tive Prompt­ing to Ex­tract Policy Fore­casts from LLMs

Max Ghenis5 Nov 2024 4:37 UTC
5 points
0 comments1 min readLW link

ML4Good (AI Safety Boot­camp) - Ex­pe­rience report

JanEbbing5 Nov 2024 1:18 UTC
4 points
0 comments3 min readLW link

[Question] Could or­cas be (trained to be) smarter than hu­mans? 

Towards_Keeperhood4 Nov 2024 23:29 UTC
48 points
5 comments1 min readLW link

Me­tastatic Cancer Treat­ment Since 2010: The Suc­cess Stories

sarahconstantin4 Nov 2024 22:50 UTC
38 points
0 comments6 min readLW link
(sarahconstantin.substack.com)

Em­pa­thy/​Sys­tem­iz­ing Quo­tient is a poor/​bi­ased model for the autism/​sex link

tailcalled4 Nov 2024 21:11 UTC
31 points
0 comments7 min readLW link

What if mus­cle ten­sion is some­times sig­nal jam­ming?

Chipmonk4 Nov 2024 21:08 UTC
11 points
1 comment1 min readLW link
(chrislakin.blog)

Distributed espionage

margetmagenta4 Nov 2024 19:43 UTC
3 points
0 comments1 min readLW link

We can survive

Oxidize4 Nov 2024 19:33 UTC
−11 points
3 comments2 min readLW link

GPT-8 may not be ASI

rvzlxax4094 Nov 2024 19:31 UTC
−4 points
0 comments3 min readLW link