RSS

$500 Bounty Prob­lem: Are (Ap­prox­i­mately) Deter­minis­tic Nat­u­ral La­tents All You Need?

21 Apr 2025 20:19 UTC
67 points
1 comment3 min readLW link

More Than Just A, T, C, and G: Screen­ing for Hid­den Dangers in DNA Sequences

sgd21 Apr 2025 20:12 UTC
1 point
0 comments11 min readLW link

Pod­cast on “AI tools for ex­is­ten­tial se­cu­rity” — transcript

21 Apr 2025 19:26 UTC
11 points
0 comments43 min readLW link
(pnc.st)

Im­pli­ca­tions for the like­li­hood of hu­man ex­tinc­tion from the re­cent dis­cov­ery of pos­si­ble micro­bial life

Mvolz21 Apr 2025 19:15 UTC
0 points
2 comments1 min readLW link

Key event tracker for AI2027

MarkelKori21 Apr 2025 19:02 UTC
1 point
0 comments1 min readLW link

The Uses of Complacency

sarahconstantin21 Apr 2025 18:50 UTC
50 points
3 comments8 min readLW link
(sarahconstantin.substack.com)

Fea­ture-Based Anal­y­sis of Safety-Rele­vant Multi-Agent Behavior

21 Apr 2025 18:12 UTC
2 points
0 comments5 min readLW link

Crime and Pu­n­ish­ment #1

Zvi21 Apr 2025 15:30 UTC
37 points
4 comments39 min readLW link
(thezvi.wordpress.com)

Im­prov­ing CNNs with Klein Net­works: A Topolog­i­cal Ap­proach to AI

Gunnar Carlsson21 Apr 2025 15:21 UTC
17 points
4 comments5 min readLW link

Eu­logy to the Obits

21 Apr 2025 14:10 UTC
2 points
1 comment10 min readLW link

Re­search Notes: Run­ning Claude 3.7, Gem­ini 2.5 Pro, and o3 on Poké­mon Red

Julian Bradshaw21 Apr 2025 3:52 UTC
88 points
10 comments14 min readLW link

Not All Beliefs Are Created Equal: Di­ag­nos­ing Toxic Ideologies

Big_friendly_kiwi21 Apr 2025 3:18 UTC
14 points
5 comments9 min readLW link

AI 2027 is a Bet Against Am­dahl’s Law

snewman21 Apr 2025 3:09 UTC
101 points
26 comments9 min readLW link

Sev­er­ance and the Ethics of the Con­scious Agents

Crissman21 Apr 2025 2:21 UTC
3 points
0 comments1 min readLW link

March-April 2025 Progress in Guaran­teed Safe AI

Quinn20 Apr 2025 19:00 UTC
6 points
0 comments4 min readLW link
(gsai.substack.com)

How to end credentialism

Yair Halberstadt20 Apr 2025 18:50 UTC
13 points
14 comments8 min readLW link

Spend­ing on Ourselves

jefftk20 Apr 2025 18:40 UTC
21 points
0 comments3 min readLW link
(www.jefftk.com)

In­ter­est­ing ACX 2024 Book Re­view Entries

jenn20 Apr 2025 18:10 UTC
23 points
1 comment4 min readLW link

[Question] To what ethics is an AGI ac­tu­ally safely al­ignable?

StanislavKrym20 Apr 2025 17:09 UTC
1 point
6 comments4 min readLW link

Eval­u­at­ing Over­sight Ro­bust­ness with In­cen­tivized Re­ward Hacking

20 Apr 2025 16:53 UTC
1 point
0 comments15 min readLW link