An open re­sponse to Wit­tkot­ter and Yampolskiy

Donald Hobson24 Sep 2024 22:27 UTC
8 points
0 comments4 min readLW link

A Path out of In­suffi­cient Views

Unreal24 Sep 2024 20:00 UTC
55 points
46 comments9 min readLW link

How to give effec­tively to US Dems

Hauke Hillebrandt24 Sep 2024 14:38 UTC
2 points
0 comments1 min readLW link
(www.slowboring.com)

[Question] How do you fol­low AI (safety) news?

PeterH24 Sep 2024 13:58 UTC
4 points
2 comments1 min readLW link

In­struc­tion Fol­low­ing with­out In­struc­tion Tuning

Bogdan Ionut Cirstea24 Sep 2024 13:49 UTC
17 points
0 comments1 min readLW link
(arxiv.org)

Book Re­view: On the Edge: The Gamblers

Zvi24 Sep 2024 11:50 UTC
35 points
1 comment89 min readLW link
(thezvi.wordpress.com)

Edit­ing at the Take Level

jefftk24 Sep 2024 11:30 UTC
12 points
1 comment1 min readLW link
(www.jefftk.com)

Us­ing LLM’s for AI Foun­da­tion re­search and the Sim­ple Solu­tion assumption

Donald Hobson24 Sep 2024 11:00 UTC
5 points
0 comments2 min readLW link

When to join a re­spectabil­ity cascade

B Jacobs24 Sep 2024 7:54 UTC
10 points
1 comment2 min readLW link
(bobjacobs.substack.com)

Sam­pling Effects on Strate­gic Be­hav­ior in Su­per­vised Learn­ing Models

Phil Bland24 Sep 2024 7:44 UTC
1 point
0 comments6 min readLW link

In Praise of the Beatitudes

robotelvis24 Sep 2024 5:08 UTC
9 points
7 comments3 min readLW link
(messyprogress.substack.com)

[Question] What are the best ar­gu­ments for/​against AIs be­ing “slightly ‘nice’”?

Raemon24 Sep 2024 2:00 UTC
94 points
58 comments31 min readLW link

Strug­gling like a Shadowmoth

Raemon24 Sep 2024 0:47 UTC
176 points
38 comments7 min readLW link

Bounty for Ev­i­dence on Some of Pal­isade Re­search’s Beliefs

23 Sep 2024 20:01 UTC
46 points
4 comments2 min readLW link

Pre­dict­ing In­fluenza Abun­dance in Wastew­a­ter Me­tage­nomic Se­quenc­ing Data

jefftk23 Sep 2024 17:25 UTC
27 points
0 comments4 min readLW link
(naobservatory.org)

A primer on ML in an­ti­body engineering

Abhishaike Mahajan23 Sep 2024 17:03 UTC
11 points
0 comments25 min readLW link
(www.owlposting.com)

[Question] On the sub­ject of in-house large lan­guage mod­els ver­sus im­ple­ment­ing fron­tier models

Annapurna23 Sep 2024 15:00 UTC
7 points
1 comment1 min readLW link

A ba­sic sys­tems ar­chi­tec­ture for AI agents that do au­tonomous research

Buck23 Sep 2024 13:58 UTC
187 points
15 comments8 min readLW link

Book Re­view: On the Edge: The Fundamentals

Zvi23 Sep 2024 13:40 UTC
64 points
3 comments31 min readLW link
(thezvi.wordpress.com)

Switch­ing to a 4GB SD

jefftk23 Sep 2024 11:20 UTC
11 points
1 comment1 min readLW link
(www.jefftk.com)

Model evals for dan­ger­ous capabilities

Zach Stein-Perlman23 Sep 2024 11:00 UTC
51 points
11 comments3 min readLW link

Foun­da­tions—Why Bri­tain has stag­nated [cross­post]

Nathan Young23 Sep 2024 10:43 UTC
23 points
1 comment57 min readLW link
(ukfoundations.co)

Boons and banes

dkl923 Sep 2024 6:18 UTC
7 points
0 comments2 min readLW link
(dkl9.net)

The Sun is big, but su­per­in­tel­li­gences will not spare Earth a lit­tle sunlight

Eliezer Yudkowsky23 Sep 2024 3:39 UTC
205 points
141 comments13 min readLW link

GPT4o is still sen­si­tive to user-in­duced bias when writ­ing code

22 Sep 2024 21:04 UTC
6 points
0 comments4 min readLW link

My 10-year ret­ro­spec­tive on try­ing SSRIs

Kaj_Sotala22 Sep 2024 20:30 UTC
76 points
10 comments2 min readLW link
(kajsotala.fi)

Mak­ing Eggs Without Ovaries

22 Sep 2024 17:44 UTC
56 points
3 comments16 min readLW link
(www.asimov.press)

Becket First

jefftk22 Sep 2024 17:10 UTC
9 points
0 comments2 min readLW link
(www.jefftk.com)

On the Role of Proto-Languages

adamShimi22 Sep 2024 16:50 UTC
54 points
1 comment4 min readLW link
(epistemologicalfascinations.substack.com)

I’m cre­at­ing a deep dive pod­cast epi­sode about the origi­nal Lev­er­age Re­search—would you like to take part?

spencerg22 Sep 2024 14:03 UTC
37 points
2 comments1 min readLW link

Who Feels More Alone?

marvinscheffold22 Sep 2024 11:54 UTC
−8 points
2 comments39 min readLW link

Another ar­gu­ment against max­i­mizer-cen­tric al­ign­ment paradigms

Fiora from Rosebloom22 Sep 2024 7:28 UTC
64 points
39 comments8 min readLW link

My hopes for YouCongress.com

Nathan Helm-Burger22 Sep 2024 3:20 UTC
14 points
3 comments4 min readLW link

How Often Does Tak­ing Away Op­tions Help?

niplav21 Sep 2024 21:52 UTC
20 points
6 comments2 min readLW link

A Ra­tional Com­pany—Seek­ing Advisors

AlignmentOptimizer21 Sep 2024 19:51 UTC
0 points
1 comment1 min readLW link

Seek­ing mentorship

Kevin Afachao21 Sep 2024 16:54 UTC
5 points
0 comments1 min readLW link

Ap­pli­ca­tions of Chaos: Say­ing No (with Hast­ings Greer)

Elizabeth21 Sep 2024 16:30 UTC
50 points
16 comments2 min readLW link
(acesounderglass.com)

In­ves­ti­gat­ing an in­surance-for-AI startup

21 Sep 2024 15:29 UTC
69 points
0 comments16 min readLW link
(www.strataoftheworld.com)

An Un­mea­sured Song of Measurement

jan Sijan21 Sep 2024 15:08 UTC
−3 points
0 comments4 min readLW link

Should Sports Bet­ting Be Banned?

Maxwell Tabarrok21 Sep 2024 14:13 UTC
18 points
2 comments4 min readLW link
(www.maximum-progress.com)

Work with me on agent foun­da­tions: in­de­pen­dent fellowship

Alex_Altair21 Sep 2024 13:59 UTC
49 points
5 comments3 min readLW link

Elec­tric Mandola

jefftk21 Sep 2024 13:40 UTC
9 points
0 comments1 min readLW link
(www.jefftk.com)

Glitch To­ken Cat­a­log - (Al­most) a Full Clear

Lao Mein21 Sep 2024 12:22 UTC
38 points
3 comments37 min readLW link

The Other Ex­is­ten­tial Crisis

James Stephen Brown21 Sep 2024 1:16 UTC
9 points
24 comments2 min readLW link

Ap­ply to MATS 7.0!

21 Sep 2024 0:23 UTC
31 points
0 comments5 min readLW link

Moscow – ACX Mee­tups Every­where Fall 2024

red-hara20 Sep 2024 23:03 UTC
−1 points
0 comments1 min readLW link

Val­i­dat­ing /​ find­ing al­ign­ment-rele­vant con­cepts us­ing neu­ral data

Bogdan Ionut Cirstea20 Sep 2024 21:12 UTC
7 points
0 comments1 min readLW link
(docs.google.com)

Aug­ment­ing Statis­ti­cal Models with Nat­u­ral Lan­guage Parameters

jsteinhardt20 Sep 2024 18:30 UTC
34 points
0 comments8 min readLW link
(bounded-regret.ghost.io)

Fun With The Tab­ula Muris (Se­nis)

sarahconstantin20 Sep 2024 18:20 UTC
25 points
0 comments8 min readLW link
(sarahconstantin.substack.com)

My Cri­tique of Effec­tive Altruism

Dylan Price20 Sep 2024 17:41 UTC
−10 points
7 comments4 min readLW link