AllJan

Stream Entry

lsusr7 Jan 2025 23:56 UTC
41 points
0 comments4 min readLW link

Don’t fall for on­tol­ogy pyra­mid schemes

Lorec7 Jan 2025 23:29 UTC
12 points
3 comments2 min readLW link

Bridge­wa­ter x Me­tac­u­lus Fore­cast­ing Con­test Goes Global — Feb 3, $25k, Opportunities

ChristianWilliams7 Jan 2025 21:40 UTC
10 points
0 comments1 min readLW link
(www.metaculus.com)

A Prin­ci­pled Car­toon Guide to NVC

7 Jan 2025 21:01 UTC
25 points
5 comments5 min readLW link

Disagree­ment on AGI Suggests It’s Near

tangerine7 Jan 2025 20:42 UTC
24 points
5 comments1 min readLW link

Role em­bed­dings: mak­ing au­thor­ship more salient to LLMs

7 Jan 2025 20:13 UTC
38 points
0 comments8 min readLW link

Will bird flu be the next Covid? “Lit­tle chance” says my dash­board.

Nathan Young7 Jan 2025 20:10 UTC
19 points
0 comments1 min readLW link

[Fic­tion] [Comic] Effec­tive Altru­ism and Ra­tion­al­ity meet at a Sec­u­lar Sols­tice afterparty

tandem7 Jan 2025 19:11 UTC
94 points
4 comments1 min readLW link

Pre­dict­ing AI Re­leases Through Side Channels

Reworr R7 Jan 2025 19:06 UTC
11 points
0 comments1 min readLW link

Re­but­tals for ~all crit­i­cisms of AIXI

Cole Wyeth7 Jan 2025 17:41 UTC
17 points
5 comments14 min readLW link

OpenAI #10: Reflections

Zvi7 Jan 2025 17:00 UTC
131 points
6 comments11 min readLW link
(thezvi.wordpress.com)

Other im­pli­ca­tions of rad­i­cal empathy

MichaelStJules7 Jan 2025 16:10 UTC
3 points
0 comments1 min readLW link

Ac­tu­al­ism, asym­me­try and extinction

MichaelStJules7 Jan 2025 16:02 UTC
−1 points
0 comments1 min readLW link

Med­i­ta­tion in­sights as phase shifts in your self-model

Jonas Hallgren7 Jan 2025 10:09 UTC
7 points
1 comment3 min readLW link

Alle­vi­at­ing shrimp pain is im­moral.

G Wood7 Jan 2025 7:28 UTC
−5 points
0 comments4 min readLW link

D&D.Sci Dun­geon­build­ing: the Dun­geon Tour­na­ment Eval­u­a­tion & Ruleset

aphyer7 Jan 2025 5:02 UTC
27 points
5 comments5 min readLW link

Incredibow

jefftk7 Jan 2025 3:30 UTC
17 points
3 comments1 min readLW link
(www.jefftk.com)

Build­ing Big Science from the Bot­tom-Up: A Frac­tal Ap­proach to AI Safety

Lauren Greenspan7 Jan 2025 3:08 UTC
37 points
2 comments12 min readLW link

My Ex­pe­rience With A Mag­net Implant

Vale7 Jan 2025 3:01 UTC
5 points
2 comments1 min readLW link
(vale.rocks)

You should de­lay en­g­ineer­ing-heavy re­search in light of R&D automation

Daniel Paleka7 Jan 2025 2:11 UTC
32 points
3 comments5 min readLW link
(newsletter.danielpaleka.com)

Test­ing for Schem­ing with Model Deletion

Guive7 Jan 2025 1:54 UTC
59 points
12 comments21 min readLW link
(guive.substack.com)

Guilt, Shame, and Depravity

Benquo7 Jan 2025 1:16 UTC
11 points
2 comments4 min readLW link

Turn­ing up the Heat on De­cep­tively-Misal­igned AI

J Bostock7 Jan 2025 0:13 UTC
19 points
15 comments4 min readLW link

(My) self-refer­en­tial rea­son to be­lieve in free will

jacek6 Jan 2025 23:35 UTC
16 points
5 comments1 min readLW link

[Question] Is my dis­tinc­tive­ness ev­i­dence for be­ing in a simu­la­tion?

AynonymousPrsn1236 Jan 2025 21:27 UTC
8 points
42 comments2 min readLW link

Defi­ni­tion of al­ign­ment sci­ence I like

quetzal_rainbow6 Jan 2025 20:40 UTC
19 points
0 comments3 min readLW link

How will we up­date about schem­ing?

ryan_greenblatt6 Jan 2025 20:21 UTC
128 points
4 comments36 min readLW link

What Indi­ca­tors Should We Watch to Disam­biguate AGI Timelines?

snewman6 Jan 2025 19:57 UTC
115 points
32 comments13 min readLW link

Gen­er­at­ing Cog­nate­ful Sen­tences with Large Lan­guage Models

vkethana6 Jan 2025 18:40 UTC
6 points
0 comments10 min readLW link

Really rad­i­cal empathy

MichaelStJules6 Jan 2025 17:46 UTC
19 points
0 comments1 min readLW link

In­de­pen­dent re­search ar­ti­cle an­a­lyz­ing con­sis­tent self-re­ports of ex­pe­rience in ChatGPT and Claude

rife6 Jan 2025 17:34 UTC
3 points
8 comments1 min readLW link
(awakenmoon.ai)

[Question] Meal Re­place­ments in 2025?

alkjash6 Jan 2025 15:37 UTC
19 points
9 comments1 min readLW link

AI safety con­tent you could create

Adam Jones6 Jan 2025 15:35 UTC
18 points
0 comments5 min readLW link
(adamjones.me)

Child­hood and Ed­u­ca­tion #8: Deal­ing with the Internet

Zvi6 Jan 2025 14:00 UTC
32 points
6 comments13 min readLW link
(thezvi.wordpress.com)

La­tent Ad­ver­sar­ial Train­ing (LAT) Im­proves the Rep­re­sen­ta­tion of Refusal

6 Jan 2025 10:24 UTC
17 points
5 comments10 min readLW link

Alter­na­tive Cancer Care As Bio­hack­ing & Book Re­view: Sur­viv­ing “Ter­mi­nal” Cancer

DenizT6 Jan 2025 7:43 UTC
31 points
4 comments15 min readLW link

Es­ti­mat­ing the benefits of a new flu drug (BXM)

DirectedEvolution6 Jan 2025 4:31 UTC
34 points
2 comments3 min readLW link

Mea­sur­ing Non­lin­ear Fea­ture In­ter­ac­tions in Sparse Cross­coders [Pro­ject Pro­posal]

6 Jan 2025 4:22 UTC
19 points
0 comments12 min readLW link

Speedrun­ning Ra­tion­al­ity: Day II

aproteinengine6 Jan 2025 3:59 UTC
6 points
3 comments2 min readLW link

“We know how to build AGI”—Sam Altman

Nikola Jurkovic6 Jan 2025 2:05 UTC
62 points
5 comments1 min readLW link
(blog.samaltman.com)

[Question] Is “hid­den com­plex­ity of wishes prob­lem” solved?

Roman Malov5 Jan 2025 22:59 UTC
10 points
4 comments1 min readLW link

A Ground-Level Per­spec­tive on Ca­pac­ity Build­ing in In­ter­na­tional Development

Sean Aubin5 Jan 2025 20:36 UTC
10 points
1 comment8 min readLW link

Why Lin­ear AI Safety Hits a Wall and How Frac­tal In­tel­li­gence Un­locks Non-Lin­ear Solutions

Andy E Williams5 Jan 2025 17:08 UTC
−3 points
6 comments5 min readLW link

How to Do a PhD (in AI Safety)

Lewis Hammond5 Jan 2025 16:57 UTC
6 points
0 comments1 min readLW link
(lewishammond.com)

Rea­sons for and against work­ing on tech­ni­cal AI safety at a fron­tier AI lab

bilalchughtai5 Jan 2025 14:49 UTC
89 points
12 comments12 min readLW link

Op­pres­sion and pro­duc­tion are com­pet­ing ex­pla­na­tions for wealth in­equal­ity.

Benquo5 Jan 2025 14:13 UTC
32 points
15 comments8 min readLW link
(benjaminrosshoffman.com)

Max­i­miz­ing Com­mu­ni­ca­tion, not Traffic

jefftk5 Jan 2025 13:00 UTC
133 points
7 comments1 min readLW link
(www.jefftk.com)

Poli­cy­mak­ers don’t have ac­cess to pay­walled articles

Adam Jones5 Jan 2025 10:56 UTC
17 points
4 comments2 min readLW link
(adamjones.me)

Cap­i­tal Own­er­ship Will Not Prevent Hu­man Disempowerment

beren5 Jan 2025 6:00 UTC
112 points
9 comments14 min readLW link

Chi­nese Re­searchers Crack ChatGPT: Repli­cat­ing OpenAI’s Ad­vanced AI Model

Evan_Gaensbauer5 Jan 2025 3:50 UTC
−8 points
1 comment1 min readLW link
(www.geeky-gadgets.com)