The Cognitive Bootcamp Agreement

Raemon 16 Oct 2024 23:24 UTC
34 points
0 comments, 8 min read

Bitter lessons about lucid dreaming

avturchin 16 Oct 2024 21:27 UTC
77 points
62 comments, 2 min read

Towards Quantitative AI Risk Management

16 Oct 2024 19:26 UTC
28 points
1 comment, 6 min read

Why Academia is Mostly Not Truth-Seeking

Zero Contradictions 16 Oct 2024 19:14 UTC
−7 points
6 comments, 1 min read
(thewaywardaxolotl.blogspot.com)

Launching Adjacent News

Lucas Kohorst 16 Oct 2024 17:58 UTC
23 points
0 comments, 4 min read

[Question] Interest in Leetcode, but for Rationality?

Gregory 16 Oct 2024 17:54 UTC
74 points
20 comments, 2 min read

Request for advice: Research for Conversational Game Theory for LLMs

Rome Viharo 16 Oct 2024 17:53 UTC
10 points
0 comments, 1 min read

Why humans won’t control superhuman AIs.

Spiritus Dei 16 Oct 2024 16:48 UTC
−11 points
1 comment, 6 min read

Against empathy-by-default

Steven Byrnes 16 Oct 2024 16:38 UTC
60 points
24 comments, 7 min read

cancer rates after gene therapy

bhauth 16 Oct 2024 15:32 UTC
49 points
0 comments, 3 min read
(bhauth.com)

Monthly Roundup #23: October 2024

Zvi 16 Oct 2024 13:50 UTC
39 points
13 comments, 50 min read
(thezvi.wordpress.com)

[Question] Change My Mind: Thirders in “Sleeping Beauty” are Just Doing Epistemology Wrong

DragonGod 16 Oct 2024 10:20 UTC
8 points
67 comments, 6 min read

[Question] After uploading your consciousness...

Jinge Wang 16 Oct 2024 3:52 UTC
−2 points
0 comments, 1 min read

The ELYSIUM Proposal - Extrapolated voLitions Yielding Separate Individualized Utopias for Mankind

Roko 16 Oct 2024 1:24 UTC
10 points
18 comments, 1 min read
(transhumanaxiology.substack.com)

Bellevue Meetup

Cedar 16 Oct 2024 1:07 UTC
3 points
0 comments, 1 min read

Singular Learning Theory for Dummies

Rahul Chand 15 Oct 2024 21:13 UTC
2 points
0 comments, 8 min read

Distillation Of DeepSeek-Prover V1.5

IvanLin 15 Oct 2024 18:53 UTC
4 points
1 comment, 3 min read

Improving Model-Written Evals for AI Safety Benchmarking

15 Oct 2024 18:25 UTC
27 points
0 comments, 18 min read

Taking nonlogical concepts seriously

Kris Brown 15 Oct 2024 18:16 UTC
7 points
5 comments, 18 min read
(topos.site)

Rashomon—A newsbetting site

ideasthete 15 Oct 2024 18:15 UTC
23 points
8 comments, 1 min read

On the Practical Applications of Interpretability

Nick Jiang 15 Oct 2024 17:18 UTC
3 points
1 comment, 7 min read

Anthropic’s updated Responsible Scaling Policy

Zac Hatfield-Dodds 15 Oct 2024 16:46 UTC
51 points
3 comments, 3 min read
(www.anthropic.com)

[Question] When is reward ever the optimization target?

Noosphere89 15 Oct 2024 15:09 UTC
35 points
12 comments, 1 min read

An Opinionated Evals Reading List

15 Oct 2024 14:38 UTC
65 points
0 comments, 13 min read
(www.apolloresearch.ai)

Anthropic rewrote its RSP

Zach Stein-Perlman 15 Oct 2024 14:25 UTC
46 points
19 comments, 6 min read

[Intuitive self-models] 5. Dissociative Identity (Multiple Personality) Disorder

Steven Byrnes 15 Oct 2024 13:31 UTC
58 points
7 comments, 11 min read

Economics Roundup #4

Zvi 15 Oct 2024 13:20 UTC
19 points
4 comments, 25 min read
(thezvi.wordpress.com)

[Question] Is School of Thought related to the Rationality Community?

Shoshannah Tekofsky 15 Oct 2024 12:41 UTC
7 points
11 comments, 1 min read

Inverse Problems In Everyday Life

silentbob 15 Oct 2024 11:42 UTC
14 points
2 comments, 8 min read

Thinking LLMs: General Instruction Following with Thought Generation

Bogdan Ionut Cirstea 15 Oct 2024 9:21 UTC
7 points
0 comments, 1 min read
(arxiv.org)

Thoughts On the Nature of Capability Elicitation via Fine-tuning

Theodore Chapman 15 Oct 2024 8:39 UTC
8 points
0 comments, 8 min read

Minimal Motivation of Natural Latents

14 Oct 2024 22:51 UTC
45 points
14 comments, 3 min read

How long should political (and other) terms be?

ohmurphy 14 Oct 2024 21:38 UTC
5 points
0 comments, 1 min read
(ohmurphy.substack.com)

Examples of How I Use LLMs

jefftk 14 Oct 2024 17:10 UTC
29 points
2 comments, 2 min read
(www.jefftk.com)

It’s important to know when to stop: Mechanistic Exploration of Gemma 2 List Generation

Gerard Boxo 14 Oct 2024 17:04 UTC
8 points
0 comments, 6 min read
(gboxo.github.io)

[Question] LW resources on childhood experiences?

nahir91595 14 Oct 2024 17:04 UTC
10 points
7 comments, 1 min read

Free Will, Neurotypical Dominance, and the Path to ASI and Neuralinks: Evolving Beyond Scarcity

j_passeri 14 Oct 2024 16:54 UTC
−2 points
3 comments, 3 min read

Breakthroughs, Neurodivergence, and Working Outside the System

j_passeri 14 Oct 2024 16:54 UTC
1 point
3 comments, 2 min read

The case for unlearning that removes information from LLM weights

Fabien Roger 14 Oct 2024 14:08 UTC
96 points
15 comments, 6 min read

Circuits in Superposition: Compressing many small neural networks into one

14 Oct 2024 13:06 UTC
127 points
8 comments, 13 min read

Beyond Defensive Technology

ejk64 14 Oct 2024 11:34 UTC
11 points
1 comment, 10 min read

Why Stop AI is barricading OpenAI

Remmelt 14 Oct 2024 7:12 UTC
−16 points
32 comments, 1 min read
(docs.google.com)

The Explore vs. Exploit Dilemma

nathanjzhao 14 Oct 2024 6:20 UTC
1 point
0 comments, 1 min read
(nathanzhao.cc)

AI Alignment via Slow Substrates: Early Empirical Results With StarCraft II

Lester Leong 14 Oct 2024 4:05 UTC
60 points
9 comments, 12 min read

some questionable space launch guns

bhauth 13 Oct 2024 22:52 UTC
17 points
0 comments, 4 min read
(bhauth.com)

[Question] What are your favorite books or blogs that are out of print, or whose domains have expired (especially if they also aren’t on LibGen/Wayback/etc, or on Amazon)?

Arjun Panickssery 13 Oct 2024 20:21 UTC
13 points
4 comments, 1 min read

The Hopium Wars: the AGI Entente Delusion

Max Tegmark 13 Oct 2024 17:00 UTC
200 points
55 comments, 9 min read

Parental Writing Selection Bias

jefftk 13 Oct 2024 14:00 UTC
52 points
3 comments, 1 min read
(www.jefftk.com)

Personal Philosophy

Xor 13 Oct 2024 3:01 UTC
3 points
0 comments, 2 min read

Contagious Beliefs—Simulating Political Alignment

James Stephen Brown 13 Oct 2024 0:27 UTC
8 points
0 comments, 2 min read
(nonzerosum.games)