Where Does Ad­ver­sar­ial Pres­sure Come From?

quetzal_rainbow14 Dec 2023 22:31 UTC
16 points
1 comment2 min readLW link

Epoch wise crit­i­cal pe­ri­ods, and sin­gu­lar learn­ing theory

Garrett Baker14 Dec 2023 20:55 UTC
9 points
1 comment5 min readLW link

OpenAI Su­per­al­ign­ment: Weak-to-strong generalization

Dalmert14 Dec 2023 19:47 UTC
25 points
3 comments1 min readLW link
(openai.com)

Ap­pli­ca­tions for EA Global are still open!

Eli_Nathan14 Dec 2023 19:10 UTC
1 point
0 comments1 min readLW link

Per­sonal Devel­op­ment Sys­tem: Win­ning Re­peat­edly and Grow­ing Effec­tively With The BIG4

Paul Rohde14 Dec 2023 18:49 UTC
13 points
0 comments33 min readLW link
(blog.paul-rohde.com)

In­tro­duc­ing The ‘From Big Ideas To Real-World Re­sults’: A Series for Effec­tive Per­sonal Development

Paul Rohde14 Dec 2023 18:49 UTC
13 points
1 comment8 min readLW link
(blog.paul-rohde.com)

Talk­ing With Peo­ple Who Speak to Con­gres­sional Staffers about AI risk

Eneasz14 Dec 2023 17:55 UTC
32 points
0 comments1 min readLW link
(www.thebayesianconspiracy.com)

Bayesian Injustice

Kevin Dorst14 Dec 2023 15:44 UTC
124 points
10 comments6 min readLW link
(kevindorst.substack.com)

AI #42: The Wrong Answer

Zvi14 Dec 2023 14:50 UTC
67 points
6 comments54 min readLW link
(thezvi.wordpress.com)

Some for-profit AI al­ign­ment org ideas

Eric Ho14 Dec 2023 14:23 UTC
86 points
19 comments9 min readLW link

Map­ping the se­man­tic void: Strange go­ings-on in GPT em­bed­ding spaces

mwatkins14 Dec 2023 13:10 UTC
114 points
31 comments14 min readLW link

Cat­e­gor­i­cal Or­ga­ni­za­tion in Me­mory: ChatGPT Or­ga­nizes the 665 Topic Tags from My New Sa­vanna Blog

Bill Benzon14 Dec 2023 13:02 UTC
0 points
6 comments2 min readLW link

Mo­ral Mountains

Adam Zerner14 Dec 2023 10:40 UTC
8 points
10 comments2 min readLW link

Up­date on Chi­nese IQ-re­lated gene panels

Lao Mein14 Dec 2023 10:12 UTC
70 points
7 comments1 min readLW link

Red Line Ash­mont Train is Now Approaching

jefftk14 Dec 2023 2:50 UTC
23 points
2 comments1 min readLW link
(www.jefftk.com)

Var­i­ous AI doom path­ways (and how likely they are)

Logan Zoellner14 Dec 2023 0:54 UTC
1 point
1 comment4 min readLW link
(midwitalignment.substack.com)

Are There Ex­am­ples of Over­hang for Other Tech­nolo­gies?

Jeffrey Heninger13 Dec 2023 21:48 UTC
59 points
50 comments11 min readLW link
(blog.aiimpacts.org)

Is be­ing sexy for your homies?

Valentine13 Dec 2023 20:37 UTC
169 points
97 comments14 min readLW link2 reviews

How bad is chlo­ri­nated wa­ter?

bhauth13 Dec 2023 18:00 UTC
43 points
18 comments3 min readLW link
(www.bhauth.com)

[Question] Sugges­tions for net pos­i­tive LLM research

Cole Wyeth13 Dec 2023 17:29 UTC
13 points
6 comments1 min readLW link

AI Con­trol: Im­prov­ing Safety De­spite In­ten­tional Subversion

13 Dec 2023 15:51 UTC
228 points
18 comments10 min readLW link1 review

The Busy Bee Brain

Bill Benzon13 Dec 2023 13:10 UTC
11 points
0 comments6 min readLW link

The Best of Don’t Worry About the Vase

Zvi13 Dec 2023 12:50 UTC
55 points
4 comments13 min readLW link
(thezvi.wordpress.com)

[Question] Has any­one here in­ves­ti­gated the oc­cult com­mu­nity? It is cu­ri­ous to me that many ma­gi­ci­ans con­sider them­selves em­piri­cists.

SpectrumDT13 Dec 2023 11:09 UTC
5 points
10 comments1 min readLW link

AI Views Snapshots

Rob Bensinger13 Dec 2023 0:45 UTC
142 points
61 comments1 min readLW link

The con­ver­gent dy­namic we missed

Remmelt12 Dec 2023 23:19 UTC
2 points
2 comments1 min readLW link

A Kind­ness, or The Inevitable Con­se­quence of Perfect In­fer­ence (a short story)

samhealy12 Dec 2023 23:03 UTC
6 points
0 comments9 min readLW link

Love, Rev­er­ence, and Life

12 Dec 2023 21:49 UTC
36 points
9 comments28 min readLW link2 reviews

Ta­boo “pro­cras­ti­na­tion”

Neil 12 Dec 2023 21:33 UTC
19 points
7 comments1 min readLW link

En­hanc­ing in­tel­li­gence by bang­ing your head on the wall

Bezzi12 Dec 2023 21:00 UTC
37 points
26 comments1 min readLW link

Yamaha P-Series Overview

jefftk12 Dec 2023 20:30 UTC
10 points
1 comment1 min readLW link
(www.jefftk.com)

Balsa Up­date and Gen­eral Thank You

Zvi12 Dec 2023 20:30 UTC
61 points
8 comments8 min readLW link
(thezvi.wordpress.com)

Towards an Ethics Calcu­la­tor for Use by an AGI

sweenesm12 Dec 2023 18:37 UTC
3 points
2 comments11 min readLW link

Why Psy­chol­o­gists Are Wrong About The Illu­sion Of Ex­plana­tory Depth

moses onyedikachukwu12 Dec 2023 18:32 UTC
1 point
0 comments4 min readLW link

A de­sign con­cept for su­per­in­tel­li­gent ma­chines (and Pop­per’s cri­tique of in­duc­tion)

tiplur-bilrex12 Dec 2023 18:31 UTC
−7 points
6 comments1 min readLW link
(tiplur-bilrex.tlon.network)

Sig­nifi­cantly En­hanc­ing Adult In­tel­li­gence With Gene Edit­ing May Be Possible

12 Dec 2023 18:14 UTC
436 points
189 comments33 min readLW link

[Question] Why No Au­to­mated Plagerism De­tec­tion For Past Papers?

Lao Mein12 Dec 2023 17:24 UTC
7 points
10 comments1 min readLW link

OpenAI: Leaks Con­firm the Story

Zvi12 Dec 2023 14:00 UTC
77 points
9 comments16 min readLW link
(thezvi.wordpress.com)

Nav­i­gat­ing the Attackspace

Jonas Kgomo12 Dec 2023 13:59 UTC
1 point
0 comments2 min readLW link

Non­lin­ear’s Ev­i­dence: De­bunk­ing False and Mislead­ing Claims

KatWoods12 Dec 2023 13:16 UTC
104 points
171 comments1 min readLW link

AI In­sti­tu­tion De­sign Hackathon (EAG Bay Area Satel­lite Event)

12 Dec 2023 13:10 UTC
1 point
0 comments1 min readLW link

Fund­ing case: AI Safety Camp

12 Dec 2023 9:08 UTC
66 points
5 comments6 min readLW link
(manifund.org)

What is the next level of ra­tio­nal­ity?

12 Dec 2023 8:14 UTC
48 points
24 comments7 min readLW link

Embed­ded Agents are Quines

12 Dec 2023 4:57 UTC
11 points
7 comments8 min readLW link

Pre­dict the fu­ture! Earn fake in­ter­net points! Get a (free) gam­bling ad­dic­tion!

Robert Cousineau12 Dec 2023 4:39 UTC
3 points
0 comments1 min readLW link

The likely first longevity drug is based on sketchy sci­ence. This is bad for sci­ence and bad for longevity.

BobBurgers12 Dec 2023 2:42 UTC
161 points
34 comments5 min readLW link

When will GPT-5 come out? Pre­dic­tion mar­kets vs. Extrapolation

Malte12 Dec 2023 2:41 UTC
12 points
9 comments3 min readLW link

On plans for a func­tional society

12 Dec 2023 0:07 UTC
41 points
8 comments13 min readLW link

Se­condary Risk Markets

Vaniver11 Dec 2023 21:52 UTC
35 points
4 comments4 min readLW link

Has any­one ex­per­i­mented with Do­drio, a tool for ex­plor­ing trans­former mod­els through in­ter­ac­tive vi­su­al­iza­tion?

Bill Benzon11 Dec 2023 20:34 UTC
4 points
0 comments1 min readLW link