Inos­i­tol Non-Results

Elizabeth29 Nov 2023 21:40 UTC
20 points
2 comments1 min readLW link
(acesounderglass.com)

Los­ing Me­taphors: Zip and Paste

jefftk29 Nov 2023 20:31 UTC
26 points
6 comments1 min readLW link
(www.jefftk.com)

Pre­serv­ing our her­i­tage: Build­ing a move­ment and a knowl­edge ark for cur­rent and fu­ture generations

rnk829 Nov 2023 19:20 UTC
0 points
5 comments12 min readLW link

AGI Align­ment is Absurd

Youssef Mohamed29 Nov 2023 19:11 UTC
−9 points
4 comments3 min readLW link

The ori­gins of the steam en­g­ine: An es­say with in­ter­ac­tive an­i­mated diagrams

jasoncrawford29 Nov 2023 18:30 UTC
30 points
1 comment1 min readLW link
(rootsofprogress.org)

ChatGPT 4 solved all the gotcha prob­lems I posed that tripped ChatGPT 3.5

VipulNaik29 Nov 2023 18:11 UTC
33 points
16 comments14 min readLW link

“Clean” vs. “messy” goal-di­rect­ed­ness (Sec­tion 2.2.3 of “Schem­ing AIs”)

Joe Carlsmith29 Nov 2023 16:32 UTC
29 points
1 comment11 min readLW link

Ly­ing Align­ment Chart

Zack_M_Davis29 Nov 2023 16:15 UTC
76 points
17 comments1 min readLW link

Re­think Pri­ori­ties: Seek­ing Ex­pres­sions of In­ter­est for Spe­cial Pro­jects Next Year

kierangreig29 Nov 2023 13:59 UTC
4 points
0 comments5 min readLW link

[Question] Thoughts on tele­trans­porta­tion with copies?

titotal29 Nov 2023 12:56 UTC
15 points
13 comments1 min readLW link

In­ter­pretabil­ity with Sparse Au­toen­coders (Co­lab ex­er­cises)

CallumMcDougall29 Nov 2023 12:56 UTC
74 points
9 comments4 min readLW link

The 101 Space You Will Always Have With You

Screwtape29 Nov 2023 4:56 UTC
250 points
20 comments6 min readLW link

Trust your in­tu­ition—Kah­ne­man’s book misses the for­est for the trees

mnvr29 Nov 2023 4:37 UTC
−2 points
2 comments2 min readLW link

Pro­cess Sub­sti­tu­tion Without Shell?

jefftk29 Nov 2023 3:20 UTC
19 points
18 comments2 min readLW link
(www.jefftk.com)

De­cep­tion Chess: Game #2

Zane29 Nov 2023 2:43 UTC
29 points
17 comments2 min readLW link

Black Box Biology

GeneSmith29 Nov 2023 2:27 UTC
62 points
30 comments2 min readLW link

[Question] What would be the shelf life of nu­clear weapon-se­crecy if nu­clear weapons had not im­me­di­ately been used in com­bat?

Gram Stone29 Nov 2023 0:53 UTC
7 points
2 comments1 min readLW link

Scal­ing laws for dom­i­nant as­surance contracts

jessicata28 Nov 2023 23:11 UTC
36 points
5 comments7 min readLW link
(unstableontology.com)

I’m con­fused about in­nate smell neuroanatomy

Steven Byrnes28 Nov 2023 20:49 UTC
39 points
2 comments9 min readLW link

How to Con­trol an LLM’s Be­hav­ior (why my P(DOOM) went down)

RogerDearnaley28 Nov 2023 19:56 UTC
64 points
30 comments11 min readLW link

[Question] Is there a word for dis­crim­i­na­tion against A.I.?

Aaron Bohannon28 Nov 2023 19:03 UTC
1 point
4 comments1 min readLW link

Up­date #2 to “Dom­i­nant As­surance Con­tract Plat­form”: EnsureDone

moyamo28 Nov 2023 18:02 UTC
33 points
2 comments1 min readLW link

Ethico­physics II: Poli­tics is the Mind-Savior

MadHatter28 Nov 2023 16:27 UTC
−9 points
9 comments4 min readLW link
(bittertruths.substack.com)

Nei­ther EA nor e/​acc is what we need to build the future

jasoncrawford28 Nov 2023 16:04 UTC
0 points
22 comments3 min readLW link
(rootsofprogress.org)

Agen­tic Growth

Logan Kieller28 Nov 2023 15:45 UTC
1 point
0 comments3 min readLW link
(logankieller.substack.com)

AISC pro­ject: How promis­ing is au­tomat­ing al­ign­ment re­search? (liter­a­ture re­view)

Bogdan Ionut Cirstea28 Nov 2023 14:47 UTC
4 points
1 comment1 min readLW link
(docs.google.com)

A day in the life of a mechanis­tic in­ter­pretabil­ity researcher

Bill Benzon28 Nov 2023 14:45 UTC
3 points
3 comments1 min readLW link

Two sources of be­yond-epi­sode goals (Sec­tion 2.2.2 of “Schem­ing AIs”)

Joe Carlsmith28 Nov 2023 13:49 UTC
11 points
1 comment15 min readLW link

Self-Refer­en­tial Prob­a­bil­is­tic Logic Ad­mits the Payor’s Lemma

Yudhister Kumar28 Nov 2023 10:27 UTC
80 points
14 comments6 min readLW link

[Question] How can I use AI with­out in­creas­ing AI-risk?

Yoav Ravid28 Nov 2023 10:05 UTC
18 points
6 comments1 min readLW link

A Read­ing From The Book Of Sequences

Screwtape28 Nov 2023 6:45 UTC
8 points
0 comments4 min readLW link

An­thropic Fall 2023 De­bate Progress Update

Ansh Radhakrishnan28 Nov 2023 5:37 UTC
74 points
9 comments12 min readLW link

Apoca­lypse in­surance, and the hardline liber­tar­ian take on AI risk

So8res28 Nov 2023 2:09 UTC
122 points
38 comments7 min readLW link

My techno-op­ti­mism [By Vi­talik Bu­terin]

habryka27 Nov 2023 23:53 UTC
107 points
17 comments2 min readLW link
(www.lesswrong.com)

[Question] Could Ger­many have won World War I with high prob­a­bil­ity given the benefit of hind­sight?

Roko27 Nov 2023 22:52 UTC
10 points
18 comments1 min readLW link

[Question] Could World War I have been pre­vented given the benefit of hind­sight?

Roko27 Nov 2023 22:39 UTC
16 points
8 comments1 min readLW link

AISC 2024 - Pro­ject Summaries

NickyP27 Nov 2023 22:32 UTC
48 points
3 comments18 min readLW link

“Epistemic range of mo­tion” and LessWrong moderation

27 Nov 2023 21:58 UTC
60 points
3 comments12 min readLW link

Ap­ply to the Con­cep­tual Boundaries Work­shop for AI Safety

Chipmonk27 Nov 2023 21:04 UTC
50 points
0 comments3 min readLW link

There is no IQ for AI

Gabriel Alfour27 Nov 2023 18:21 UTC
30 points
10 comments9 min readLW link
(cognition.cafe)

Two con­cepts of an “epi­sode” (Sec­tion 2.2.1 of “Schem­ing AIs”)

Joe Carlsmith27 Nov 2023 18:01 UTC
19 points
1 comment13 min readLW link

[Linkpost] Ge­orge Mack’s Razors

trevor27 Nov 2023 17:53 UTC
38 points
8 comments3 min readLW link
(twitter.com)

On pos­si­ble cross-fer­til­iza­tion be­tween AI and neu­ro­science [Creativity]

Bill Benzon27 Nov 2023 16:50 UTC
15 points
22 comments7 min readLW link

Ethico­physics I

MadHatter27 Nov 2023 15:44 UTC
−1 points
16 comments1 min readLW link
(open.substack.com)

Sen­tience In­sti­tute 2023 End of Year Summary

michael_dello27 Nov 2023 12:11 UTC
11 points
0 comments5 min readLW link
(www.sentienceinstitute.org)

[Question] A Ques­tion about Cor­rigi­bil­ity (2015)

A.H.27 Nov 2023 12:05 UTC
4 points
2 comments1 min readLW link

Ap­pen­dices to the live agendas

27 Nov 2023 11:10 UTC
16 points
4 comments1 min readLW link

Shal­low re­view of live agen­das in al­ign­ment & safety

27 Nov 2023 11:10 UTC
322 points
69 comments29 min readLW link

Napoleon stole the Ro­man In­qui­si­tion archives and in­ves­ti­gated the Gal­ileo case

Meow P27 Nov 2023 9:41 UTC
−3 points
0 comments1 min readLW link
(www.cricetuscricetus.co.uk)

Found Paper: “FDT in an evolu­tion­ary en­vi­ron­ment”

the gears to ascension27 Nov 2023 5:27 UTC
27 points
47 comments1 min readLW link
(arxiv.org)