Time com­plex­ity for de­ter­minis­tic string machines

alcatal21 Apr 2024 22:35 UTC
21 points
0 comments21 min readLW link

Trans­fer Learn­ing in Humans

niplav21 Apr 2024 20:49 UTC
57 points
1 comment13 min readLW link

I cre­ated an Asi Align­ment Tier List

TimeGoat21 Apr 2024 18:44 UTC
−6 points
0 comments1 min readLW link

The los­ing iden­tity of Twitter

Itay Dreyfus21 Apr 2024 13:43 UTC
20 points
1 comment12 min readLW link
(productidentity.co)

Good Bings copy, great Bings steal

dr_s21 Apr 2024 9:52 UTC
31 points
6 comments9 min readLW link

Paper: “The Ethics of Ad­vanced AI As­sis­tants” -Google DeepMind

Tristan Wegner21 Apr 2024 6:45 UTC
20 points
0 comments1 min readLW link
(storage.googleapis.com)

Con­tra Chord Simplification

jefftk21 Apr 2024 2:30 UTC
9 points
0 comments1 min readLW link
(www.jefftk.com)

A cou­ple pro­duc­tivity tips for overthinkers

Steven Byrnes20 Apr 2024 16:05 UTC
78 points
13 comments4 min readLW link

“You’re the most beau­tiful girl in the world” and Wittgen­stei­nian Lan­guage Games

Chris_Leong20 Apr 2024 14:54 UTC
5 points
18 comments1 min readLW link

Past Tense Features

Can20 Apr 2024 14:34 UTC
12 points
0 comments4 min readLW link

Thoughts on seed oil

dynomight20 Apr 2024 12:29 UTC
347 points
129 comments17 min readLW link
(dynomight.net)

How to know whether you are an ideal­ist or a phys­i­cal­ist/​materialist

JackOfAllTrades20 Apr 2024 11:53 UTC
−3 points
2 comments1 min readLW link

How I Think, Part Four: Money is Weird

Richard Henage20 Apr 2024 6:21 UTC
0 points
3 comments5 min readLW link

The power of finite and the weak­ness of in­finite bi­nary point numbers

AxiomWriter20 Apr 2024 6:03 UTC
−3 points
6 comments2 min readLW link

WISDOMISM A Mo­ral The­ory for the Age of Information

Peter lawless 19 Apr 2024 23:06 UTC
2 points
0 comments9 min readLW link

In­duc­ing Un­prompted Misal­ign­ment in LLMs

19 Apr 2024 20:00 UTC
38 points
7 comments16 min readLW link

Introspection

A*19 Apr 2024 19:10 UTC
7 points
0 comments1 min readLW link

[Full Post] Progress Up­date #1 from the GDM Mech In­terp Team

19 Apr 2024 19:06 UTC
77 points
10 comments8 min readLW link

[Sum­mary] Progress Up­date #1 from the GDM Mech In­terp Team

19 Apr 2024 19:06 UTC
72 points
0 comments3 min readLW link

Daniel Den­nett has died (1942-2024)

kave19 Apr 2024 16:17 UTC
150 points
5 comments1 min readLW link
(dailynous.com)

Events Book­ing New Callers?

jefftk19 Apr 2024 15:50 UTC
9 points
0 comments1 min readLW link
(www.jefftk.com)

[Question] What is the best way to talk about prob­a­bil­ities you ex­pect to change with ev­i­dence/​ex­per­i­ments?

Will_Pearson19 Apr 2024 15:35 UTC
14 points
11 comments1 min readLW link

CTMU in­sight: maybe con­scious­ness *can* af­fect quan­tum out­comes?

zhukeepa19 Apr 2024 15:23 UTC
13 points
11 comments5 min readLW link

De­mon­strate and eval­u­ate risks from AI to so­ciety at the AI x Democ­racy re­search hackathon

Esben Kran19 Apr 2024 14:46 UTC
5 points
0 comments1 min readLW link
(www.apartresearch.com)

[Question] How to Model the Fu­ture of Open-Source LLMs?

Joel Burget19 Apr 2024 14:28 UTC
25 points
9 comments1 min readLW link

What’s up with all the non-Mor­mons? Weirdly spe­cific uni­ver­sal­ities across LLMs

mwatkins19 Apr 2024 13:43 UTC
40 points
13 comments27 min readLW link

[Question] If digi­tal goods in vir­tual wor­lds in­crease GDP, do we ac­tu­ally be­come richer?

No77e19 Apr 2024 10:06 UTC
6 points
10 comments1 min readLW link

Ex­per­i­ment on re­peat­ing choices

KatjaGrace19 Apr 2024 4:20 UTC
56 points
1 comment3 min readLW link
(worldspiritsockpuppet.com)

Effec­tive Altru­ists and Ra­tion­al­ists Views & The case for us­ing mar­ket­ing to high­light AI risks.

gilch19 Apr 2024 4:16 UTC
6 points
1 comment1 min readLW link
(youtu.be)

Co­he­sion and busi­ness problems

Adam Zerner19 Apr 2024 0:45 UTC
12 points
8 comments4 min readLW link

The Ther­mo­dy­nam­ics of Death

Peter lawless 19 Apr 2024 0:36 UTC
6 points
0 comments10 min readLW link

Back­yard Office

jefftk19 Apr 2024 0:31 UTC
13 points
0 comments1 min readLW link
(www.jefftk.com)

hy­dro­gen tube transport

bhauth18 Apr 2024 22:47 UTC
34 points
12 comments5 min readLW link
(www.bhauth.com)

LessOn­line Fes­ti­val Up­dates Thread

Ben Pace18 Apr 2024 21:55 UTC
59 points
26 comments1 min readLW link

A Re­view of In-Con­text Learn­ing Hy­pothe­ses for Au­to­mated AI Align­ment Research

alamerton18 Apr 2024 18:29 UTC
25 points
4 comments16 min readLW link

I’m open for pro­jects (sort of)

cousin_it18 Apr 2024 18:05 UTC
46 points
13 comments1 min readLW link

Blessed in­for­ma­tion, garbage in­for­ma­tion, cursed information

tailcalled18 Apr 2024 16:56 UTC
23 points
8 comments3 min readLW link

[Fic­tion] A Confession

Arjun Panickssery18 Apr 2024 16:28 UTC
38 points
2 comments5 min readLW link
(arjunpanickssery.substack.com)

Discrim­i­nat­ing Be­hav­iorally Iden­ti­cal Clas­sifiers: a model prob­lem for ap­ply­ing in­ter­pretabil­ity to scal­able oversight

Sam Marks18 Apr 2024 16:17 UTC
107 points
10 comments12 min readLW link

Co­op­er­a­tion is op­ti­mal, with weaker agents too  -  tldr

Ryo 18 Apr 2024 15:03 UTC
12 points
22 comments4 min readLW link
(medium.com)

How to co­or­di­nate de­spite our bi­ases? - tldr

Ryo 18 Apr 2024 15:03 UTC
3 points
2 comments3 min readLW link
(medium.com)

Knowl­edge Base 7: Long-tail knowl­edge and col­lec­tive intelligence

iwis18 Apr 2024 14:21 UTC
−6 points
0 comments1 min readLW link

AI #60: Oh the Humanity

Zvi18 Apr 2024 14:10 UTC
44 points
7 comments62 min readLW link
(thezvi.wordpress.com)

UDT1.01: Log­i­cal In­duc­tors and Im­plicit Beliefs (5/​10)

Diffractor18 Apr 2024 8:39 UTC
33 points
2 comments19 min readLW link

An ex­am­i­na­tion of GPT-2′s bor­ing yet effec­tive glitch

MiguelDev18 Apr 2024 5:26 UTC
5 points
3 comments3 min readLW link

[Question] What if Ethics is Prov­ably Self-Con­tra­dic­tory?

Yitz18 Apr 2024 5:12 UTC
3 points
7 comments2 min readLW link

The Mom Test: Sum­mary and Thoughts

Adam Zerner18 Apr 2024 3:34 UTC
48 points
3 comments10 min readLW link

Ex­press in­ter­est in an “FHI of the West”

habryka18 Apr 2024 3:32 UTC
268 points
41 comments3 min readLW link

Why Would Belief-States Have A Frac­tal Struc­ture, And Why Would That Mat­ter For In­ter­pretabil­ity? An Explainer

18 Apr 2024 0:27 UTC
184 points
21 comments7 min readLW link

AXRP Epi­sode 28 - Su­ing Labs for AI Risk with Gabriel Weil

DanielFilan17 Apr 2024 21:42 UTC
12 points
0 comments65 min readLW link