D&D.Sci: Whom Shall You Call?

abstractapplic · 5 Jul 2024 20:53 UTC
38 points
6 comments · 2 min read · LW link

[Interim research report] Activation plateaus & sensitive directions in GPT2

5 Jul 2024 17:05 UTC
65 points
2 comments · 5 min read · LW link

Minimalist And Maximalist Type Systems

adamShimi · 5 Jul 2024 16:25 UTC
17 points
6 comments · 3 min read · LW link
(epistemologicalfascinations.substack.com)

ML4Good Summer Bootcamps—Applications Open [deadline extended]

YM · 5 Jul 2024 13:59 UTC
12 points
0 comments · 1 min read · LW link

[Question] Are there any plans to launch a paperback version of “Rationality: From AI to Zombies”?

m_arj · 5 Jul 2024 11:14 UTC
2 points
1 comment · 1 min read · LW link

Doomsday Argument and the False Dilemma of Anthropic Reasoning

Ape in the coat · 5 Jul 2024 5:38 UTC
37 points
55 comments · 7 min read · LW link

Finding the Wisdom to Build Safe AI

Gordon Seidoh Worley · 4 Jul 2024 19:04 UTC
36 points
10 comments · 9 min read · LW link

Libs vs Frameworks, Middle-Level Regularities vs Theories

adamShimi · 4 Jul 2024 19:01 UTC
23 points
0 comments · 2 min read · LW link
(epistemologicalfascinations.substack.com)

The Potential Impossibility of Subjective Death

VictorLJZ · 4 Jul 2024 18:17 UTC
2 points
34 comments · 1 min read · LW link

Consider the humble rock (or: why the dumb thing kills you)

pleiotroth · 4 Jul 2024 13:54 UTC
62 points
11 comments · 4 min read · LW link

AI #71: Farewell to Chevron

Zvi · 4 Jul 2024 13:40 UTC
53 points
9 comments · 36 min read · LW link
(thezvi.wordpress.com)

The Dumbification of our smart screens

Itay Dreyfus · 4 Jul 2024 6:32 UTC
18 points
0 comments · 5 min read · LW link
(productidentity.co)

Introduction to French AI Policy

Lucie Philippon · 4 Jul 2024 3:39 UTC
110 points
12 comments · 6 min read · LW link

How predictive processing solved my wrist pain

max_shen · 4 Jul 2024 1:56 UTC
35 points
8 comments · 8 min read · LW link

80,000 hours should remove OpenAI from the Job Board (and similar EA orgs should do similarly)

Raemon · 3 Jul 2024 20:34 UTC
274 points
71 comments · 1 min read · LW link

Notes on Tuning Metacognition

JoNeedsSleep · 3 Jul 2024 19:54 UTC
8 points
0 comments · 5 min read · LW link

When Are Results from Computational Complexity Not Too Coarse?

Dalcy · 3 Jul 2024 19:06 UTC
41 points
8 comments · 3 min read · LW link

Musings on LLM Scale (Jul 2024)

Vladimir_Nesov · 3 Jul 2024 18:35 UTC
34 points
0 comments · 3 min read · LW link

Static Analysis As A Lifestyle

adamShimi · 3 Jul 2024 18:29 UTC
65 points
11 comments · 3 min read · LW link
(epistemologicalfascinations.substack.com)

AI development is an act of social revolution

artemiocobb · 3 Jul 2024 18:00 UTC
3 points
0 comments · 3 min read · LW link

[Question] What percent of the sun would a Dyson Sphere cover?

Raemon · 3 Jul 2024 17:27 UTC
24 points
26 comments · 1 min read · LW link

[Question] Isomorphisms don’t preserve subjective experience… right?

notfnofn · 3 Jul 2024 14:22 UTC
5 points
26 comments · 1 min read · LW link

3C’s: A Recipe For Mathing Concepts

3 Jul 2024 1:06 UTC
81 points
5 comments · 7 min read · LW link

Announcing the AI Forecasting Benchmark Series | July 8, $120k in Prizes

ChristianWilliams · 2 Jul 2024 22:33 UTC
15 points
0 comments · 1 min read · LW link
(www.metaculus.com)

Open Sourcing Metaculus

ChristianWilliams · 2 Jul 2024 22:30 UTC
44 points
0 comments · 1 min read · LW link
(www.metaculus.com)

[Question] Why Can’t Sub-AGI Solve AI Alignment? Or: Why Would Sub-AGI AI Not be Aligned?

MrThink · 2 Jul 2024 20:13 UTC
4 points
23 comments · 1 min read · LW link

[Question] Why haven’t there been assassination attempts against high profile AI accelerationists like sam altman yet?

louisTrem · 2 Jul 2024 18:16 UTC
−13 points
4 comments · 2 min read · LW link

How ARENA course material gets made

CallumMcDougall · 2 Jul 2024 18:04 UTC
41 points
2 comments · 7 min read · LW link

An AI Race With China Can Be Better Than Not Racing

niplav · 2 Jul 2024 17:57 UTC
69 points
33 comments · 11 min read · LW link

List of Collective Intelligence Projects

Chipmonk · 2 Jul 2024 14:10 UTC
40 points
9 comments · 2 min read · LW link
(chrislakin.blog)

Decomposing the QK circuit with Bilinear Sparse Dictionary Learning

2 Jul 2024 13:17 UTC
86 points
7 comments · 12 min read · LW link

Economics Roundup #2

Zvi · 2 Jul 2024 12:40 UTC
35 points
5 comments · 23 min read · LW link
(thezvi.wordpress.com)

How Congressional Offices Process Constituent Communication

Tristan Williams · 2 Jul 2024 12:38 UTC
24 points
0 comments · 1 min read · LW link

OthelloGPT learned a bag of heuristics

2 Jul 2024 9:12 UTC
109 points
10 comments · 9 min read · LW link

Blueprint for a Brighter Future

Alex Beyman · 2 Jul 2024 6:15 UTC
−1 points
0 comments · 5 min read · LW link

Covert Malicious Finetuning

2 Jul 2024 2:41 UTC
89 points
4 comments · 3 min read · LW link

Interpreting Preference Models w/ Sparse Autoencoders

1 Jul 2024 21:35 UTC
74 points
12 comments · 9 min read · LW link

Honest science is spirituality

pchvykov · 1 Jul 2024 20:33 UTC
−1 points
10 comments · 4 min read · LW link

New Executive Team & Board — PIBBSS

Nora_Ammann · 1 Jul 2024 19:30 UTC
43 points
1 comment · 1 min read · LW link

Uncursing Civilization

Lorec · 1 Jul 2024 18:44 UTC
−6 points
2 comments · 5 min read · LW link

[Question] Self-censoring on AI x-risk discussions?

Decaeneus · 1 Jul 2024 18:24 UTC
17 points
2 comments · 1 min read · LW link

Rationalists As People Who Build Piles Of Rocks

Sable · 1 Jul 2024 10:32 UTC
9 points
0 comments · 5 min read · LW link
(affablyevil.substack.com)

How good are LLMs at doing ML on an unknown dataset?

Håvard Tveit Ihle · 1 Jul 2024 9:04 UTC
33 points
4 comments · 13 min read · LW link

Whirlwind Tour of Chain of Thought Literature Relevant to Automating Alignment Research.

sevdeawesome · 1 Jul 2024 5:50 UTC
25 points
0 comments · 17 min read · LW link

Probabilistic Logic ⇔ Oracles?

Yudhister Kumar · 1 Jul 2024 5:36 UTC
15 points
0 comments · 4 min read · LW link

Important open problems in voting

Closed Limelike Curves · 1 Jul 2024 2:53 UTC
33 points
1 comment · 1 min read · LW link

Anti-Circumcision Essay 3 of 3: Now That I Think About It, Is There Actually a Space Between “Info” and “Hazard”? Isn’t It Just One Word?

Harry Stevenage · 1 Jul 2024 2:21 UTC
12 points
0 comments · 7 min read · LW link

In Defense of Lawyers Playing Their Part

Isaac King · 1 Jul 2024 1:32 UTC
32 points
9 comments · 9 min read · LW link

Anti-circumcision Essay 2 of 3: Physical and Psychological Realities

Harry Stevenage · 30 Jun 2024 22:13 UTC
12 points
5 comments · 9 min read · LW link

Review of METR’s public evaluation protocol

30 Jun 2024 22:03 UTC
10 points
0 comments · 5 min read · LW link