Up­dat­ing Drexler’s CAIS model

Matthew Barnett16 Jun 2023 22:53 UTC
47 points
32 comments4 min readLW link

Avoid­ing meta­physics means giv­ing bad philos­o­phy a free pass

Aditya16 Jun 2023 20:54 UTC
5 points
9 comments4 min readLW link

Crit­i­cism of Eliezer’s ir­ra­tional moral beliefs

Jorterder16 Jun 2023 20:47 UTC
−17 points
21 comments1 min readLW link

Car­tog­ra­phy, blow­ing one’s mind, the illu­sion of sep­a­ra­tion and other gen­eral musings

Neil 16 Jun 2023 19:19 UTC
0 points
4 comments2 min readLW link

[Repli­ca­tion] Con­jec­ture’s Sparse Cod­ing in Small Transformers

16 Jun 2023 18:02 UTC
52 points
0 comments5 min readLW link

Longevity: Dou­ble Hu­man Lifes­pan in the Next Decade?

Jannik Schg16 Jun 2023 17:51 UTC
1 point
0 comments1 min readLW link

LLMs Some­times Gen­er­ate Purely Nega­tively-Re­in­forced Text

Fabien Roger16 Jun 2023 16:31 UTC
177 points
11 comments7 min readLW link

Palan­tir’s AI models

ChristianKl16 Jun 2023 16:20 UTC
26 points
16 comments1 min readLW link
(www.palantir.com)

[Linkpost] Faith and Fate: Limits of Trans­form­ers on Compositionality

Joe Kwon16 Jun 2023 15:04 UTC
19 points
4 comments1 min readLW link
(arxiv.org)

The ones who endure

Richard_Ngo16 Jun 2023 14:40 UTC
61 points
16 comments5 min readLW link
(www.thinkingcomplete.com)

Con­jec­ture: A stand­ing offer for pub­lic de­bates on AI

Andrea_Miotti16 Jun 2023 14:33 UTC
29 points
1 comment2 min readLW link
(www.conjecture.dev)

Ex­plain­ing “Tak­ing fea­tures out of su­per­po­si­tion with sparse au­toen­coders”

Robert_AIZI16 Jun 2023 13:59 UTC
10 points
0 comments8 min readLW link
(aizi.substack.com)

[Question] How not to write the Cook­book of Doom?

brunoparga16 Jun 2023 13:37 UTC
17 points
5 comments1 min readLW link

Scaf­folded LLMs: Less Ob­vi­ous Concerns

Stephen Fowler16 Jun 2023 10:39 UTC
32 points
13 comments11 min readLW link

Mo­ti­va­tion in AI

nickasaf16 Jun 2023 9:50 UTC
−1 points
1 comment2 min readLW link

DSLT 0. Distill­ing Sin­gu­lar Learn­ing Theory

Liam Carroll16 Jun 2023 9:50 UTC
77 points
6 comments5 min readLW link

DSLT 1. The RLCT Mea­sures the Effec­tive Di­men­sion of Neu­ral Networks

Liam Carroll16 Jun 2023 9:50 UTC
51 points
9 comments13 min readLW link

[Linkpost] Map­ping Brains with Lan­guage Models: A Survey

Bogdan Ionut Cirstea16 Jun 2023 9:49 UTC
5 points
0 comments1 min readLW link

Ra­tional An­i­ma­tions is look­ing for an AI Safety scriptwriter, a lead com­mu­nity man­ager, and other roles.

Writer16 Jun 2023 9:41 UTC
74 points
1 comment3 min readLW link

[Question] Does any­one’s full-time job in­clude read­ing and un­der­stand­ing all the most-promis­ing for­mal AI al­ign­ment work?

Nicholas / Heather Kross16 Jun 2023 2:24 UTC
15 points
2 comments1 min readLW link

Level­ing Up Or Level­ing Off? Un­der­stand­ing The Science Be­hind Skill Plateaus

lynettebye16 Jun 2023 0:18 UTC
45 points
9 comments18 min readLW link

hu­man in­tel­li­gence may be al­ign­ment-limited

bhauth15 Jun 2023 22:32 UTC
16 points
3 comments2 min readLW link

Devel­op­ing a tech­nol­ogy with safety in mind: Les­sons from the Wright Brothers

jasoncrawford15 Jun 2023 21:08 UTC
30 points
4 comments3 min readLW link
(rootsofprogress.org)

AXRP Epi­sode 22 - Shard The­ory with Quintin Pope

DanielFilan15 Jun 2023 19:00 UTC
52 points
11 comments93 min readLW link

Can we ac­cel­er­ate hu­man progress? Moder­ated Con­ver­sa­tion in NYC

Jannik Schg15 Jun 2023 17:33 UTC
1 point
0 comments1 min readLW link

Group Pri­ori­tar­i­anism: Why AI Should Not Re­place Hu­man­ity [draft]

fsh15 Jun 2023 17:33 UTC
8 points
0 comments25 min readLW link

Press the hap­piness but­ton!

Spiarrow15 Jun 2023 17:30 UTC
5 points
3 comments2 min readLW link

AI #16: AI in the UK

Zvi15 Jun 2023 13:20 UTC
46 points
20 comments54 min readLW link
(thezvi.wordpress.com)

I still think it’s very un­likely we’re ob­serv­ing alien aircraft

dynomight15 Jun 2023 13:01 UTC
180 points
70 comments5 min readLW link
(dynomight.net)

Aligned Ob­jec­tives Prize Competition

Prometheus15 Jun 2023 12:42 UTC
8 points
0 comments2 min readLW link
(app.impactmarkets.io)

A more effec­tive Ele­va­tor Pitch for AI risk

Iknownothing15 Jun 2023 12:39 UTC
2 points
0 comments1 min readLW link

Why “AI al­ign­ment” would bet­ter be re­named into “Ar­tifi­cial In­ten­tion re­search”

chaosmage15 Jun 2023 10:32 UTC
29 points
12 comments2 min readLW link

Matt Taibbi’s COVID reporting

ChristianKl15 Jun 2023 9:49 UTC
21 points
34 comments1 min readLW link
(www.racket.news)

Look­ing Back On Ads

jefftk15 Jun 2023 2:10 UTC
30 points
11 comments3 min readLW link
(www.jefftk.com)

Why liber­tar­i­ans are ad­vo­cat­ing for reg­u­la­tion on AI

RobertM14 Jun 2023 20:59 UTC
35 points
13 comments4 min readLW link

In­stru­men­tal Con­ver­gence? [Draft]

J. Dmitri Gallow14 Jun 2023 20:21 UTC
48 points
20 comments33 min readLW link

On the Ap­ple Vi­sion Pro

Zvi14 Jun 2023 17:50 UTC
44 points
17 comments11 min readLW link
(thezvi.wordpress.com)

Progress links and tweets, 2023-06-14

jasoncrawford14 Jun 2023 16:30 UTC
19 points
1 comment2 min readLW link
(rootsofprogress.org)

Philo­soph­i­cal Cy­borg (Part 1)

14 Jun 2023 16:20 UTC
31 points
4 comments13 min readLW link

Is the con­fir­ma­tion bias re­ally a bias?

Lionel14 Jun 2023 14:06 UTC
−2 points
6 comments1 min readLW link
(lionelpage.substack.com)

NA East ACX & Ra­tion­al­ity Meetup Or­ga­niz­ers Retreat

Willa14 Jun 2023 13:39 UTC
8 points
0 comments1 min readLW link

Light­cone In­fras­truc­ture/​LessWrong is look­ing for funding

habryka14 Jun 2023 4:45 UTC
205 points
39 comments1 min readLW link

An­thropic | Chart­ing a Path to AI Accountability

Gabe M14 Jun 2023 4:43 UTC
34 points
2 comments3 min readLW link
(www.anthropic.com)

De­mys­tify­ing Born’s rule

Christopher King14 Jun 2023 3:16 UTC
5 points
26 comments3 min readLW link

My guess for why I was wrong about US housing

romeostevensit14 Jun 2023 0:37 UTC
110 points
13 comments1 min readLW link

Notes from the Bank of England Talk by Gio­vanni Dosi on Agent-based Model­ing for Macroeconomics

PixelatedPenguin13 Jun 2023 22:25 UTC
3 points
0 comments1 min readLW link

In­tro­duc­ing The Long Game Pro­ject: Im­prov­ing De­ci­sion-Mak­ing Through Table­top Ex­er­cises and Si­mu­lated Experience

Dan Stuart13 Jun 2023 21:45 UTC
4 points
0 comments4 min readLW link

In­tel­li­gence al­lo­ca­tion from a Mean Field Game The­ory perspective

Marv K13 Jun 2023 19:52 UTC
13 points
2 comments2 min readLW link

Mul­ti­ple stages of fal­lacy—jus­tifi­ca­tions and non-jus­tifi­ca­tions for the mul­ti­ple stage fallacy

AronT13 Jun 2023 17:37 UTC
33 points
2 comments5 min readLW link
(coordinationishard.substack.com)

TryCon­tra Events

jefftk13 Jun 2023 17:30 UTC
2 points
0 comments1 min readLW link
(www.jefftk.com)