My cover story in Ja­cobin on AI cap­i­tal­ism and the x-risk debates

garrison12 Feb 2024 23:34 UTC
98 points
5 comments1 min readLW link
(jacobin.com)

What is On­tol­ogy?

martinkunev12 Feb 2024 23:01 UTC
4 points
0 comments4 min readLW link

Thank you for trig­ger­ing me

Cissy12 Feb 2024 20:09 UTC
5 points
1 comment6 min readLW link
(www.moremyself.xyz)

In­ter­pret­ing Quan­tum Me­chan­ics in In­fra-Bayesian Physicalism

Yegreg12 Feb 2024 18:56 UTC
30 points
6 comments43 min readLW link

I played the AI box game as the Gate­keeper — and lost

datawitch12 Feb 2024 18:39 UTC
30 points
53 comments4 min readLW link

The Last Laugh: Ex­plor­ing the Role of Hu­mor as a Bench­mark for Large Lan­guage Models

Greg Robison12 Feb 2024 18:34 UTC
4 points
6 comments11 min readLW link

Nat­u­ral ab­strac­tions are ob­server-de­pen­dent: a con­ver­sa­tion with John Wentworth

Martín Soto12 Feb 2024 17:28 UTC
39 points
13 comments7 min readLW link

Tort Law Can Play an Im­por­tant Role in Miti­gat­ing AI Risk

Gabriel Weil12 Feb 2024 17:17 UTC
38 points
9 comments5 min readLW link

On the Pro­posed Cal­ifor­nia SB 1047

Zvi12 Feb 2024 16:40 UTC
46 points
18 comments12 min readLW link
(thezvi.wordpress.com)

Thoughts on “The Offense-Defense Balance Rarely Changes”

Cullen12 Feb 2024 3:26 UTC
46 points
4 comments1 min readLW link

Skep­ti­cism About Deep­Mind’s “Grand­mas­ter-Level” Chess Without Search

Arjun Panickssery12 Feb 2024 0:56 UTC
55 points
13 comments3 min readLW link

[Question] What are the known difficul­ties with this al­ign­ment ap­proach?

tailcalled11 Feb 2024 22:52 UTC
18 points
24 comments1 min readLW link

[Question] What are the de­cid­ing fac­tors of hu­man cog­ni­tive en­durance?

koratkar11 Feb 2024 21:56 UTC
22 points
3 comments1 min readLW link

Carl Shul­man On Dwarkesh Pod­cast June 2023

Moonicker11 Feb 2024 21:02 UTC
18 points
0 comments159 min readLW link

How do you ac­tu­ally ob­tain and re­port a like­li­hood func­tion for sci­en­tific re­search?

Peter Berggren11 Feb 2024 17:42 UTC
55 points
4 comments1 min readLW link

The en­tropy maxim for bi­nary questions

dkl911 Feb 2024 17:17 UTC
2 points
1 comment1 min readLW link
(dkl9.net)

GPT2XL_RLLMv3 vs. Bet­terDAN, AI Machi­avelli & Oppo Jailbreaks

MiguelDev11 Feb 2024 11:03 UTC
16 points
4 comments14 min readLW link

[Question] What’s the the­ory of im­pact for ac­ti­va­tion vec­tors?

Chris_Leong11 Feb 2024 7:34 UTC
57 points
12 comments1 min readLW link

Ex­per­i­ment­ing With Foot­board Piezos

jefftk11 Feb 2024 3:00 UTC
11 points
2 comments2 min readLW link
(www.jefftk.com)

The Core Values of Life—A pro­posal for a uni­ver­sal the­ory of ethics

Thomas Gjøstøl10 Feb 2024 21:48 UTC
2 points
4 comments18 min readLW link

And All the Shog­goths Merely Players

Zack_M_Davis10 Feb 2024 19:56 UTC
160 points
57 comments12 min readLW link

Sam Alt­man’s Chip Am­bi­tions Un­der­cut OpenAI’s Safety Strategy

garrison10 Feb 2024 19:52 UTC
198 points
52 comments1 min readLW link
(garrisonlovely.substack.com)

The lat­tice of par­tial updatelessness

Martín Soto10 Feb 2024 17:34 UTC
21 points
5 comments5 min readLW link

A Strange ACH Corner Case

jefftk10 Feb 2024 3:00 UTC
27 points
2 comments2 min readLW link
(www.jefftk.com)

Dreams of AI al­ign­ment: The dan­ger of sug­ges­tive names

TurnTrout10 Feb 2024 1:22 UTC
103 points
59 comments4 min readLW link

Sce­nario plan­ning for AI x-risk

Corin Katzke10 Feb 2024 0:14 UTC
24 points
12 comments14 min readLW link
(forum.effectivealtruism.org)

Close the Gates to an In­hu­man Fu­ture: How and why we should choose to not de­velop su­per­hu­man gen­eral-pur­pose ar­tifi­cial intelligence

aaguirre9 Feb 2024 20:25 UTC
13 points
0 comments1 min readLW link
(arxiv.org)

[Cross­post] Deep Dive: The Com­ing Tech­nolog­i­cal Sin­gu­lar­ity—How to sur­vive in a Post-hu­man Era

Suzie. EXE9 Feb 2024 18:49 UTC
2 points
2 comments9 min readLW link

The Ideal Speech Si­tu­a­tion as a Tool for AI Eth­i­cal Reflec­tion: A Frame­work for Alignment

kenneth myers9 Feb 2024 18:40 UTC
6 points
12 comments3 min readLW link

What’s ChatGPT’s Fa­vorite Ice Cream Fla­vor? An In­ves­ti­ga­tion Into Syn­thetic Respondents

Greg Robison9 Feb 2024 18:38 UTC
19 points
4 comments15 min readLW link

OpenAI wants to raise 5-7 trillion

O O9 Feb 2024 16:15 UTC
13 points
29 comments1 min readLW link
(decrypt.co)

[Question] Con­stituency-sized AI congress?

Nathan Helm-Burger9 Feb 2024 16:01 UTC
11 points
5 comments1 min readLW link

One True Love

Zvi9 Feb 2024 15:10 UTC
33 points
7 comments10 min readLW link
(thezvi.wordpress.com)

[Question] Ex­ec­u­tive func­tion ad­vice from peo­ple who are good at it?

TeaTieAndHat9 Feb 2024 10:11 UTC
7 points
1 comment1 min readLW link

[Question] Do you want to make an AI Align­ment song?

Kabir Kumar9 Feb 2024 8:22 UTC
4 points
0 comments1 min readLW link

Skills I’d like my col­lab­o­ra­tors to have

Raemon9 Feb 2024 8:20 UTC
106 points
9 comments8 min readLW link

Trans­fer learn­ing and gen­er­al­iza­tion-qua-ca­pa­bil­ity in Bab­bage and Davinci (or, why di­vi­sion is bet­ter than Span­ish)

RP and agg
9 Feb 2024 7:00 UTC
50 points
6 comments3 min readLW link

Bi­den-Har­ris Ad­minis­tra­tion An­nounces First-Ever Con­sor­tium Ded­i­cated to AI Safety

Ben Smith9 Feb 2024 6:40 UTC
22 points
0 comments1 min readLW link
(www.nist.gov)

Run­ning the Num­bers on a Heat Pump

jefftk9 Feb 2024 3:00 UTC
30 points
12 comments4 min readLW link
(www.jefftk.com)

[Question] How do high-trust so­cieties form?

Shankar Sivarajan9 Feb 2024 1:11 UTC
22 points
17 comments1 min readLW link

[Question] How do health sys­tems work in ad­e­quate wor­lds?

mukashi9 Feb 2024 0:54 UTC
10 points
2 comments1 min readLW link

Twin Cities ACX Meetup—Fe­bru­ary 2024

Timothy M.8 Feb 2024 23:26 UTC
1 point
2 comments1 min readLW link

A re­view of “Don’t for­get the bound­ary prob­lem...”

jessicata8 Feb 2024 23:19 UTC
12 points
1 comment12 min readLW link
(unstablerontology.substack.com)

ain­telope pro­ject update

Gunnar_Zarncke8 Feb 2024 18:32 UTC
24 points
2 comments3 min readLW link

Up­date­less­ness doesn’t solve most problems

Martín Soto8 Feb 2024 17:30 UTC
130 points
44 comments12 min readLW link

Pre­dict­ing Align­ment Award Win­ners Us­ing ChatGPT 4

Shoshannah Tekofsky8 Feb 2024 14:38 UTC
16 points
2 comments11 min readLW link

AI #50: The Most Danger­ous Thing

Zvi8 Feb 2024 14:30 UTC
53 points
4 comments24 min readLW link
(thezvi.wordpress.com)

How to de­velop a pho­to­graphic mem­ory 3/​3

PhilosophicalSoul8 Feb 2024 9:22 UTC
6 points
2 comments18 min readLW link

Believ­ing In

AnnaSalamon8 Feb 2024 7:06 UTC
230 points
51 comments13 min readLW link

Mea­sur­ing pre-peer-re­view epistemic status

Jakub Smékal8 Feb 2024 5:09 UTC
1 point
0 comments2 min readLW link