Bet­ting and forecasting

CarlJ9 Sep 2023 20:03 UTC
2 points
0 comments1 min readLW link

AI pres­i­dents dis­cuss AI al­ign­ment agendas

9 Sep 2023 18:55 UTC
217 points
23 comments1 min readLW link
(www.youtube.com)

Prob­a­bil­is­tic ar­gu­ment re­la­tion­ships and an in­vi­ta­tion to the ar­gu­ment map­ping community

lunatic_at_large9 Sep 2023 18:45 UTC
13 points
4 comments10 min readLW link

How teams went about their re­search at AI Safety Camp edi­tion 8

9 Sep 2023 16:34 UTC
28 points
0 comments13 min readLW link

Panel dis­cus­sion on AI con­scious­ness with Rob Long and Jeff Sebo

Aaron Bergman9 Sep 2023 3:38 UTC
10 points
0 comments1 min readLW link
(www.youtube.com)

Pos­si­ble Diver­gence in AGI Risk Tol­er­ance be­tween Selfish and Altru­is­tic agents

Brad West 9 Sep 2023 0:23 UTC
1 point
1 comment2 min readLW link

Cap­ture the Flag Mechanis­tic In­ter­pretabil­ity Challenges

8 Sep 2023 23:00 UTC
24 points
0 comments7 min readLW link

[Question] What is to be done? (About the profit mo­tive)

Connor Barber8 Sep 2023 19:27 UTC
1 point
21 comments1 min readLW link

What is the op­ti­mal fron­tier for due dili­gence?

8 Sep 2023 18:20 UTC
41 points
1 comment1 min readLW link

Progress links di­gest, 2023-09-08: The Con­ser­va­tive Fu­tur­ist, cargo air­ships, and more

jasoncrawford8 Sep 2023 17:48 UTC
14 points
7 comments5 min readLW link
(rootsofprogress.org)

The AI apoc­a­lypse myth.

Spiritus Dei8 Sep 2023 17:43 UTC
−22 points
12 comments2 min readLW link

Sum-thresh­old attacks

TsviBT8 Sep 2023 17:13 UTC
237 points
55 comments10 min readLW link
(tsvibt.blogspot.com)

De­bate se­ries: should we push for a pause on the de­vel­op­ment of AI?

Xodarap8 Sep 2023 16:29 UTC
38 points
1 comment1 min readLW link

AI Prob­a­bil­ity Trees—Joe Car­l­smith (2022)

Nathan Young8 Sep 2023 15:40 UTC
12 points
1 comment8 min readLW link

In­vad­ing Aus­tralia (End­less Former­lies Most Beau­tiful, or What I Learned On My Holi­day)

Oliver Sourbut8 Sep 2023 15:33 UTC
12 points
1 comment8 min readLW link
(www.oliversourbut.net)

Ex­plain­ing grokking through cir­cuit efficiency

8 Sep 2023 14:39 UTC
101 points
11 comments3 min readLW link
(arxiv.org)

Have At­ten­tion Spans Been De­clin­ing?

niplav8 Sep 2023 14:11 UTC
71 points
22 comments17 min readLW link1 review

Ex­plained Sim­ply: Quantilizers

brook8 Sep 2023 12:54 UTC
15 points
5 comments1 min readLW link
(aisafetyexplained.substack.com)

Cross­ing the Ru­bi­con.

Spiritus Dei8 Sep 2023 4:19 UTC
−4 points
5 comments13 min readLW link

[Question] What EY and LessWrong meant when (fill in the blank) found them.

Bill Benzon8 Sep 2023 1:42 UTC
1 point
0 comments1 min readLW link

Bring back the Colosseums

lc8 Sep 2023 0:09 UTC
18 points
28 comments1 min readLW link

The Löbian Ob­sta­cle, And Why You Should Care

lukemarks7 Sep 2023 23:59 UTC
18 points
6 comments2 min readLW link

Science to Be Done In­ter­na­tion­ally Us­ing Blockchain

Victor Porton7 Sep 2023 23:29 UTC
−18 points
0 comments2 min readLW link
(science-dao.org)

A quick up­date from Nonlinear

KatWoods7 Sep 2023 21:28 UTC
72 points
23 comments2 min readLW link

[Linkpost] Fron­tier AI Task­force: first progress report

Paul Colognese7 Sep 2023 19:06 UTC
21 points
0 comments4 min readLW link
(www.gov.uk)

[Question] How did you make your way back from meta?

matto7 Sep 2023 17:23 UTC
23 points
27 comments1 min readLW link

AI#28: Watch­ing and Waiting

Zvi7 Sep 2023 17:20 UTC
52 points
14 comments45 min readLW link
(thezvi.wordpress.com)

[Question] Mea­sure of com­plex­ity al­lowed by the laws of the uni­verse and rel­a­tive the­ory?

dr_s7 Sep 2023 12:21 UTC
8 points
22 comments1 min readLW link

Re­cre­at­ing the car­ing drive

Catnee7 Sep 2023 10:41 UTC
43 points
15 comments10 min readLW link1 review

Shar­ing In­for­ma­tion About Nonlinear

Ben Pace7 Sep 2023 6:51 UTC
322 points
323 comments34 min readLW link

Weekly In­ci­dence vs Cu­mu­la­tive Infections

jefftk7 Sep 2023 2:30 UTC
13 points
6 comments1 min readLW link
(www.jefftk.com)

Im­prov­ing Math­e­mat­i­cal Ac­cu­racy in LLMs—New Monthly Up­dates Series − 1

Abhay Chowdhry7 Sep 2023 1:58 UTC
5 points
1 comment9 min readLW link

Break­ing RLHF “Safety” (And how to fix it?)

MPotter7 Sep 2023 1:58 UTC
3 points
0 comments4 min readLW link

Feed­back-loops, De­liber­ate Prac­tice, and Trans­fer Learning

7 Sep 2023 1:57 UTC
46 points
5 comments1 min readLW link

Video es­say: How Will We Know When AI is Con­scious?

JanPro6 Sep 2023 18:10 UTC
11 points
7 comments1 min readLW link
(www.youtube.com)

My First Post

Jaivardhan Nawani6 Sep 2023 17:42 UTC
35 points
9 comments1 min readLW link

Ac­tAdd: Steer­ing Lan­guage Models with­out Optimization

6 Sep 2023 17:21 UTC
105 points
3 comments2 min readLW link
(arxiv.org)

Monthly Roundup #10: Septem­ber 2023

Zvi6 Sep 2023 13:20 UTC
35 points
4 comments56 min readLW link
(thezvi.wordpress.com)

Find Hot French Food Near Me: A Fol­low-up

aphyer6 Sep 2023 12:32 UTC
75 points
19 comments2 min readLW link

Man­i­fest 2023

6 Sep 2023 11:24 UTC
3 points
0 comments1 min readLW link

Last Chance: Get tick­ets to Man­i­fest 2023! (Sep 22-24 in Berkeley)

6 Sep 2023 10:35 UTC
5 points
0 comments1 min readLW link

What I’ve been read­ing, Septem­ber 2023

jasoncrawford6 Sep 2023 9:32 UTC
17 points
0 comments5 min readLW link
(rootsofprogress.org)

De­ci­sion The­ory: A (Nor­ma­tive) Introduction

Pareto Optimal6 Sep 2023 8:22 UTC
−1 points
1 comment3 min readLW link
(paretooptimal.substack.com)

[Question] What’s the eas­iest way to make a lu­mi­na­tor?

kuira6 Sep 2023 0:07 UTC
7 points
13 comments1 min readLW link

Or­di­nary claims re­quire or­di­nary evidence

blake80865 Sep 2023 22:09 UTC
1 point
3 comments2 min readLW link

Con­ver­sa­tion about paradigms, in­tel­lec­tual progress, so­cial con­sen­sus, and AI

5 Sep 2023 21:30 UTC
14 points
6 comments1 min readLW link

What I would do if I wasn’t at ARC Evals

LawrenceC5 Sep 2023 19:19 UTC
220 points
10 comments13 min readLW link1 review

The Evolu­tion­ary Path­way from Biolog­i­cal to Digi­tal In­tel­li­gence: A Cos­mic Perspective

George3605 Sep 2023 17:47 UTC
−17 points
0 comments4 min readLW link

The Illu­sion of Univer­sal Mo­ral­ity: A Dy­namic Per­spec­tive on Ge­netic Fit­ness and Eth­i­cal Complexity

George3605 Sep 2023 17:47 UTC
−9 points
7 comments2 min readLW link

Bench­marks for De­tect­ing Mea­sure­ment Tam­per­ing [Red­wood Re­search]

5 Sep 2023 16:44 UTC
86 points
21 comments20 min readLW link
(arxiv.org)