Metaculus’s New Sidebar Helps You Find Forecasts Faster

ChristianWilliams 8 Nov 2023 20:56 UTC
15 points
0 comments · 1 min read · LW link
(www.metaculus.com)

Open-ended ethics of phenomena (a desiderata with universal morality)

Ryo 8 Nov 2023 20:10 UTC
1 point
0 comments · 8 min read · LW link

Deconfusing “ontology” in AI alignment

Dylan Bowman 8 Nov 2023 20:03 UTC
28 points
3 comments · 7 min read · LW link

Open Agency model can solve the AI regulation dilemma

Roman Leventov 8 Nov 2023 20:00 UTC
22 points
1 comment · 2 min read · LW link

Gothenburg LW / ACX meetup

Stefan 8 Nov 2023 19:52 UTC
1 point
0 comments · 1 min read · LW link

[Question] Why is lesswrong blocking wget and curl (scrape)?

nick lacombe 8 Nov 2023 19:42 UTC
21 points
12 comments · 1 min read · LW link

[Question] Is there a lesswrong archive of all public posts?

nick lacombe 8 Nov 2023 19:26 UTC
12 points
7 comments · 1 min read · LW link

Five projects from AI Safety Hub Labs 2023

charlie_griffin 8 Nov 2023 19:19 UTC
47 points
1 comment · 6 min read · LW link
(www.aisafetyhub.org)

[Question] Can a stupid person become intelligent?

A. T. 8 Nov 2023 19:01 UTC
12 points
24 comments · 2 min read · LW link

Prosthetic Intelligence

Krantz 8 Nov 2023 19:01 UTC
4 points
9 comments · 2 min read · LW link

[Question] Do you have a satisfactory workflow for learning about a line of research using GPT4, Claude, etc?

ryan_b 8 Nov 2023 18:05 UTC
9 points
3 comments · 1 min read · LW link

What’s going on? LLMs and IS-A sentences

Bill Benzon 8 Nov 2023 16:58 UTC
6 points
15 comments · 4 min read · LW link

[Question] What will happen with real estate prices during a slow takeoff?

Ricardo Meneghin 8 Nov 2023 11:58 UTC
8 points
1 comment · 1 min read · LW link

Tall Tales at Different Scales: Evaluating Scaling Trends For Deception In Language Models

8 Nov 2023 11:37 UTC
49 points
0 comments · 18 min read · LW link

How well does your research adress the theory-practice gap?

Jonas Hallgren 8 Nov 2023 11:27 UTC
18 points
0 comments · 10 min read · LW link

Growth and Form in a Toy Model of Superposition

8 Nov 2023 11:08 UTC
89 points
7 comments · 14 min read · LW link

Running your own workshop on handling hostile disagreements

Camille Berger 8 Nov 2023 10:28 UTC
12 points
1 comment · 7 min read · LW link

Thinking By The Clock

Screwtape 8 Nov 2023 7:40 UTC
185 points
27 comments · 8 min read · LW link

[Question] Impressions from base-GPT-4?

mishka 8 Nov 2023 5:43 UTC
25 points
25 comments · 1 min read · LW link

Quantopian contest, but for food intake and weight

Lucent 8 Nov 2023 5:41 UTC
40 points
9 comments · 3 min read · LW link

How I Think, Part Two: Distrusting Individuals

Richard Henage 8 Nov 2023 4:06 UTC
4 points
6 comments · 3 min read · LW link

How I Think, Part One: Investing in Fun

Richard Henage 8 Nov 2023 4:00 UTC
5 points
2 comments · 5 min read · LW link

Concrete positive visions for a future without AGI

Max H 8 Nov 2023 3:12 UTC
41 points
28 comments · 8 min read · LW link

South Bay ACX/LW/EA Meetup & Vegansgiving Potluck

IS 8 Nov 2023 2:30 UTC
10 points
0 comments · 1 min read · LW link

Progress links digest, 2023-11-07: Techno-optimism and more

jasoncrawford 8 Nov 2023 2:05 UTC
17 points
7 comments · 11 min read · LW link
(rootsofprogress.org)

An­nounc­ing Athena—Women in AI Align­ment Research

Claire Short 7 Nov 2023 21:46 UTC
80 points
2 comments · 3 min read · LW link

Vote on Interesting Disagreements

Ben Pace 7 Nov 2023 21:35 UTC
159 points
129 comments · 1 min read · LW link

What is democracy for?

Johnstone 7 Nov 2023 18:17 UTC
−5 points
10 comments · 7 min read · LW link

Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation

7 Nov 2023 17:59 UTC
36 points
2 comments · 2 min read · LW link
(arxiv.org)

Implementing Decision Theory

justinpombrio 7 Nov 2023 17:55 UTC
22 points
12 comments · 3 min read · LW link

Mirror, Mirror on the Wall: How Do Forecasters Fare by Their Own Call?

nikos 7 Nov 2023 17:39 UTC
14 points
5 comments · 14 min read · LW link

Symbiotic self-alignment of AIs.

Spiritus Dei 7 Nov 2023 17:18 UTC
1 point
0 comments · 3 min read · LW link

AMA: Earning to Give

jefftk 7 Nov 2023 16:20 UTC
53 points
8 comments · 1 min read · LW link
(www.jefftk.com)

The Stochastic Parrot Hypothesis is debatable for the last generation of LLMs

7 Nov 2023 16:12 UTC
52 points
20 comments · 6 min read · LW link

Preface to the Sequence on LLM Psychology

Quentin FEUILLADE--MONTIXI 7 Nov 2023 16:12 UTC
33 points
0 comments · 2 min read · LW link

What I’ve been reading, November 2023

jasoncrawford 7 Nov 2023 13:37 UTC
23 points
1 comment · 5 min read · LW link
(rootsofprogress.org)

AI Alignment [Progress] this Week (11/05/2023)

Logan Zoellner 7 Nov 2023 13:26 UTC
24 points
0 comments · 4 min read · LW link
(midwitalignment.substack.com)

On the UK Summit

Zvi 7 Nov 2023 13:10 UTC
74 points
6 comments · 30 min read · LW link
(thezvi.wordpress.com)

Box inversion revisited

Jan_Kulveit 7 Nov 2023 11:09 UTC
40 points
3 comments · 8 min read · LW link

AI Alignment Research Engineer Accelerator (ARENA): call for applicants

CallumMcDougall 7 Nov 2023 9:43 UTC
56 points
0 comments · 1 min read · LW link

The Perils of Professionalism

Screwtape 7 Nov 2023 0:07 UTC
43 points
1 comment · 10 min read · LW link

How to (hopefully ethically) make money off of AGI

6 Nov 2023 23:35 UTC
142 points
88 comments · 32 min read · LW link · 1 review

cost estimation for 2 grid energy storage systems

bhauth 6 Nov 2023 23:32 UTC
16 points
12 comments · 7 min read · LW link
(www.bhauth.com)

A bet on critical periods in neural networks

6 Nov 2023 23:21 UTC
24 points
1 comment · 6 min read · LW link

Job listing: Communications Generalist / Project Manager

Gretta Duleba 6 Nov 2023 20:21 UTC
49 points
7 comments · 1 min read · LW link

Askesis: a model of the cerebellum

MadHatter 6 Nov 2023 20:19 UTC
7 points
2 comments · 1 min read · LW link
(github.com)

LQPR: An Algorithm for Reinforcement Learning with Provable Safety Guarantees

MadHatter 6 Nov 2023 20:17 UTC
6 points
0 comments · 1 min read · LW link
(github.com)

ACX Meetup Leipzig

Roman Leipe 6 Nov 2023 18:33 UTC
1 point
0 comments · 1 min read · LW link

[Question] Does bulemia work?

lc 6 Nov 2023 17:58 UTC
6 points
18 comments · 1 min read · LW link

Why building ventures in AI Safety is particularly challenging

Heramb 6 Nov 2023 16:27 UTC
1 point
0 comments · 1 min read · LW link
(forum.effectivealtruism.org)