What’s A “Market”?

johnswentworth · 8 Aug 2023 23:29 UTC
94 points
16 comments · 10 min read · LW link

Podcast (+transcript): Nathan Barnard on how US financial regulation can inform AI governance

Aaron Bergman · 8 Aug 2023 21:46 UTC
8 points
0 comments · 1 min read · LW link
(www.aaronbergman.net)

What are the flaws in this argument about p(Doom)?

William the Kiwi · 8 Aug 2023 20:34 UTC
0 points
25 comments · 1 min read · LW link

A Simple Theory Of Consciousness

SherlockHolmes · 8 Aug 2023 18:05 UTC
2 points
5 comments · 1 min read · LW link
(peterholmes.medium.com)

[Linkpost] Rationally awake

jpc · 8 Aug 2023 17:59 UTC
−1 points
0 comments · 4 min read · LW link
(jpc.dev)

Yet more UFO Betting: Put Up or Shut Up

MoreRatsWrongReUAP · 8 Aug 2023 17:50 UTC
10 points
18 comments · 1 min read · LW link

AISN #18: Challenges of Reinforcement Learning from Human Feedback, Microsoft’s Security Breach, and Conceptual Research on AI Safety

aogara · 8 Aug 2023 15:52 UTC
13 points
0 comments · 1 min read · LW link
(newsletter.safe.ai)

[Question] Beginner’s question about RLHF

FTPickle · 8 Aug 2023 15:48 UTC
1 point
3 comments · 1 min read · LW link

My Trial Period as an Independent Alignment Researcher

Bart Bussmann · 8 Aug 2023 14:16 UTC
34 points
1 comment · 3 min read · LW link

4 types of AGI selection, and how to constrain them

Remmelt · 8 Aug 2023 10:02 UTC
−4 points
3 comments · 3 min read · LW link

Notice your everything

metachirality · 8 Aug 2023 2:38 UTC
15 points
1 comment · 2 min read · LW link

Model Organisms of Misalignment: The Case for a New Pillar of Alignment Research

8 Aug 2023 1:30 UTC
312 points
28 comments · 18 min read · LW link

Perpetually Declining Population?

jefftk · 8 Aug 2023 1:30 UTC
48 points
29 comments · 3 min read · LW link
(www.jefftk.com)

[Question] How do I find all the items on LW that I’ve *favorited* or upvoted?

Alex K. Chen (parrot) · 7 Aug 2023 23:51 UTC
14 points
3 comments · 1 min read · LW link

A plea for more funding shortfall transparency

porby · 7 Aug 2023 21:33 UTC
73 points
4 comments · 2 min read · LW link

[Question] Tips for reducing thinking branching factor

Simon Berens · 7 Aug 2023 20:21 UTC
4 points
6 comments · 1 min read · LW link

An interactive introduction to grokking and mechanistic interpretability

7 Aug 2023 19:09 UTC
23 points
3 comments · 1 min read · LW link
(pair.withgoogle.com)

Feedbackloop-first Rationality

Raemon · 7 Aug 2023 17:58 UTC
192 points
65 comments · 8 min read · LW link

Growing Bonsai Networks with RNNs

ameo · 7 Aug 2023 17:34 UTC
21 points
5 comments · 1 min read · LW link
(cprimozic.net)

[Question] Should I test myself for microplastics?

Augs · 7 Aug 2023 17:31 UTC
9 points
2 comments · 1 min read · LW link

Optimisation Measures: Desiderata, Impossibility, Proposals

7 Aug 2023 15:52 UTC
35 points
9 comments · 1 min read · LW link

Announcing the Clearer Thinking micro-grants program for 2023

spencerg · 7 Aug 2023 15:21 UTC
14 points
1 comment · 1 min read · LW link
(www.clearerthinking.org)

What I’ve been reading, July–August 2023

jasoncrawford · 7 Aug 2023 14:22 UTC
23 points
0 comments · 13 min read · LW link
(rootsofprogress.org)

Monthly Roundup #9: August 2023

Zvi · 7 Aug 2023 13:20 UTC
42 points
25 comments · 57 min read · LW link
(thezvi.wordpress.com)

Strengthening the Argument for Intrinsic AI Safety: The S-Curves Perspective

avturchin · 7 Aug 2023 13:13 UTC
8 points
0 comments · 12 min read · LW link

Overview of how AI might exacerbate long-running catastrophic risks

Hauke Hillebrandt · 7 Aug 2023 11:53 UTC
20 points
0 comments · 11 min read · LW link
(aisafetyfundamentals.com)

The second act: Beginning epistemic rigor at 30

hiAndrewQuinn · 7 Aug 2023 9:34 UTC
6 points
0 comments · 3 min read · LW link

Drinks at a bar

yakimoff · 7 Aug 2023 2:52 UTC
3 points
0 comments · 1 min read · LW link

Problems with Robin Hanson’s Quillette Article On AI

DaemonicSigil · 6 Aug 2023 22:13 UTC
89 points
33 comments · 8 min read · LW link

Yann LeCun on AGI and AI Safety

Chris_Leong · 6 Aug 2023 21:56 UTC
37 points
13 comments · 1 min read · LW link
(drive.google.com)

Computational Thread Art

CallumMcDougall · 6 Aug 2023 21:42 UTC
75 points
2 comments · 6 min read · LW link

‘We’re changing the clouds.’ An unforeseen test of geoengineering is fueling record ocean warmth

Annapurna · 6 Aug 2023 20:58 UTC
60 points
6 comments · 1 min read · LW link
(www.science.org)

[Linkpost] Will AI avoid exploitation?

cdkg · 6 Aug 2023 14:28 UTC
22 points
1 comment · 1 min read · LW link

Reducing the risk of catastrophically misaligned AI by avoiding the Singleton scenario: the Manyton Variant

GravitasGradient · 6 Aug 2023 14:24 UTC
−6 points
0 comments · 3 min read · LW link

Rebooting AI Governance: An AI-Driven Approach to AI Governance

utilon · 6 Aug 2023 14:19 UTC
1 point
1 comment · 29 min read · LW link
(forum.effectivealtruism.org)

Model-Based Policy Analysis under Deep Uncertainty

utilon · 6 Aug 2023 14:07 UTC
14 points
1 comment · 23 min read · LW link
(forum.effectivealtruism.org)

[Question] On being in a bad place and too stubborn to leave.

TeaTieAndHat · 6 Aug 2023 11:45 UTC
12 points
14 comments · 3 min read · LW link

Safety-First Agents/Architectures Are a Promising Path to Safe AGI

Brendon_Wong · 6 Aug 2023 8:02 UTC
13 points
2 comments · 12 min read · LW link

The Benevolent Ruler’s Handbook (Part 1): The Policy Problem

FCCC · 6 Aug 2023 3:46 UTC
11 points
3 comments · 4 min read · LW link

Exploring the Multiverse of Large Language Models

franky · 6 Aug 2023 2:38 UTC
1 point
0 comments · 5 min read · LW link

Aligning my web server with devops practices: part 2 (security)

VipulNaik · 6 Aug 2023 1:30 UTC
6 points
0 comments · 19 min read · LW link

Summary of Improving Global Decision Making (around AI)

Will_Pearson · 5 Aug 2023 18:46 UTC
−7 points
0 comments · 1 min read · LW link

Ground-Truth Label Imbalance Impairs the Performance of Contrast-Consistent Search (and Other Contrast-Pair-Based Unsupervised Methods)

5 Aug 2023 17:55 UTC
6 points
2 comments · 7 min read · LW link
(drive.google.com)

Seattle Astral Codex Ten Monthly Social

a7x · 5 Aug 2023 17:55 UTC
1 point
0 comments · 1 min read · LW link

AISafety.info’s Writing & Editing Hackathon

smallsilo · 5 Aug 2023 17:14 UTC
2 points
0 comments · 1 min read · LW link

Join AISafety.info’s Writing & Editing Hackathon (Aug 25-28) (Prizes to be won!)

smallsilo · 5 Aug 2023 14:08 UTC
19 points
3 comments · 1 min read · LW link
(forum.effectivealtruism.org)

Stomach Ulcers and Dental Cavities

Metacelsus · 5 Aug 2023 14:08 UTC
56 points
7 comments · 1 min read · LW link
(denovo.substack.com)

video games > IQ tests

bhauth · 5 Aug 2023 13:27 UTC
35 points
45 comments · 3 min read · LW link

[Linkpost] Applicability of scaling laws to vision encoding models

Bogdan Ionut Cirstea · 5 Aug 2023 11:10 UTC
11 points
2 comments · 1 min read · LW link

A Naive Proposal for Constructing Interpretable AI

Chris_Leong · 5 Aug 2023 10:32 UTC
18 points
6 comments · 2 min read · LW link