[Question] What’s your best util­i­tar­ian model for risk­ing your best kid­neys?

Ilio12 Nov 2023 23:01 UTC
−3 points
4 comments1 min readLW link

Helpful ex­am­ples to get a sense of mod­ern au­to­mated manipulation

trevor12 Nov 2023 20:49 UTC
33 points
3 comments9 min readLW link

The Snug­gle/​Date/​Slap Protocol

MadHatter12 Nov 2023 20:44 UTC
−21 points
4 comments2 min readLW link

Two chil­dren’s stories

Optimization Process12 Nov 2023 20:29 UTC
11 points
1 comment7 min readLW link

The Fun­da­men­tal The­o­rem for mea­surable fac­tor spaces

Matthias G. Mayer12 Nov 2023 19:25 UTC
38 points
2 comments2 min readLW link

How ac­cu­rate are stan­dard Dark Triad per­son­al­ity scales?

jamesbill12 Nov 2023 8:21 UTC
0 points
2 comments2 min readLW link

[Question] What ML gears do you like?

Ulisse Mini11 Nov 2023 19:10 UTC
25 points
4 comments1 min readLW link

Smart Ses­sions—Fi­nally a (kinda) win­dow-cen­tric ses­sion manager

Eli Tyre11 Nov 2023 18:54 UTC
13 points
3 comments5 min readLW link

AISC pro­ject: Satis­fIA – AI that satis­fies with­out over­do­ing it

Jobst Heitzig11 Nov 2023 18:22 UTC
12 points
0 comments1 min readLW link
(docs.google.com)

Con­trol Sym­me­try: why we might want to start in­ves­ti­gat­ing asym­met­ric al­ign­ment interventions

domenicrosati11 Nov 2023 17:27 UTC
25 points
1 comment2 min readLW link

Game The­ory with­out Argmax [Part 2]

Cleo Nardo11 Nov 2023 16:02 UTC
31 points
14 comments13 min readLW link

Game The­ory with­out Argmax [Part 1]

Cleo Nardo11 Nov 2023 15:59 UTC
69 points
18 comments19 min readLW link

It’s OK to be bi­ased to­wards humans

dr_s11 Nov 2023 11:59 UTC
55 points
69 comments6 min readLW link

The Top AI Safety Bets for 2023: GiveWiki’s Lat­est Recommendations

Dawn Drescher11 Nov 2023 9:04 UTC
3 points
2 comments1 min readLW link

Ar­tifi­cial Gen­eral Horsiness

robotelvis11 Nov 2023 5:15 UTC
4 points
0 comments5 min readLW link
(messyprogress.substack.com)

Pal­isade is hiring Re­search Engineers

11 Nov 2023 3:09 UTC
23 points
0 comments3 min readLW link

Open Phil re­leases RFPs on LLM Bench­marks and Forecasting

LawrenceC11 Nov 2023 3:01 UTC
53 points
0 comments2 min readLW link
(www.openphilanthropy.org)

Memo on some ne­glected topics

Lukas Finnveden11 Nov 2023 2:01 UTC
28 points
2 comments1 min readLW link
(open.substack.com)

Who is Sam Bankman-Fried (SBF) re­ally, and how could he have done what he did? - three the­o­ries and a lot of evidence

spencerg11 Nov 2023 1:04 UTC
36 points
28 comments1 min readLW link
(www.spencergreenberg.com)

Sur­vey on the ac­cel­er­a­tion risks of our new RFPs to study LLM capabilities

Ajeya Cotra10 Nov 2023 23:59 UTC
27 points
1 comment1 min readLW link

Rat Fest 2024

LoganChipkin10 Nov 2023 23:25 UTC
7 points
6 comments1 min readLW link

How I Think, Part Three: Weigh­ing Cryonics

Richard Henage10 Nov 2023 22:21 UTC
4 points
1 comment2 min readLW link

Lin­ear en­cod­ing of char­ac­ter-level in­for­ma­tion in GPT-J to­ken embeddings

10 Nov 2023 22:19 UTC
34 points
4 comments28 min readLW link

Fol­low-up sur­vey: inositol

Elizabeth10 Nov 2023 19:30 UTC
13 points
1 comment1 min readLW link
(acesounderglass.com)

We have promis­ing al­ign­ment plans with low taxes

Seth Herd10 Nov 2023 18:51 UTC
33 points
9 comments5 min readLW link

[Question] Vec­tor search on a large dataset?

camsdixon10 Nov 2023 18:43 UTC
−1 points
2 comments1 min readLW link

About Me

Abe Dillon10 Nov 2023 18:32 UTC
3 points
0 comments1 min readLW link

Me­tac­u­lus In­tro­duces AI-Pow­ered Com­mu­nity In­sights to Re­veal Fac­tors Driv­ing User Forecasts

ChristianWilliams10 Nov 2023 17:57 UTC
6 points
0 comments1 min readLW link
(www.metaculus.com)

Joy in the Here and Real

Screwtape10 Nov 2023 17:22 UTC
18 points
0 comments2 min readLW link

Arte­facts gen­er­ated by mode col­lapse in GPT-4 Turbo serve as ad­ver­sar­ial at­tacks.

Sohaib Imran10 Nov 2023 15:23 UTC
11 points
0 comments2 min readLW link

Wastew­a­ter RNA Read Lengths

jefftk10 Nov 2023 15:20 UTC
13 points
0 comments4 min readLW link
(www.jefftk.com)

Up­date on the UK AI Sum­mit and the UK’s Plans

Elliot Mckernon10 Nov 2023 14:47 UTC
11 points
0 comments8 min readLW link

Liv Bo­eree Ted Talk Moloch & AI

Neil 10 Nov 2023 14:04 UTC
10 points
2 comments1 min readLW link
(m.youtube.com)

Pick­ing Men­tors For Re­search Programmes

Raymond D10 Nov 2023 13:01 UTC
106 points
8 comments4 min readLW link

GPT-2030 and Catas­trophic Drives: Four Vignettes

jsteinhardt10 Nov 2023 7:30 UTC
50 points
5 comments10 min readLW link
(bounded-regret.ghost.io)

Crock, Crocker, Crockiest

Screwtape10 Nov 2023 6:14 UTC
21 points
4 comments6 min readLW link

AI Timelines

10 Nov 2023 5:28 UTC
279 points
96 comments51 min readLW link

ACI#6: A Non-Dual­is­tic ACI Model

Akira Pyinya9 Nov 2023 23:01 UTC
10 points
2 comments6 min readLW link

How I got so ex­cited about HowTruthful

Bruce Lewis9 Nov 2023 18:49 UTC
17 points
3 comments5 min readLW link

The case for “Gen­er­ous Tit for Tat” as the ul­ti­mate game the­ory strategy

positivesum9 Nov 2023 18:41 UTC
2 points
3 comments8 min readLW link
(tryingtruly.substack.com)

Text Posts from the Kids Group: 2021

jefftk9 Nov 2023 17:50 UTC
38 points
1 comment8 min readLW link
(www.jefftk.com)

AI #37: Mov­ing Too Fast

Zvi9 Nov 2023 17:50 UTC
53 points
5 comments76 min readLW link
(thezvi.wordpress.com)

Learn­ing-the­o­retic agenda read­ing list

Vanessa Kosoy9 Nov 2023 17:25 UTC
98 points
0 comments2 min readLW link

​​ Open-ended/​Phenom­e­nal ​Ethics ​(TLDR)

Ryo 9 Nov 2023 16:58 UTC
3 points
0 comments1 min readLW link

Poly­se­man­tic At­ten­tion Head in a 4-Layer Transformer

9 Nov 2023 16:16 UTC
51 points
0 comments6 min readLW link

On OpenAI Dev Day

Zvi9 Nov 2023 16:10 UTC
60 points
0 comments15 min readLW link
(thezvi.wordpress.com)

An­trop­i­cal Prob­a­bil­ities Are Fully Ex­plained by Differ­ence in Pos­si­ble Outcomes

Ape in the coat9 Nov 2023 15:34 UTC
17 points
2 comments5 min readLW link

A free to en­ter, 240 char­ac­ter, open-source iter­ated pris­oner’s dilemma tournament

Isaac King9 Nov 2023 8:24 UTC
64 points
19 comments1 min readLW link
(manifold.markets)

Into AI Safety Epi­sodes 1 & 2

jacobhaimes9 Nov 2023 4:36 UTC
2 points
0 comments1 min readLW link
(into-ai-safety.github.io)

Mak­ing Bad De­ci­sions On Purpose

Screwtape9 Nov 2023 3:36 UTC
48 points
8 comments5 min readLW link