[Question] Is In­struc­tGPT Fol­low­ing In­struc­tions in Other Lan­guages Sur­pris­ing?

DragonGod13 Feb 2023 23:26 UTC
39 points
15 comments1 min readLW link

LLM Ba­sics: Embed­ding Spaces—Trans­former To­ken Vec­tors Are Not Points in Space

NickyP13 Feb 2023 18:52 UTC
79 points
11 comments15 min readLW link

4 ways to think about de­moc­ra­tiz­ing AI [GovAI Linkpost]

Akash13 Feb 2023 18:06 UTC
24 points
4 comments1 min readLW link
(www.governance.ai)

Does the AGPL Work?

jefftk13 Feb 2023 14:20 UTC
13 points
12 comments2 min readLW link
(www.jefftk.com)

H5N1

Zvi13 Feb 2023 12:50 UTC
101 points
1 comment9 min readLW link
(thezvi.wordpress.com)

En­joy LessWrong in ebook format

Bart Bussmann13 Feb 2023 11:53 UTC
53 points
2 comments1 min readLW link

Mor­pholog­i­cal in­tel­li­gence, su­per­hu­man em­pa­thy, and eth­i­cal arbitration

Roman Leventov13 Feb 2023 10:25 UTC
1 point
0 comments2 min readLW link

South Bay ACX/​LW Meetup

IS13 Feb 2023 6:08 UTC
3 points
0 comments1 min readLW link

Idea: Net­work mod­u­lar­ity and in­ter­pretabil­ity by sex­ual reproduction

qbolec12 Feb 2023 23:06 UTC
3 points
3 comments1 min readLW link

The End of Anonymity Online

Spiorad12 Feb 2023 21:23 UTC
3 points
9 comments2 min readLW link

Matt Clancy AMA on the Progress Forum

jasoncrawford12 Feb 2023 20:23 UTC
17 points
0 comments1 min readLW link
(progressforum.org)

La­tent vari­ables for pre­dic­tion mar­kets: mo­ti­va­tion, tech­ni­cal guide, and de­sign considerations

tailcalled12 Feb 2023 17:54 UTC
100 points
18 comments23 min readLW link1 review

The con­cep­tual Dop­pelgänger problem

TsviBT12 Feb 2023 17:23 UTC
12 points
5 comments4 min readLW link

How Car­dioid Are Car­dioids?

jefftk12 Feb 2023 16:20 UTC
9 points
0 comments2 min readLW link
(www.jefftk.com)

How many of these jobs will have a 15% or more drop in em­ploy­ment plau­si­bly at­tributable to AI by 2031?

tailcalled12 Feb 2023 15:40 UTC
12 points
5 comments1 min readLW link
(manifold.markets)

Hu­man-AI col­lab­o­ra­tive writing

DirectedEvolution12 Feb 2023 14:57 UTC
20 points
2 comments5 min readLW link

RaD-AI workshop

Ram Rachum12 Feb 2023 12:46 UTC
3 points
0 comments1 min readLW link

Ele­ments of Ra­tion­al­ist Discourse

Rob Bensinger12 Feb 2023 7:58 UTC
223 points
49 comments3 min readLW link1 review

Con­flict The­ory of Bounded Distrust

Zack_M_Davis12 Feb 2023 5:30 UTC
108 points
30 comments3 min readLW link1 review

Why al­most ev­ery RL agent does learned optimization

Lee Sharkey12 Feb 2023 4:58 UTC
32 points
3 comments5 min readLW link

How I Learn From Textbooks

DirectedEvolution12 Feb 2023 4:45 UTC
24 points
3 comments8 min readLW link

Top YouTube chan­nel Ver­i­ta­sium re­leases video on Sleep­ing Beauty Problem

Alex_Altair11 Feb 2023 20:36 UTC
25 points
22 comments1 min readLW link
(www.youtube.com)

Short­en­ing Timelines: There’s No Buffer Anymore

Jeff Rose11 Feb 2023 19:53 UTC
10 points
5 comments1 min readLW link

We Found An Neu­ron in GPT-2

11 Feb 2023 18:27 UTC
143 points
23 comments7 min readLW link
(clementneo.com)

The Prac­ti­tioner’s Path 2.0: the Prag­ma­tist Archetype

Evenflair11 Feb 2023 15:48 UTC
21 points
0 comments2 min readLW link
(guildoftherose.org)

The Illu­sion of Sim­plic­ity: Mone­tary Policy as a Prob­lem of Com­plex­ity and Alignment

Edward P. Könings11 Feb 2023 15:04 UTC
8 points
0 comments8 min readLW link
(edwardknings.substack.com)

In Defense of Chat­bot Romance

Kaj_Sotala11 Feb 2023 14:30 UTC
123 points
52 comments11 min readLW link
(kajsotala.fi)

Threat­en­ing to do the im­pos­si­ble: A solu­tion to spu­ri­ous coun­ter­fac­tu­als for func­tional de­ci­sion the­ory via proof theory

Christopher King11 Feb 2023 7:57 UTC
5 points
4 comments5 min readLW link

Ra­tion­al­ity-re­lated things I don’t know as of 2023

Adam Zerner11 Feb 2023 6:04 UTC
64 points
59 comments3 min readLW link

A note on ‘semiotic physics’

metasemi11 Feb 2023 5:12 UTC
11 points
13 comments6 min readLW link

Inequal­ity Penalty: Mo­ral­ity in Many Worlds

Shmi11 Feb 2023 4:08 UTC
11 points
17 comments6 min readLW link

The Im­por­tance of AI Align­ment, ex­plained in 5 points

Daniel_Eth11 Feb 2023 2:56 UTC
33 points
2 comments1 min readLW link

Act­ing Nor­mal is Good, Actually

Gordon Seidoh Worley10 Feb 2023 23:35 UTC
14 points
5 comments3 min readLW link

[S] D&D.Sci: All the D8a. Allllllll of it.

aphyer10 Feb 2023 21:14 UTC
43 points
17 comments6 min readLW link

A Differ­ent Kind of Ark: My failed at­tempt to build a bridge be­tween universes

ChrisM10 Feb 2023 20:49 UTC
2 points
2 comments6 min readLW link
(www.vesselproject.io)

Prizes for the 2021 Review

Raemon10 Feb 2023 19:47 UTC
69 points
2 comments4 min readLW link

A pro­posed method for fore­cast­ing trans­for­ma­tive AI

Matthew Barnett10 Feb 2023 19:34 UTC
121 points
21 comments10 min readLW link

The best way so far to ex­plain AI risk: The Precipice (p. 137-149)

trevor10 Feb 2023 19:33 UTC
50 points
2 comments17 min readLW link

Is this a weak pivotal act: cre­at­ing nanobots that eat evil AGIs (but noth­ing else)?

Christopher King10 Feb 2023 19:26 UTC
0 points
3 comments1 min readLW link

Why I’m not work­ing on {de­bate, RRM, ELK, nat­u­ral ab­strac­tions}

Steven Byrnes10 Feb 2023 19:22 UTC
71 points
19 comments9 min readLW link

Con­di­tion­ing Pre­dic­tive Models: Open prob­lems, Con­clu­sion, and Appendix

10 Feb 2023 19:21 UTC
36 points
3 comments11 min readLW link

Jobs that can help with the most im­por­tant century

HoldenKarnofsky10 Feb 2023 18:20 UTC
24 points
0 comments19 min readLW link
(www.cold-takes.com)

[Question] Is it a co­in­ci­dence that GPT-3 re­quires roughly the same amount of com­pute as is nec­es­sary to em­u­late the hu­man brain?

RomanS10 Feb 2023 16:26 UTC
11 points
10 comments1 min readLW link

Con­tra: Chang­ing Role Terms

jefftk10 Feb 2023 15:00 UTC
8 points
0 comments3 min readLW link
(www.jefftk.com)

Cyborgism

10 Feb 2023 14:47 UTC
337 points
46 comments35 min readLW link

FLI Pod­cast: Con­nor Leahy on AI Progress, Chimps, Memes, and Mar­kets (Part 1/​3)

10 Feb 2023 13:55 UTC
39 points
0 comments43 min readLW link

Many im­por­tant tech­nolo­gies start out as sci­ence fic­tion be­fore be­com­ing real

trevor10 Feb 2023 9:36 UTC
28 points
2 comments2 min readLW link

[Question] What’s ac­tu­ally go­ing on in the “mind” of the model when we fine-tune GPT-3 to In­struc­tGPT?

rpglover6410 Feb 2023 7:57 UTC
18 points
3 comments1 min readLW link

Mechanism De­sign for AI Safety—Agenda Creation Retreat

Rubi J. Hudson10 Feb 2023 3:05 UTC
24 points
2 comments1 min readLW link

[Question] On util­ity functions

jodaru10 Feb 2023 1:22 UTC
11 points
10 comments1 min readLW link