RSS

OpenAI

TagLast edit: 27 Aug 2022 18:11 UTC by Multicore

OpenAI is an organisation that performs AI research, and houses a substantial amount of AI alignment research. Its stated mission is “Discovering and enacting the path to safe artificial general intelligence.”

This tag is for explicit discussion of the organisation, not for all work published by researchers at that organisation.

See also:

OpenAI projects: GPT, DALL-E

Other related tags: Language Models, Machine Learning

Public-fac­ing Cen­sor­ship Is Safety Theater, Caus­ing Rep­u­ta­tional Da­m­age

Yitz23 Sep 2022 5:08 UTC
149 points
42 comments6 min readLW link

An OpenAI board seat is sur­pris­ingly expensive

Benquo19 Apr 2017 9:05 UTC
12 points
16 comments1 min readLW link

Com­mon mis­con­cep­tions about OpenAI

Jacob_Hilton25 Aug 2022 14:02 UTC
251 points
147 comments5 min readLW link1 review

Mul­ti­modal Neu­rons in Ar­tifi­cial Neu­ral Networks

Kaj_Sotala5 Mar 2021 9:01 UTC
57 points
2 comments2 min readLW link
(distill.pub)

OpenAI an­nounces GPT-3

gwern29 May 2020 1:49 UTC
67 points
23 comments1 min readLW link
(arxiv.org)

OpenAI charter

wunan9 Apr 2018 21:02 UTC
17 points
2 comments1 min readLW link
(blog.openai.com)

OpenAI makes hu­man­ity less safe

Benquo3 Apr 2017 19:07 UTC
71 points
109 comments6 min readLW link

the scal­ing “in­con­sis­tency”: openAI’s new insight

nostalgebraist7 Nov 2020 7:40 UTC
148 points
14 comments9 min readLW link
(nostalgebraist.tumblr.com)

GPT-4 Plugs In

Zvi27 Mar 2023 12:10 UTC
198 points
47 comments6 min readLW link
(thezvi.wordpress.com)

[Question] Will OpenAI’s work un­in­ten­tion­ally in­crease ex­is­ten­tial risks re­lated to AI?

adamShimi11 Aug 2020 18:16 UTC
53 points
55 comments1 min readLW link

Tran­script of Sam Alt­man’s in­ter­view touch­ing on AI safety

Andy_McKenzie20 Jan 2023 16:14 UTC
121 points
42 comments10 min readLW link

Devel­op­men­tal Stages of GPTs

orthonormal26 Jul 2020 22:03 UTC
140 points
72 comments7 min readLW link1 review

Sam Alt­man’s sister, An­nie Alt­man, claims Sam has severely abused her

prometheus50157 Oct 2023 21:06 UTC
98 points
107 comments183 min readLW link

DALL-E by OpenAI

Daniel Kokotajlo5 Jan 2021 20:05 UTC
97 points
20 comments1 min readLW link

A challenge for AGI or­ga­ni­za­tions, and a challenge for readers

1 Dec 2022 23:11 UTC
301 points
33 comments2 min readLW link

[Question] How will OpenAI + GitHub’s Copi­lot af­fect pro­gram­ming?

smountjoy29 Jun 2021 16:42 UTC
55 points
24 comments1 min readLW link

[Linkpost] In­tro­duc­ing Superalignment

beren5 Jul 2023 18:23 UTC
174 points
69 comments1 min readLW link
(openai.com)

DALL·E 2 by OpenAI

P.6 Apr 2022 14:17 UTC
44 points
49 comments1 min readLW link
(openai.com)

BIG-Bench Ca­nary Con­tam­i­na­tion in GPT-4

Jozdien22 Oct 2024 15:40 UTC
123 points
13 comments4 min readLW link

An at­tempt to steel­man OpenAI’s al­ign­ment plan

Nathan Helm-Burger13 Jul 2023 18:25 UTC
22 points
0 comments4 min readLW link

[Question] What should OpenAI do that it hasn’t already done, to stop their va­can­cies from be­ing ad­ver­tised on the 80k Job Board?

WitheringWeights21 Oct 2024 13:57 UTC
21 points
0 comments1 min readLW link

Fron­tier Model Forum

Zach Stein-Perlman26 Jul 2023 14:30 UTC
27 points
0 comments4 min readLW link
(blog.google)

AXRP Epi­sode 24 - Su­per­al­ign­ment with Jan Leike

DanielFilan27 Jul 2023 4:00 UTC
55 points
3 comments69 min readLW link

OpenAI: He­len Toner Speaks

Zvi30 May 2024 21:10 UTC
86 points
8 comments13 min readLW link
(thezvi.wordpress.com)

Non-Dis­par­age­ment Ca­naries for OpenAI

30 May 2024 19:20 UTC
287 points
51 comments2 min readLW link

Unit eco­nomics of LLM APIs

27 Aug 2024 16:51 UTC
42 points
0 comments2 min readLW link

What is OpenAI’s plan for mak­ing AI Safer?

brook1 Sep 2023 11:15 UTC
6 points
0 comments4 min readLW link
(aisafetyexplained.substack.com)

Dall-E 3

p.b.2 Oct 2023 20:33 UTC
37 points
9 comments1 min readLW link
(openai.com)

“Suc­cess­ful lan­guage model evals” by Ja­son Wei

Arjun Panickssery25 May 2024 9:34 UTC
7 points
0 comments1 min readLW link
(www.jasonwei.net)

A crazy hy­poth­e­sis: GPT-4 already is agen­tic and is try­ing to take over the world!

Christopher King24 Mar 2023 1:19 UTC
−2 points
11 comments9 min readLW link

Sam Alt­man on GPT-4, ChatGPT, and the Fu­ture of AI | Lex Frid­man Pod­cast #367

Gabe M25 Mar 2023 19:08 UTC
63 points
4 comments2 min readLW link
(www.youtube.com)

OpenAI Codex: First Impressions

specbug13 Aug 2021 16:52 UTC
49 points
8 comments4 min readLW link
(sixeleven.in)

[Question] How to think about and deal with OpenAI

Rafael Harth9 Oct 2021 13:10 UTC
110 points
68 comments1 min readLW link

How do new mod­els from OpenAI, Deep­Mind and An­thropic perform on Truth­fulQA?

Owain_Evans26 Feb 2022 12:46 UTC
44 points
3 comments11 min readLW link

dalle2 comments

nostalgebraist26 Apr 2022 5:30 UTC
183 points
14 comments13 min readLW link
(nostalgebraist.tumblr.com)

An­thropic AI made the right call

bhauth15 Apr 2024 0:39 UTC
22 points
20 comments1 min readLW link

[Link] OpenAI: Learn­ing to Play Minecraft with Video PreTrain­ing (VPT)

Aryeh Englander23 Jun 2022 16:29 UTC
53 points
3 comments1 min readLW link

OpenAI’s Align­ment Plans

dkirmani24 Aug 2022 19:39 UTC
60 points
17 comments5 min readLW link
(openai.com)

[Question] Can GPT-4 play 20 ques­tions against an­other in­stance of it­self?

Nathan Helm-Burger28 Mar 2023 1:11 UTC
15 points
1 comment1 min readLW link
(evanthebouncy.medium.com)

Epi­sode: Austin vs Linch on OpenAI

Austin Chen25 May 2024 16:15 UTC
20 points
25 comments1 min readLW link
(manifund.substack.com)

Do Not Mess With Scar­lett Johansson

Zvi22 May 2024 15:10 UTC
65 points
7 comments16 min readLW link
(thezvi.wordpress.com)

An eval­u­a­tion of He­len Toner’s in­ter­view on the TED AI Show

PeterH6 Jun 2024 17:39 UTC
24 points
2 comments30 min readLW link

On Dwarkesh’s Pod­cast with OpenAI’s John Schulman

Zvi21 May 2024 17:30 UTC
73 points
4 comments20 min readLW link
(thezvi.wordpress.com)

Re­quest to AGI or­ga­ni­za­tions: Share your views on paus­ing AI progress

11 Apr 2023 17:30 UTC
141 points
11 comments1 min readLW link

OpenAI’s GPT-4 Safety Goals

PeterMcCluskey22 Apr 2023 19:11 UTC
3 points
3 comments4 min readLW link
(bayesianinvestor.com)

My thoughts on OpenAI’s Align­ment plan

Donald Hobson10 Dec 2022 10:35 UTC
25 points
1 comment6 min readLW link

OpenAI re­leases GPT-4o, na­tively in­ter­fac­ing with text, voice and vision

Martín Soto13 May 2024 18:50 UTC
54 points
23 comments1 min readLW link
(openai.com)

Ilya Sutskever and Jan Leike re­sign from OpenAI [up­dated]

Zach Stein-Perlman15 May 2024 0:45 UTC
246 points
95 comments2 min readLW link

A Brief Assess­ment of OpenAI’s Pre­pared­ness Frame­work & Some Sugges­tions for Improvement

simeon_c22 Jan 2024 20:08 UTC
14 points
0 comments6 min readLW link
(uploads-ssl.webflow.com)

My thoughts on OpenAI’s al­ign­ment plan

Akash30 Dec 2022 19:33 UTC
55 points
3 comments20 min readLW link

On OpenAI Dev Day

Zvi9 Nov 2023 16:10 UTC
60 points
0 comments15 min readLW link
(thezvi.wordpress.com)

[Linkpost] Jan Leike on three kinds of al­ign­ment taxes

Akash6 Jan 2023 23:57 UTC
27 points
2 comments3 min readLW link
(aligned.substack.com)

Microsoft Plans to In­vest $10B in OpenAI; $3B In­vested to Date | For­tune

DragonGod12 Jan 2023 3:55 UTC
23 points
10 comments2 min readLW link
(fortune.com)

OpenAI ap­points Re­tired U.S. Army Gen­eral Paul M. Naka­sone to Board of Directors

Joel Burget13 Jun 2024 21:28 UTC
35 points
10 comments1 min readLW link
(openai.com)

Sam Alt­man fired from OpenAI

LawrenceC17 Nov 2023 20:42 UTC
192 points
75 comments1 min readLW link
(openai.com)

OpenAI/​Microsoft an­nounce “next gen­er­a­tion lan­guage model” in­te­grated into Bing/​Edge

LawrenceC7 Feb 2023 20:38 UTC
79 points
4 comments1 min readLW link
(blogs.microsoft.com)

OpenAI #8: The Right to Warn

Zvi17 Jun 2024 12:00 UTC
97 points
8 comments34 min readLW link
(thezvi.wordpress.com)

On Deep­Mind’s Fron­tier Safety Framework

Zvi18 Jun 2024 13:30 UTC
37 points
4 comments8 min readLW link
(thezvi.wordpress.com)

On OpenAI’s Model Spec

Zvi21 Jun 2024 13:00 UTC
46 points
3 comments30 min readLW link
(thezvi.wordpress.com)

Dialogue on the Claim: “OpenAI’s Firing of Sam Alt­man (And Shortly-Sub­se­quent Events) On Net Re­duced Ex­is­ten­tial Risk From AGI”

21 Nov 2023 17:39 UTC
73 points
84 comments11 min readLW link

Pos­si­ble OpenAI’s Q* break­through and Deep­Mind’s AlphaGo-type sys­tems plus LLMs

Burny23 Nov 2023 3:16 UTC
37 points
25 comments2 min readLW link

[Question] why did OpenAI em­ploy­ees sign

bhauth27 Nov 2023 5:21 UTC
49 points
23 comments1 min readLW link

Sam Alt­man: “Plan­ning for AGI and be­yond”

LawrenceC24 Feb 2023 20:28 UTC
104 points
54 comments6 min readLW link
(openai.com)

AI #2

Zvi2 Mar 2023 14:50 UTC
66 points
18 comments55 min readLW link
(thezvi.wordpress.com)

OpenAI in­tro­duce ChatGPT API at 1/​10th the pre­vi­ous $/​token

Arthur Conmy1 Mar 2023 20:48 UTC
28 points
4 comments1 min readLW link
(openai.com)

Com­ments on OpenAI’s “Plan­ning for AGI and be­yond”

So8res3 Mar 2023 23:01 UTC
148 points
2 comments14 min readLW link

[Linkpost] Scott Alexan­der re­acts to OpenAI’s lat­est post

Akash11 Mar 2023 22:24 UTC
27 points
0 comments5 min readLW link
(astralcodexten.substack.com)

Boy­cott OpenAI

PeterMcCluskey18 Jun 2024 19:52 UTC
163 points
26 comments1 min readLW link
(bayesianinvestor.com)

ARC tests to see if GPT-4 can es­cape hu­man con­trol; GPT-4 failed to do so

Christopher King15 Mar 2023 0:29 UTC
116 points
22 comments2 min readLW link

Re­view of Align­ment Plan Cri­tiques- De­cem­ber AI-Plans Cri­tique-a-Thon Re­sults

Iknownothing15 Jan 2024 19:37 UTC
24 points
0 comments25 min readLW link
(aiplans.substack.com)

Elon files grave charges against OpenAI

mako yass1 Mar 2024 17:42 UTC
38 points
10 comments1 min readLW link
(www.courthousenews.com)

OpenAI’s In­tel­li­gence Levels

infinibot2713 Jul 2024 6:25 UTC
1 point
0 comments1 min readLW link
(www.bloomberg.com)

Open AI co-founder on AGI

ShardPhoenix16 Sep 2018 10:18 UTC
31 points
1 comment1 min readLW link
(youtu.be)

[Question] Who owns OpenAI’s new lan­guage model?

ioannes14 Feb 2019 17:51 UTC
16 points
9 comments1 min readLW link

[Link] In­tro­duc­ing OpenAI

Baughn11 Dec 2015 21:54 UTC
34 points
49 comments1 min readLW link

[Link] OpenAI on why we need so­cial scientists

ioannes19 Feb 2019 16:59 UTC
14 points
3 comments1 min readLW link

[Link] OpenAI LP

Alexei12 Mar 2019 23:22 UTC
13 points
0 comments1 min readLW link

The Hacker Learns to Trust

Ben Pace22 Jun 2019 0:27 UTC
80 points
18 comments8 min readLW link
(medium.com)

[Question] How does OpenAI’s lan­guage model af­fect our AI timeline es­ti­mates?

jimrandomh15 Feb 2019 3:11 UTC
50 points
7 comments1 min readLW link

[LINK] OpenAI do­ing an AMA today

Vika9 Jan 2016 14:47 UTC
6 points
3 comments1 min readLW link

Mis­nam­ing and Other Is­sues with OpenAI’s “Hu­man Level” Su­per­in­tel­li­gence Hierarchy

Davidmanheim15 Jul 2024 5:50 UTC
48 points
2 comments3 min readLW link

What’s Go­ing on With OpenAI’s Mes­sag­ing?

ozziegooen21 May 2024 2:22 UTC
191 points
13 comments1 min readLW link

Chris Olah’s views on AGI safety

evhub1 Nov 2019 20:13 UTC
207 points
38 comments12 min readLW link2 reviews

OpenAI’s Pre­pared­ness Frame­work: Praise & Recommendations

Akash2 Jan 2024 16:20 UTC
66 points
1 comment7 min readLW link

Why did ChatGPT say that? Prompt en­g­ineer­ing and more, with PIZZA.

Jessica Rumbelow3 Aug 2024 12:07 UTC
40 points
2 comments4 min readLW link

John Schul­man leaves OpenAI for Anthropic

Sodium6 Aug 2024 1:23 UTC
57 points
0 comments1 min readLW link

AI #76: Six Shorts Sto­ries About OpenAI

Zvi8 Aug 2024 13:50 UTC
53 points
10 comments48 min readLW link
(thezvi.wordpress.com)

My May 2023 pri­ori­ties for AI x-safety: more em­pa­thy, more unifi­ca­tion of con­cerns, and less vil­ifi­ca­tion of OpenAI

Andrew_Critch24 May 2023 0:02 UTC
267 points
39 comments8 min readLW link

GPT-4o Sys­tem Card

Zach Stein-Perlman8 Aug 2024 20:30 UTC
68 points
11 comments2 min readLW link
(openai.com)

OpenAI: Fallout

Zvi28 May 2024 13:20 UTC
204 points
25 comments36 min readLW link
(thezvi.wordpress.com)

[Question] What do we know about the AI knowl­edge and views, es­pe­cially about ex­is­ten­tial risk, of the new OpenAI board mem­bers?

Zvi11 Mar 2024 14:55 UTC
60 points
2 comments2 min readLW link

OpenAI: Pre­pared­ness framework

Zach Stein-Perlman18 Dec 2023 18:30 UTC
70 points
23 comments4 min readLW link
(openai.com)

OpenAI in­tro­duces func­tion call­ing for GPT-4

20 Jun 2023 1:58 UTC
24 points
3 comments4 min readLW link
(openai.com)

AI safety ad­vo­cates should con­sider pro­vid­ing gen­tle push­back fol­low­ing the events at OpenAI

civilsociety22 Dec 2023 18:55 UTC
16 points
5 comments3 min readLW link

The fu­ture of Hu­mans: Oper­a­tors of AI

François-Joseph Lacroix30 Dec 2023 23:46 UTC
1 point
0 comments1 min readLW link
(medium.com)

The Un­der­re­ac­tion to OpenAI

Sherrinford18 Jan 2024 22:08 UTC
21 points
0 comments6 min readLW link

OpenAI Credit Ac­count (2510$)

Emirhan BULUT21 Jan 2024 2:30 UTC
1 point
0 comments1 min readLW link

OpenAI Credit Ac­count (2510$)

Emirhan BULUT21 Jan 2024 2:32 UTC
1 point
0 comments1 min readLW link

Fif­teen Law­suits against OpenAI

Remmelt9 Mar 2024 12:22 UTC
27 points
4 comments1 min readLW link

W2SG: Introduction

Maria Kapros10 Mar 2024 16:25 UTC
1 point
2 comments10 min readLW link

OpenAI: The Board Expands

Zvi12 Mar 2024 14:00 UTC
92 points
1 comment30 min readLW link
(thezvi.wordpress.com)

The In­for­ma­tion: OpenAI shows ‘Straw­berry’ to feds, races to launch it

Martín Soto27 Aug 2024 23:10 UTC
144 points
15 comments3 min readLW link

Former OpenAI Su­per­al­ign­ment Re­searcher: Su­per­in­tel­li­gence by 2030

Julian Bradshaw5 Jun 2024 3:35 UTC
69 points
30 comments1 min readLW link
(situational-awareness.ai)

Why I’m do­ing PauseAI

Joseph Miller30 Apr 2024 16:21 UTC
106 points
16 comments4 min readLW link

GPT-4o is out

WitheringWeights13 May 2024 18:33 UTC
21 points
1 comment1 min readLW link

[Question] How is GPT-4o Re­lated to GPT-4?

Joel Burget15 May 2024 18:33 UTC
10 points
2 comments1 min readLW link

80,000 hours should re­move OpenAI from the Job Board (and similar EA orgs should do similarly)

Raemon3 Jul 2024 20:34 UTC
272 points
71 comments1 min readLW link

[EAFo­rum xpost] A break­down of OpenAI’s revenue

10 Jul 2024 18:09 UTC
57 points
5 comments1 min readLW link
(forum.effectivealtruism.org)

OpenAI Boy­cott Revisit

Jake Dennie22 Jul 2024 1:44 UTC
17 points
2 comments2 min readLW link

Mira Mu­rati leaves OpenAI/​ OpenAI to re­move non-profit control

Sodium25 Sep 2024 21:15 UTC
58 points
4 comments2 min readLW link

Sam Alt­man’s Busi­ness Negging

Julian Bradshaw30 Sep 2024 21:06 UTC
13 points
0 comments1 min readLW link
(www.bloomberg.com)

OpenAI defected, but we can take hon­est actions

Remmelt21 Oct 2024 8:41 UTC
17 points
15 comments1 min readLW link

Miles Brundage re­signed from OpenAI, and his AGI readi­ness team was disbanded

garrison23 Oct 2024 23:40 UTC
118 points
1 comment7 min readLW link
(garrisonlovely.substack.com)

Meta AI (FAIR) lat­est pa­per in­te­grates sys­tem-1 and sys­tem-2 think­ing into rea­son­ing mod­els.

happy friday24 Oct 2024 16:54 UTC
8 points
0 comments1 min readLW link

[Question] Is OpenAI net nega­tive for AI Safety?

Lysandre Terrisse2 Nov 2024 16:18 UTC
4 points
0 comments1 min readLW link

[Question] Us­ing hex to get mur­der ad­vice from GPT-4o

Laurence Freeman13 Nov 2024 18:30 UTC
10 points
5 comments6 min readLW link

Dario Amodei leaves OpenAI

Daniel Kokotajlo29 Dec 2020 19:31 UTC
69 points
13 comments1 min readLW link

[Question] GPT-4 Specs: 1 Trillion Pa­ram­e­ters?

infinibot2726 Mar 2023 18:56 UTC
6 points
8 comments1 min readLW link

What can we learn from Lex Frid­man’s in­ter­view with Sam Alt­man?

Karl von Wendt27 Mar 2023 6:27 UTC
56 points
22 comments9 min readLW link

I had a chat with GPT-4 on the fu­ture of AI and AI safety

Kristian Freed28 Mar 2023 17:47 UTC
1 point
0 comments8 min readLW link

AGI: Hire Soft­ware Eng­ineers—All of Them, Right Now

MGow30 Mar 2023 18:40 UTC
−18 points
3 comments1 min readLW link

[Question] Trans­former trained on it’s own con­tent?

Micromegas1 Apr 2023 15:08 UTC
1 point
0 comments1 min readLW link

OpenAI: Our ap­proach to AI safety

Jacob G-W5 Apr 2023 20:26 UTC
1 point
1 comment1 min readLW link
(openai.com)

Scal­able And Trans­fer­able Black-Box Jailbreaks For Lan­guage Models Via Per­sona Modulation

7 Nov 2023 17:59 UTC
36 points
2 comments2 min readLW link
(arxiv.org)

[Linkpost] Sam Alt­man’s 2015 Blog Posts Ma­chine In­tel­li­gence Parts 1 & 2

OliviaJ28 Apr 2023 16:02 UTC
70 points
4 comments9 min readLW link

Ilya: The AI sci­en­tist shap­ing the world

David Varga20 Nov 2023 13:09 UTC
11 points
0 comments4 min readLW link

A Girar­dian in­ter­pre­ta­tion of the Alt­man af­fair, it’s on my to-do list

Bill Benzon20 Nov 2023 12:21 UTC
2 points
0 comments1 min readLW link

OpenAI: Facts from a Weekend

Zvi20 Nov 2023 15:30 UTC
271 points
165 comments9 min readLW link
(thezvi.wordpress.com)

[Question] Is OpenAI los­ing money on each re­quest?

thenoviceoof1 Dec 2023 3:27 UTC
8 points
8 comments5 min readLW link

OpenAI: Alt­man Returns

Zvi30 Nov 2023 14:10 UTC
66 points
12 comments11 min readLW link
(thezvi.wordpress.com)

OpenAI: The Bat­tle of the Board

Zvi22 Nov 2023 17:30 UTC
281 points
83 comments11 min readLW link
(thezvi.wordpress.com)

In defence of He­len Toner, Adam D’An­gelo, and Tasha McCauley (OpenAI post)

mrtreasure5 Dec 2023 18:40 UTC
6 points
2 comments1 min readLW link
(pastebin.com)

**In defence of He­len Toner, Adam D’An­gelo, and Tasha McCauley**

mrtreasure6 Dec 2023 2:02 UTC
25 points
3 comments9 min readLW link
(pastebin.com)

Cur­rent AI Safety Roles for Soft­ware Engineers

ozziegooen9 Nov 2018 20:57 UTC
70 points
9 comments4 min readLW link

Zoom In: An In­tro­duc­tion to Circuits

evhub10 Mar 2020 19:36 UTC
85 points
11 comments2 min readLW link
(distill.pub)

Us­ing ra­tio­nal­ity to de­bug Ma­chine Learning

Dr_Manhattan10 Apr 2018 20:03 UTC
20 points
3 comments1 min readLW link
(amid.fish)

Align­ment Newslet­ter #13: 07/​02/​18

Rohin Shah2 Jul 2018 16:10 UTC
70 points
12 comments8 min readLW link
(mailchi.mp)

OpenAI re­leases func­tional Dota 5v5 bot, aims to beat world cham­pi­ons by August

habryka26 Jun 2018 22:40 UTC
53 points
12 comments1 min readLW link
(blog.openai.com)

Rishi Su­nak men­tions “ex­is­ten­tial threats” in talk with OpenAI, Deep­Mind, An­thropic CEOs

24 May 2023 21:06 UTC
34 points
1 comment1 min readLW link
(www.gov.uk)

On AI and Compute

johncrox3 Apr 2019 19:00 UTC
36 points
10 comments5 min readLW link

Pro­gram­ming AGI is impossible

Áron Ecsenyi30 May 2023 23:05 UTC
1 point
0 comments4 min readLW link

OpenAI: Exodus

Zvi20 May 2024 13:10 UTC
153 points
26 comments44 min readLW link
(thezvi.wordpress.com)

OpenAI Launches Su­per­al­ign­ment Taskforce

Zvi11 Jul 2023 13:00 UTC
149 points
40 comments49 min readLW link
(thezvi.wordpress.com)

“Learn­ing to Sum­ma­rize with Hu­man Feed­back”—OpenAI

[deleted]7 Sep 2020 17:59 UTC
57 points
3 comments1 min readLW link

Could We Au­to­mate AI Align­ment Re­search?

Stephen McAleese10 Aug 2023 12:17 UTC
32 points
10 comments21 min readLW link

Hiring en­g­ineers and re­searchers to help al­ign GPT-3

paulfchristiano1 Oct 2020 18:54 UTC
206 points
13 comments3 min readLW link

De­bate up­date: Obfus­cated ar­gu­ments problem

Beth Barnes23 Dec 2020 3:24 UTC
135 points
24 comments16 min readLW link

Imi­ta­tive Gen­er­al­i­sa­tion (AKA ‘Learn­ing the Prior’)

Beth Barnes10 Jan 2021 0:30 UTC
107 points
15 comments11 min readLW link1 review

OpenAI: “Scal­ing Laws for Trans­fer”, Her­nan­dez et al.

Lukas Finnveden4 Feb 2021 12:49 UTC
14 points
3 comments1 min readLW link
(arxiv.org)

OpenAI Solves (Some) For­mal Math Olympiad Problems

Michaël Trazzi2 Feb 2022 21:49 UTC
78 points
27 comments2 min readLW link

Scott Aaron­son is join­ing OpenAI to work on AI safety

peterbarnett18 Jun 2022 4:06 UTC
117 points
31 comments1 min readLW link
(scottaaronson.blog)

Eval­u­at­ing OpenAI’s al­ign­ment plans us­ing train­ing stories

ojorgensen25 Aug 2022 16:12 UTC
4 points
0 comments5 min readLW link

The Slip­pery Slope from DALLE-2 to Deep­fake Anarchy

scasper5 Nov 2022 14:53 UTC
17 points
9 comments11 min readLW link

A first suc­cess story for Outer Align­ment: In­struc­tGPT

Noosphere898 Nov 2022 22:52 UTC
6 points
1 comment1 min readLW link
(openai.com)

Steer­ing Be­havi­our: Test­ing for (Non-)My­opia in Lan­guage Models

5 Dec 2022 20:28 UTC
40 points
19 comments10 min readLW link

ChatGPT: First Impressions

specbug1 Dec 2022 16:36 UTC
18 points
2 comments13 min readLW link
(sixeleven.in)

Did ChatGPT just gaslight me?

TW1231 Dec 2022 5:41 UTC
123 points
45 comments9 min readLW link
(aiwatchtower.substack.com)

[LINK] - ChatGPT discussion

JanB1 Dec 2022 15:04 UTC
13 points
8 comments1 min readLW link
(openai.com)

ChatGPT seems over­con­fi­dent to me

qbolec4 Dec 2022 8:03 UTC
19 points
3 comments16 min readLW link

Bioweapons, and ChatGPT (an­other vuln­er­a­bil­ity story)

Beeblebrox7 Dec 2022 7:27 UTC
−5 points
0 comments2 min readLW link

[Link] Why I’m op­ti­mistic about OpenAI’s al­ign­ment approach

janleike5 Dec 2022 22:51 UTC
98 points
15 comments1 min readLW link
(aligned.substack.com)

ChatGPT un­der­stands, but largely does not gen­er­ate Span­glish (and other code-mixed) text

Milan W23 Dec 2022 17:40 UTC
15 points
4 comments4 min readLW link

On the Im­por­tance of Open Sourc­ing Re­ward Models

elandgre2 Jan 2023 19:01 UTC
18 points
5 comments6 min readLW link

OpenAI’s Align­ment Plan is not S.M.A.R.T.

Søren Elverlin18 Jan 2023 6:39 UTC
9 points
19 comments4 min readLW link

NYT: Google will “re­cal­ibrate” the risk of re­leas­ing AI due to com­pe­ti­tion with OpenAI

Michael Huang22 Jan 2023 8:38 UTC
47 points
2 comments1 min readLW link
(www.nytimes.com)

Microsoft and OpenAI, stop tel­ling chat­bots to role­play as AI

hold_my_fish17 Feb 2023 19:55 UTC
49 points
10 comments1 min readLW link

GPT-4 Predictions

Stephen McAleese17 Feb 2023 23:20 UTC
109 points
27 comments11 min readLW link

Syd­ney the Bin­gena­tor Can’t Think, But It Still Threat­ens People

Valentin Baltadzhiev20 Feb 2023 18:37 UTC
−3 points
2 comments8 min readLW link

[Question] In­ject­ing noise to GPT to get mul­ti­ple answers

bipolo22 Feb 2023 20:02 UTC
1 point
1 comment1 min readLW link

GPT-4

nz14 Mar 2023 17:02 UTC
150 points
149 comments1 min readLW link
(openai.com)

Why not just boy­cott LLMs?

lmbp15 Mar 2023 17:55 UTC
11 points
5 comments3 min readLW link

Nyarlathotep Stirs: A Meta-Nar­ra­tive ChatGPT Story

Charlie Sanders20 Mar 2023 8:00 UTC
4 points
2 comments12 min readLW link
(dailymicrofiction.substack.com)
No comments.