All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024

All Jan FebMarApr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 192021 22 23 24 25 26 27 28 29 30 31

Instantiating an agent with GPT-4 and text-davinci-003

Max H19 Mar 2023 23:57 UTC

13 points

3 comments32 min readLW link

Can This Idea Dramatically Improve Effective Vegan Activism?

NothingIsArt19 Mar 2023 23:39 UTC

−5 points

1 comment1 min readLW link

Value Pluralism and AI

Göran Crafte19 Mar 2023 23:38 UTC

8 points

4 comments2 min readLW link

Tabooing “Frame Control”

Raemon19 Mar 2023 23:33 UTC

66 points

41 comments10 min readLW link

High Status Eschews Quantification of Performance

niplav19 Mar 2023 22:14 UTC

126 points

36 comments5 min readLW link

The Hidden Complexity of Thought

Isaac King19 Mar 2023 21:59 UTC

15 points

3 comments3 min readLW link

(outsidetheasylum.blog)

[Question] “Wide” vs “Tall” superintelligence

Templarrr19 Mar 2023 19:23 UTC

15 points

8 comments1 min readLW link

Humanity’s Lack of Unity Will Lead to AGI Catastrophe

MiguelDev19 Mar 2023 19:18 UTC

3 points

2 comments4 min readLW link

Probabilistic Payor Lemma?

abramdemski19 Mar 2023 17:57 UTC

69 points

7 comments4 min readLW link

AGI is uncontrollable, alignment is impossible

Donatas Lučiūnas19 Mar 2023 17:49 UTC

−12 points

21 comments1 min readLW link

Playbook for the Great Divergence

intellectronica19 Mar 2023 17:42 UTC

14 points

0 comments3 min readLW link

(www.intellectronica.net)

How AI could workaround goals if rated by people

ProgramCrafter19 Mar 2023 15:51 UTC

1 point

1 comment1 min readLW link

[Question] GPT-4 and ASCII Images?

carterallen19 Mar 2023 15:46 UTC

10 points

17 comments1 min readLW link

A tension between two prosaic alignment subgoals

Alex Lawsen 19 Mar 2023 14:07 UTC

31 points

8 comments1 min readLW link

Shell games

TsviBT19 Mar 2023 10:43 UTC

85 points

8 comments4 min readLW link

Self-censorship is probably bad for epistemology. Maybe we should figure out a way to avoid it?

DaemonicSigil19 Mar 2023 9:04 UTC

6 points

1 comment3 min readLW link

Mahler 6 at the San Francisco Symphony

yakimoff19 Mar 2023 4:06 UTC

1 point

0 comments1 min readLW link

Feature proposal: integrate LessWrong with ChatGPT to promote active reading

DirectedEvolution19 Mar 2023 3:41 UTC

10 points

4 comments1 min readLW link

Against Deep Ideas

YafahEdelman19 Mar 2023 3:04 UTC

53 points

14 comments2 min readLW link

More information about the dangerous capability evaluations we did with GPT-4 and Claude.

Beth Barnes19 Mar 2023 0:25 UTC

233 points

54 comments8 min readLW link

(evals.alignment.org)

Cryonics companies should let people make conditions for reawakening

Andrew Vlahos18 Mar 2023 21:03 UTC

10 points

11 comments4 min readLW link

“Publish or Perish” (a quick note on why you should try to make your work legible to existing academic communities)

David Scott Krueger (formerly: capybaralet)18 Mar 2023 19:01 UTC

99 points

49 comments1 min readLW link 1 review

Dan Luu on “You can only communicate one top priority”

Raemon18 Mar 2023 18:55 UTC

148 points

18 comments3 min readLW link

(twitter.com)

An Appeal to AI Superintelligence: Reasons to Preserve Humanity

James_Miller18 Mar 2023 16:22 UTC

37 points

73 comments12 min readLW link

[Question] What did you do with GPT4?

ChristianKl18 Mar 2023 15:21 UTC

27 points

17 comments1 min readLW link

Try to solve the hard parts of the alignment problem

Mikhail Samin18 Mar 2023 14:55 UTC

54 points

33 comments5 min readLW link

Testing ChatGPT 3.5 for political biases using roleplaying prompts

twkaiser18 Mar 2023 11:42 UTC

−2 points

2 comments19 min readLW link

(hackernoon.com)

What I did to reduce the risk of Long COVID (and manage symptoms) after getting COVID

Sameerishere18 Mar 2023 5:32 UTC

11 points

3 comments10 min readLW link

(retired article) AGI With Internet Access: Why we won’t stuff the genie back in its bottle.

Max TK18 Mar 2023 3:43 UTC

5 points

10 comments4 min readLW link

St. Patty’s Day LA meetup

lc18 Mar 2023 0:00 UTC

8 points

0 comments1 min readLW link

[Question] Why Carl Jung is not popular in AI Alignment Research?

MiguelDev17 Mar 2023 23:56 UTC

−3 points

13 comments1 min readLW link

[Event] Join Metaculus for Forecast Friday on March 24th!

ChristianWilliams17 Mar 2023 22:47 UTC

3 points

0 comments1 min readLW link

Meetup Tip: The Next Meetup Will Be. . .

Screwtape17 Mar 2023 22:04 UTC

43 points

0 comments3 min readLW link

The Power of High Speed Stupidity

robotelvis17 Mar 2023 21:41 UTC

33 points

5 comments9 min readLW link

(messyprogress.substack.com)

Retrospective on ‘GPT-4 Predictions’ After the Release of GPT-4

Stephen McAleese17 Mar 2023 18:34 UTC

26 points

6 comments6 min readLW link

“Carefully Bootstrapped Alignment” is organizationally hard

Raemon17 Mar 2023 18:00 UTC

261 points

22 comments11 min readLW link

[Question] Are nested jailbreaks inevitable?

judson17 Mar 2023 17:43 UTC

1 point

0 comments1 min readLW link

Ethical AI investments?

Justin wilson17 Mar 2023 17:43 UTC

24 points

15 comments1 min readLW link

New economic system for AI era

ksme sho17 Mar 2023 17:42 UTC

−1 points

1 comment5 min readLW link

On some first principles of intelligence

Macheng_Shen17 Mar 2023 17:42 UTC

−14 points

0 comments4 min readLW link

Essential Behaviorism Terms

Rivka17 Mar 2023 17:41 UTC

15 points

1 comment10 min readLW link

Vector semantics and “Kubla Khan,” Part 2

Bill Benzon17 Mar 2023 16:32 UTC

2 points

0 comments3 min readLW link

Super-Luigi = Luigi + (Luigi—Waluigi)

Alexei17 Mar 2023 15:27 UTC

16 points

9 comments1 min readLW link

Survey on intermediate goals in AI governance

MichaelA and MaxRa

17 Mar 2023 13:12 UTC

25 points

3 comments1 min readLW link

GPT-4 solves Gary Marcus-induced flubs

JakubK17 Mar 2023 6:40 UTC

56 points

29 comments2 min readLW link

(docs.google.com)

[Question] Are the LLM “intelligence” tests publicly available for humans to take?

nim17 Mar 2023 0:09 UTC

7 points

12 comments1 min readLW link

Donation offsets for ChatGPT Plus subscriptions

Jeffrey Ladish16 Mar 2023 23:29 UTC

53 points

3 comments3 min readLW link

The algorithm isn’t doing X, it’s just doing Y.

Cleo Nardo16 Mar 2023 23:28 UTC

53 points

43 comments5 min readLW link

Announcing the ERA Cambridge Summer Research Fellowship

Nandini Shiralkar16 Mar 2023 22:57 UTC

11 points

0 comments3 min readLW link

Gradual takeoff, fast failure

Max H16 Mar 2023 22:02 UTC

15 points

4 comments5 min readLW link