ArchiveSequencesAbout
QuestionsEventsShortformAlignment ForumAF Comments
HomeFeaturedAllTagsRecent Comments
RSS
NewHotActiveOld
Page 2

Power Lies Trem­bling: a three-book review

Richard_NgoFeb 22, 2025, 10:57 PM
214 points
29 comments15 min readLW link
(www.mindthefuture.info)

Eliezer’s Lost Align­ment Ar­ti­cles /​ The Ar­bital Sequence

Ruby and RobertM
Feb 20, 2025, 12:48 AM
207 points
10 comments5 min readLW link

How to Make Superbabies

GeneSmith and kman
Feb 19, 2025, 8:39 PM
605 points
349 comments31 min readLW link

Levels of Friction

ZviFeb 10, 2025, 1:10 PM
148 points
8 comments12 min readLW link
(thezvi.wordpress.com)

So You Want To Make Marginal Progress...

johnswentworthFeb 7, 2025, 11:22 PM
286 points
42 comments4 min readLW link

How AI Takeover Might Hap­pen in 2 Years

joshcFeb 7, 2025, 5:10 PM
422 points
137 comments29 min readLW link
(x.com)

Some ar­ti­cles in “In­ter­na­tional Se­cu­rity” that I enjoyed

BuckJan 31, 2025, 4:23 PM
130 points
10 comments4 min readLW link

“Sharp Left Turn” dis­course: An opinionated review

Steven ByrnesJan 28, 2025, 6:47 PM
208 points
26 comments31 min readLW link

The Case Against AI Con­trol Research

johnswentworthJan 21, 2025, 4:03 PM
353 points
80 comments6 min readLW link

The Gen­tle Romance

Richard_NgoJan 19, 2025, 6:29 PM
242 points
46 comments15 min readLW link
(www.asimov.press)

Don’t ig­nore bad vibes you get from people

Kaj_SotalaJan 18, 2025, 9:20 AM
152 points
50 comments2 min readLW link
(kajsotala.fi)

What Is The Align­ment Prob­lem?

johnswentworthJan 16, 2025, 1:20 AM
180 points
50 comments25 min readLW link

How will we up­date about schem­ing?

ryan_greenblattJan 6, 2025, 8:21 PM
171 points
20 comments37 min readLW link

Re­view: Planecrash

L Rudolf LDec 27, 2024, 2:18 PM
360 points
45 comments22 min readLW link
(nosetgauge.substack.com)

A Three-Layer Model of LLM Psychology

Jan_KulveitDec 26, 2024, 4:49 PM
217 points
13 comments8 min readLW link

What Goes Without Saying

sarahconstantinDec 20, 2024, 6:00 PM
334 points
28 comments5 min readLW link
(sarahconstantin.substack.com)

When Is In­surance Worth It?

kqrDec 19, 2024, 7:07 PM
175 points
71 comments4 min readLW link
(entropicthoughts.com)

Align­ment Fak­ing in Large Lan­guage Models

ryan_greenblatt, evhub, Carson Denison, Benjamin Wright, Fabien Roger, Monte M, Sam Marks, Johannes Treutlein, Sam Bowman and Buck
Dec 18, 2024, 5:19 PM
483 points
75 comments10 min readLW link

AIs Will In­creas­ingly At­tempt Shenanigans

ZviDec 16, 2024, 3:20 PM
114 points
2 comments26 min readLW link
(thezvi.wordpress.com)

Biolog­i­cal risk from the mir­ror world

jasoncrawfordDec 12, 2024, 7:07 PM
334 points
38 comments7 min readLW link
(newsletter.rootsofprogress.org)
PreviousBack to topNext

Customize appearance

Current theme: default

Less Wrong (text)

Less Wrong (link)

Hi, I’m Bobby the Basilisk! Click on the minimize button () to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember, the changes won’t be saved until you click “OK”!)

Theme tweaker help

  