LED Brain Stimulation for Productivity
Simon Berens · 20 Mar 2023 22:30 UTC · 13 points · 6 comments · 1 min read · (news.ycombinator.com)

Remarks 1–18 on GPT (compressed)
Cleo Nardo · 20 Mar 2023 22:27 UTC · 148 points · 35 comments · 31 min read

[Question] What does pulling the fire alarm look like?
nem · 20 Mar 2023 21:45 UTC · 2 points · 0 comments · 1 min read

Exploring GPT4’s world model
hippke · 20 Mar 2023 21:31 UTC · −5 points · 5 comments · 2 min read

The Wizard of Oz Problem: How incentives and narratives can skew our perception of AI developments
Akash · 20 Mar 2023 20:44 UTC · 16 points · 3 comments · 6 min read

SSC/ACX Meetups Everywhere Spring Bordeaux 22 April 17:00 local
vi21maobk9vp · 20 Mar 2023 19:03 UTC · 5 points · 3 comments · 1 min read

AGI will know: Humans are not Rational
HumaneAutomation · 20 Mar 2023 18:46 UTC · 0 points · 10 comments · 2 min read

AI and the Map of Your Mind: Pattern Recognition
Scott Broock · 20 Mar 2023 17:43 UTC · 2 points · 2 comments · 6 min read

The dreams of GPT-4
RomanS · 20 Mar 2023 17:00 UTC · 14 points · 7 comments · 9 min read

RLHF does not appear to differentially cause mode-collapse
Arthur Conmy and beren · 20 Mar 2023 15:39 UTC · 95 points · 9 comments · 3 min read

[Question] Avoiding “enlightenment” experiences while meditating for anxiety?
wunan · 20 Mar 2023 13:03 UTC · 17 points · 6 comments · 1 min read

Will people be motivated to learn difficult disciplines and skills without economic incentive?
Roman Leventov · 20 Mar 2023 9:26 UTC · 18 points · 33 comments · 5 min read

What does it mean for an LLM such as GPT to be aligned / good / positive impact?
PashaKamyshev · 20 Mar 2023 9:21 UTC · 4 points · 3 comments · 10 min read

Nyarlathotep Stirs: A Meta-Narrative ChatGPT Story
Charlie Sanders · 20 Mar 2023 8:00 UTC · 4 points · 2 comments · 12 min read · (dailymicrofiction.substack.com)

Let’s make the truth easier to find
DPiepgrass · 20 Mar 2023 4:28 UTC · 29 points · 44 comments · 1 min read

EA & LW Forum Weekly Summary (13th − 19th March 2023)
Zoe Williams · 20 Mar 2023 4:18 UTC · 13 points · 0 comments · 1 min read

Are COVID lab leak and market origin theories incompatible?
Anon User · 20 Mar 2023 1:44 UTC · 15 points · 6 comments · 1 min read

[Question] What do “attractor dynamics” refer to in the context of social structures?
JavierCC · 20 Mar 2023 1:39 UTC · 2 points · 2 comments · 1 min read

The Natural State is Goodhart
devansh · 20 Mar 2023 0:00 UTC · 59 points · 4 comments · 2 min read

Instantiating an agent with GPT-4 and text-davinci-003
Max H · 19 Mar 2023 23:57 UTC · 13 points · 3 comments · 32 min read

Can This Idea Dramatically Improve Effective Vegan Activism?
NothingIsArt · 19 Mar 2023 23:39 UTC · −5 points · 1 comment · 1 min read

Value Pluralism and AI
Göran Crafte · 19 Mar 2023 23:38 UTC · 8 points · 4 comments · 2 min read

Tabooing “Frame Control”
Raemon · 19 Mar 2023 23:33 UTC · 66 points · 41 comments · 10 min read

High Status Eschews Quantification of Performance
niplav · 19 Mar 2023 22:14 UTC · 127 points · 36 comments · 5 min read

The Hidden Complexity of Thought
Isaac King · 19 Mar 2023 21:59 UTC · 15 points · 3 comments · 3 min read · (outsidetheasylum.blog)

[Question] “Wide” vs “Tall” superintelligence
Templarrr · 19 Mar 2023 19:23 UTC · 15 points · 8 comments · 1 min read

Humanity’s Lack of Unity Will Lead to AGI Catastrophe
MiguelDev · 19 Mar 2023 19:18 UTC · 3 points · 2 comments · 4 min read

Probabilistic Payor Lemma?
abramdemski · 19 Mar 2023 17:57 UTC · 69 points · 7 comments · 4 min read

AGI is uncontrollable, alignment is impossible
Donatas Lučiūnas · 19 Mar 2023 17:49 UTC · −12 points · 21 comments · 1 min read

Playbook for the Great Divergence
intellectronica · 19 Mar 2023 17:42 UTC · 14 points · 0 comments · 3 min read · (www.intellectronica.net)

How AI could workaround goals if rated by people
ProgramCrafter · 19 Mar 2023 15:51 UTC · 1 point · 1 comment · 1 min read

[Question] GPT-4 and ASCII Images?
carterallen · 19 Mar 2023 15:46 UTC · 10 points · 17 comments · 1 min read

A tension between two prosaic alignment subgoals
Alex Lawsen · 19 Mar 2023 14:07 UTC · 31 points · 8 comments · 1 min read

Shell games
TsviBT · 19 Mar 2023 10:43 UTC · 91 points · 9 comments · 4 min read · 1 review

Self-censorship is probably bad for epistemology. Maybe we should figure out a way to avoid it?
DaemonicSigil · 19 Mar 2023 9:04 UTC · 6 points · 1 comment · 3 min read

Mahler 6 at the San Francisco Symphony
yakimoff · 19 Mar 2023 4:06 UTC · 1 point · 0 comments · 1 min read

Feature proposal: integrate LessWrong with ChatGPT to promote active reading
DirectedEvolution · 19 Mar 2023 3:41 UTC · 10 points · 4 comments · 1 min read

Against Deep Ideas
YafahEdelman · 19 Mar 2023 3:04 UTC · 53 points · 14 comments · 2 min read

More information about the dangerous capability evaluations we did with GPT-4 and Claude.
Beth Barnes · 19 Mar 2023 0:25 UTC · 233 points · 54 comments · 8 min read · (evals.alignment.org)

Cryonics companies should let people make conditions for reawakening
Andrew Vlahos · 18 Mar 2023 21:03 UTC · 10 points · 11 comments · 4 min read

“Publish or Perish” (a quick note on why you should try to make your work legible to existing academic communities)
David Scott Krueger (formerly: capybaralet) · 18 Mar 2023 19:01 UTC · 99 points · 49 comments · 1 min read · 1 review

Dan Luu on “You can only communicate one top priority”
Raemon · 18 Mar 2023 18:55 UTC · 148 points · 18 comments · 3 min read · (twitter.com)

An Appeal to AI Superintelligence: Reasons to Preserve Humanity
James_Miller · 18 Mar 2023 16:22 UTC · 37 points · 73 comments · 12 min read

[Question] What did you do with GPT4?
ChristianKl · 18 Mar 2023 15:21 UTC · 27 points · 17 comments · 1 min read

Try to solve the hard parts of the alignment problem
Mikhail Samin · 18 Mar 2023 14:55 UTC · 54 points · 33 comments · 5 min read

Testing ChatGPT 3.5 for political biases using roleplaying prompts
twkaiser · 18 Mar 2023 11:42 UTC · −2 points · 2 comments · 19 min read · (hackernoon.com)

What I did to reduce the risk of Long COVID (and manage symptoms) after getting COVID
Sameerishere · 18 Mar 2023 5:32 UTC · 11 points · 3 comments · 10 min read

(retired article) AGI With Internet Access: Why we won’t stuff the genie back in its bottle.
Max TK · 18 Mar 2023 3:43 UTC · 5 points · 10 comments · 4 min read

St. Patty’s Day LA meetup
lc · 18 Mar 2023 0:00 UTC · 8 points · 0 comments · 1 min read

[Question] Why Carl Jung is not popular in AI Alignment Research?
MiguelDev · 17 Mar 2023 23:56 UTC · −3 points · 13 comments · 1 min read