Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Chris_Leong
Karma:
7,016
All
Posts
Comments
New
Top
Old
Page
1
Linkpost: “Imagining and building wise machines: The centrality of AI metacognition” by Johnson, Karimi, Bengio, et al.
Chris_Leong
11 Nov 2024 16:13 UTC
25
points
6
comments
1
min read
LW
link
(arxiv.org)
Some Preliminary Notes on the Promise of a Wisdom Explosion
Chris_Leong
31 Oct 2024 9:21 UTC
2
points
0
comments
1
min read
LW
link
(aiimpacts.org)
Linkpost: Hypocrisy standoff
Chris_Leong
29 Sep 2024 14:27 UTC
5
points
1
comment
1
min read
LW
link
(x.com)
On the destruction of America’s best high school
Chris_Leong
12 Sep 2024 15:30 UTC
−6
points
7
comments
1
min read
LW
link
(scottaaronson.blog)
The Bar for Contributing to AI Safety is Lower than You Think
Chris_Leong
16 Aug 2024 15:20 UTC
20
points
1
comment
2
min read
LW
link
Michael Streamlines on Buddhism
Chris_Leong
9 Aug 2024 4:44 UTC
8
points
0
comments
1
min read
LW
link
(x.com)
[Question]
Have people given up on iterated distillation and amplification?
Chris_Leong
19 Jul 2024 12:23 UTC
20
points
1
comment
1
min read
LW
link
Politics is the mind-killer, but maybe we should talk about it anyway
Chris_Leong
3 Jun 2024 6:37 UTC
14
points
33
comments
3
min read
LW
link
[Question]
Does reducing the amount of RL for a given capability level make AI safer?
Chris_Leong
5 May 2024 17:04 UTC
43
points
22
comments
1
min read
LW
link
Link: Let’s Think Dot by Dot: Hidden Computation in Transformer Language Models by Jacob Pfau, William Merrill & Samuel R. Bowman
Chris_Leong
27 Apr 2024 13:22 UTC
12
points
0
comments
1
min read
LW
link
(twitter.com)
“You’re the most beautiful girl in the world” and Wittgensteinian Language Games
Chris_Leong
20 Apr 2024 14:54 UTC
5
points
18
comments
1
min read
LW
link
The argument for near-term human disempowerment through AI
Chris_Leong
16 Apr 2024 4:50 UTC
21
points
2
comments
1
min read
LW
link
(link.springer.com)
Reverse Regulatory Capture
Chris_Leong
11 Apr 2024 2:40 UTC
12
points
3
comments
1
min read
LW
link
On the Confusion between Inner and Outer Misalignment
Chris_Leong
25 Mar 2024 11:59 UTC
17
points
10
comments
1
min read
LW
link
The Best Essay (Paul Graham)
Chris_Leong
11 Mar 2024 19:25 UTC
25
points
2
comments
1
min read
LW
link
(paulgraham.com)
[Question]
Can we get an AI to “do our alignment homework for us”?
Chris_Leong
26 Feb 2024 7:56 UTC
53
points
33
comments
1
min read
LW
link
[Question]
What’s the theory of impact for activation vectors?
Chris_Leong
11 Feb 2024 7:34 UTC
57
points
12
comments
1
min read
LW
link
Notice When People Are Directionally Correct
Chris_Leong
14 Jan 2024 14:12 UTC
129
points
8
comments
2
min read
LW
link
Are Metaculus AI Timelines Inconsistent?
Chris_Leong
2 Jan 2024 6:47 UTC
16
points
7
comments
2
min read
LW
link
Random Musings on Theory of Impact for Activation Vectors
Chris_Leong
7 Dec 2023 13:07 UTC
8
points
0
comments
1
min read
LW
link
Back to top
Next