Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
New
Hot
Active
Old
Page
1
Let’s Think Dot by Dot: Hidden Computation in Transformer Language Models by Jacob Pfau, William Merrill & Samuel R. Bowman
Chris_Leong
27 Apr 2024 13:22 UTC
8
points
0
comments
1
min read
LW
link
(twitter.com)
Two Vernor Vinge Book Reviews
Maxwell Tabarrok
27 Apr 2024 12:14 UTC
5
points
0
comments
2
min read
LW
link
(www.maximum-progress.com)
Refusal in LLMs is mediated by a single direction
Andy Arditi
,
Oscar Balcells Obeso
,
Aaquib111
,
wesg
and
Neel Nanda
27 Apr 2024 11:13 UTC
19
points
0
comments
9
min read
LW
link
WSJ: Thinking doesn’t have to feel so hard
trevor
27 Apr 2024 10:14 UTC
8
points
0
comments
3
min read
LW
link
(www.wsj.com)
[Question]
Plausibility of Getting Early Warning Shots because AIs can’t coordinate?
hmys
27 Apr 2024 8:02 UTC
5
points
0
comments
1
min read
LW
link
AI Safety Sphere
Myles H
27 Apr 2024 1:49 UTC
−3
points
0
comments
3
min read
LW
link
Exploring the Esoteric Pathways to AI Sentience (Part One)
jeffreycaruso
27 Apr 2024 1:02 UTC
−11
points
2
comments
2
min read
LW
link
Superposition is not “just” neuron polysemanticity
LawrenceC
26 Apr 2024 23:22 UTC
25
points
0
comments
13
min read
LW
link
D&D.Sci Long War: Defender of Data-mocracy
aphyer
26 Apr 2024 22:30 UTC
34
points
1
comment
3
min read
LW
link
On Not Pulling The Ladder Up Behind You
Screwtape
26 Apr 2024 21:58 UTC
57
points
3
comments
9
min read
LW
link
We are headed into an extreme compute overhang
devrandom
26 Apr 2024 21:38 UTC
23
points
8
comments
2
min read
LW
link
[Concept Dependency] Edge Regular Lattice Graph
Johannes C. Mayer
26 Apr 2024 21:14 UTC
5
points
0
comments
1
min read
LW
link
[Concept Dependency] Concept Dependency Posts
Johannes C. Mayer
26 Apr 2024 20:57 UTC
8
points
2
comments
2
min read
LW
link
Argumentation and College Admissions
Michael Michalchik
26 Apr 2024 20:52 UTC
1
point
0
comments
2
min read
LW
link
[Question]
Wouldn’t weak AI agents provide warning?
Mandatory Topic
26 Apr 2024 19:34 UTC
5
points
0
comments
1
min read
LW
link
Duct Tape security
Isaac King
26 Apr 2024 18:57 UTC
66
points
7
comments
5
min read
LW
link
Fundamental Uncertainty: Chapter 8 - When does fundamental uncertainty matter?
Gordon Seidoh Worley
26 Apr 2024 18:10 UTC
9
points
2
comments
32
min read
LW
link
Scaling of AI training runs will slow down after GPT-5
Maxime Riché
26 Apr 2024 16:05 UTC
32
points
5
comments
3
min read
LW
link
Spatial attention as a “tell” for empathetic simulation?
Steven Byrnes
26 Apr 2024 15:10 UTC
49
points
7
comments
8
min read
LW
link
Arch-anarchy
Peter lawless
26 Apr 2024 15:05 UTC
−1
points
1
comment
25
min read
LW
link
Back to top
Next