janus
Karma: 3,357
How LLMs are and are not myopic
janus | Jul 25, 2023, 2:19 AM | 134 points | 16 comments | 8 min read
[Simulators seminar sequence] #2 Semiotic physics—revamped
Jan, Charlie Steiner, Logan Riggs, janus, jacquesthibs, metasemi, Michael Oesterle, Lucas Teixeira, peligrietzer and remember | Feb 27, 2023, 12:25 AM | 24 points | 23 comments | 13 min read
Cyborgism
NicholasKees and janus | Feb 10, 2023, 2:47 PM | 336 points | 46 comments | 35 min read | 2 reviews
Anomalous tokens reveal the original identities of Instruct models
janus and jdp | Feb 9, 2023, 1:30 AM | 139 points | 16 comments | 9 min read | (generative.ink)
Gradient Filtering
Jozdien and janus | Jan 18, 2023, 8:09 PM | 55 points | 16 comments | 13 min read
Language Ex Machina
janus | Jan 15, 2023, 9:19 AM | 41 points | 23 comments | 24 min read | (generative.ink)
Simulacra are Things
janus | Jan 8, 2023, 11:03 PM | 63 points | 7 comments | 2 min read
[Simulators seminar sequence] #1 Background & shared assumptions
Jan, Charlie Steiner, Logan Riggs, janus, jacquesthibs, metasemi, Michael Oesterle, Lucas Teixeira, peligrietzer and remember | Jan 2, 2023, 11:48 PM | 50 points | 4 comments | 3 min read
Results from a survey on tool use and workflows in alignment research
jacquesthibs, Jan, janus and Logan Riggs | Dec 19, 2022, 3:19 PM | 79 points | 2 comments | 19 min read
Searching for Search
NicholasKees and janus | Nov 28, 2022, 3:31 PM | 94 points | 9 comments | 14 min read | 1 review
Update to Mysteries of mode collapse: text-davinci-002 not RLHF
janus | Nov 19, 2022, 11:51 PM | 71 points | 8 comments | 2 min read
[simulation] 4chan user claiming to be the attorney hired by Google’s sentient chatbot LaMDA shares wild details of encounter
janus | Nov 10, 2022, 9:39 PM | 19 points | 1 comment | 13 min read | (generative.ink)
Mysteries of mode collapse
janus | Nov 8, 2022, 10:37 AM | 284 points | 57 comments | 14 min read | 1 review
Simulators
janus | Sep 2, 2022, 12:45 PM | 618 points | 168 comments | 41 min read | 8 reviews | (generative.ink)
A descriptive, not prescriptive, overview of current AI Alignment Research
Jan, Logan Riggs, jacquesthibs and janus | Jun 6, 2022, 9:59 PM | 139 points | 21 comments | 7 min read
A survey of tool use and workflows in alignment research
Logan Riggs, Jan, janus and jacquesthibs | Mar 23, 2022, 11:44 PM | 45 points | 4 comments | 1 min read