Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Vivek Hebbar
Karma:
1,104
All
Posts
Comments
New
Top
Old
How can we solve diffuse threats like research sabotage with AI control?
Vivek Hebbar
Apr 30, 2025, 7:23 PM
43
points
0
comments
8
min read
LW
link
How training-gamers might function (and win)
Vivek Hebbar
Apr 11, 2025, 9:26 PM
105
points
5
comments
13
min read
LW
link
Different senses in which two AIs can be “the same”
Vivek Hebbar
and
Buck
Jun 24, 2024, 3:16 AM
69
points
2
comments
4
min read
LW
link
Thomas Kwa’s MIRI research experience
Thomas Kwa
,
peterbarnett
,
Vivek Hebbar
,
Jeremy Gillen
,
Bird Concept
and
Raemon
Oct 2, 2023, 4:42 PM
173
points
53
comments
1
min read
LW
link
Infinite-width MLPs as an “ensemble prior”
Vivek Hebbar
May 12, 2023, 11:45 AM
46
points
0
comments
5
min read
LW
link
[Question]
Is EDT correct? Does “EDT” == “logical EDT” == “logical CDT”?
Vivek Hebbar
May 8, 2023, 2:07 AM
13
points
2
comments
1
min read
LW
link
Vivek Hebbar’s Shortform
Vivek Hebbar
Nov 24, 2022, 2:57 AM
4
points
5
comments
LW
link
Path dependence in ML inductive biases
Vivek Hebbar
and
evhub
Sep 10, 2022, 1:38 AM
68
points
13
comments
10
min read
LW
link
Hessian and Basin volume
Vivek Hebbar
Jul 10, 2022, 6:59 AM
35
points
10
comments
4
min read
LW
link
[Short version] Information Loss --> Basin flatness
Vivek Hebbar
May 21, 2022, 12:59 PM
12
points
0
comments
1
min read
LW
link
Information Loss --> Basin flatness
Vivek Hebbar
May 21, 2022, 12:58 PM
62
points
31
comments
7
min read
LW
link
Org announcement: [AC]RC
Vivek Hebbar
Apr 17, 2022, 5:24 PM
82
points
11
comments
1
min read
LW
link
[Question]
When people ask for your P(doom), do you give them your inside view or your betting odds?
Vivek Hebbar
Mar 26, 2022, 11:08 PM
11
points
11
comments
1
min read
LW
link
Transformer inductive biases & RASP
Vivek Hebbar
Feb 24, 2022, 12:42 AM
15
points
4
comments
1
min read
LW
link
(proceedings.mlr.press)
[Question]
Favorite / most obscure research on understanding DNNs?
Vivek Hebbar
Feb 21, 2022, 5:49 AM
16
points
1
comment
1
min read
LW
link
How complex are myopic imitators?
Vivek Hebbar
Feb 8, 2022, 12:00 PM
26
points
1
comment
15
min read
LW
link
Back to top
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel