Esben Kran
Karma: 533
Catastrophic Cyber Capabilities Benchmark (3CB): Robustly Evaluating LLM Agent Cyber Offense Capabilities
Jonathan N, abra, Connor Axiotes and Esben Kran
Nov 5, 2024, 1:01 AM | 8 points | 0 comments | 6 min read | (www.apartresearch.com)
Can startups be impactful in AI safety?
Esben Kran and Archana Vaidheeswaran
Sep 13, 2024, 7:00 PM | 15 points | 0 comments | 6 min read
Finding Deception in Language Models
Esben Kran and Archana Vaidheeswaran
Aug 20, 2024, 9:42 AM | 20 points | 4 comments | 4 min read
Results from the AI x Democracy Research Sprint
Esben Kran, jordine and Jason Hoelscher-Obermaier
Jun 14, 2024, 4:40 PM | 13 points | 0 comments | 6 min read
Demonstrate and evaluate risks from AI to society at the AI x Democracy research hackathon
Esben Kran
Apr 19, 2024, 2:46 PM | 5 points | 0 comments | (www.apartresearch.com)
Join the AI Evaluation Tasks Bounty Hackathon
Esben Kran
Mar 18, 2024, 8:15 AM | 12 points | 1 comment
Multi-Agent Security Hackathon
Esben Kran, Jason Hoelscher-Obermaier and Clement Neo
Feb 5, 2024, 10:51 PM | 6 points | 0 comments | 1 min read
Identifying semantic neurons, mechanistic circuits & interpretability web apps
Esben Kran and Neel Nanda
Apr 13, 2023, 11:59 AM | 18 points | 0 comments | 8 min read
Announcing the European Network for AI Safety (ENAIS)
Esben Kran
Mar 22, 2023, 5:57 PM | 19 points | 0 comments
Automated Sandwiching & Quantifying Human-LLM Cooperation: ScaleOversight hackathon results
Esben Kran, Fazl, Sabrina Zaki, gabrielrecc and rz2383
Feb 23, 2023, 10:48 AM | 8 points | 0 comments | 6 min read
Generalizability & Hope for AI [MLAISU W03]
Esben Kran
Jan 20, 2023, 10:06 AM | 5 points | 2 comments | 2 min read | (newsletter.apartresearch.com)
Robustness & Evolution [MLAISU W02]
Esben Kran
Jan 13, 2023, 3:47 PM | 10 points | 0 comments | 3 min read | (newsletter.apartresearch.com)
AI improving AI [MLAISU W01!]
Esben Kran
Jan 6, 2023, 11:13 AM | 5 points | 0 comments | 4 min read | (newsletter.apartresearch.com)
Results from the AI testing hackathon
Esben Kran
Jan 2, 2023, 3:46 PM | 13 points | 0 comments
Will Machines Ever Rule the World? MLAISU W50
Esben Kran
Dec 16, 2022, 11:03 AM | 12 points | 7 comments | 4 min read | (newsletter.apartresearch.com)
Join the AI Testing Hackathon this Friday
Esben Kran
Dec 12, 2022, 2:24 PM | 10 points | 0 comments
ML Safety at NeurIPS & Paradigmatic AI Safety? MLAISU W49
Esben Kran and Steinthal
Dec 9, 2022, 10:38 AM | 19 points | 0 comments | 4 min read | (newsletter.apartresearch.com)
NeurIPS Safety & ChatGPT. MLAISU W48
Esben Kran and Steinthal
Dec 2, 2022, 3:50 PM | 3 points | 0 comments | 4 min read | (newsletter.apartresearch.com)
Results from the interpretability hackathon
Esben Kran and Neel Nanda
Nov 17, 2022, 2:51 PM | 81 points | 0 comments | 6 min read | (alignmentjam.com)
[Book] Interpretable Machine Learning: A Guide for Making Black Box Models Explainable
Esben Kran
Oct 31, 2022, 11:38 AM | 20 points | 1 comment | 1 min read | (christophm.github.io)