Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Lukas Fluri
Karma:
17
All
Posts
Comments
New
Top
Old
Evaluating Superhuman Models with Consistency Checks
Daniel Paleka
and
Lukas Fluri
1 Aug 2023 7:51 UTC
21
points
2
comments
9
min read
LW
link
(arxiv.org)
Open Problems in Negative Side Effect Minimization
Fabian Schimpf
and
Lukas Fluri
6 May 2022 9:37 UTC
12
points
6
comments
17
min read
LW
link
Back to top