Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
RyanCarey
Karma:
1,556
All
Posts
Comments
New
Top
Old
Page
1
Reward Hacking from a Causal Perspective
tom4everitt
,
Francis Rhys Ward
,
sbenthall
,
James Fox
,
mattmacdermott
and
RyanCarey
21 Jul 2023 18:27 UTC
29
points
6
comments
7
min read
LW
link
Incentives from a causal perspective
tom4everitt
,
James Fox
,
RyanCarey
,
mattmacdermott
,
sbenthall
and
Jonathan Richens
10 Jul 2023 17:16 UTC
27
points
0
comments
6
min read
LW
link
Causality: A Brief Introduction
tom4everitt
,
Lewis Hammond
,
Jonathan Richens
,
Francis Rhys Ward
,
RyanCarey
,
sbenthall
and
James Fox
20 Jun 2023 15:01 UTC
49
points
18
comments
6
min read
LW
link
Introduction to Towards Causal Foundations of Safe AGI
tom4everitt
,
Lewis Hammond
,
Francis Rhys Ward
,
RyanCarey
,
James Fox
,
mattmacdermott
and
sbenthall
12 Jun 2023 17:55 UTC
67
points
6
comments
4
min read
LW
link
Survey re AIS/LTism office in NYC
RyanCarey
20 Jun 2022 19:21 UTC
7
points
0
comments
1
min read
LW
link
[Question]
Mechanism design / queueing theory for government to sell visas
RyanCarey
19 Feb 2022 12:33 UTC
5
points
11
comments
1
min read
LW
link
[Question]
Problems with using approval voting to elect to a multi-individual body?
RyanCarey
28 Sep 2021 11:03 UTC
8
points
13
comments
1
min read
LW
link
RyanCarey’s Shortform
RyanCarey
24 Jan 2021 11:32 UTC
6
points
6
comments
1
min read
LW
link
New paper: The Incentives that Shape Behaviour
RyanCarey
23 Jan 2020 19:07 UTC
23
points
5
comments
1
min read
LW
link
(arxiv.org)
[Question]
What are some good examples of incorrigibility?
RyanCarey
28 Apr 2019 0:22 UTC
23
points
17
comments
1
min read
LW
link
When to use quantilization
RyanCarey
5 Feb 2019 17:17 UTC
65
points
5
comments
4
min read
LW
link
Addressing three problems with counterfactual corrigibility: bad bets, defending against backstops, and overconfidence.
RyanCarey
21 Oct 2018 12:03 UTC
23
points
1
comment
6
min read
LW
link
USA v Progressive 1979 excerpt
RyanCarey
27 Nov 2017 17:32 UTC
22
points
2
comments
2
min read
LW
link
A combined analysis of genetically correlated traits identifies 107 loci associated with intelligence | bioRxiv
RyanCarey
18 Jul 2017 6:30 UTC
4
points
1
comment
1
min read
LW
link
(www.biorxiv.org)
Equilibria in adversarial supervised learning
RyanCarey
3 May 2017 8:14 UTC
5
points
1
comment
3
min read
LW
link
Call for Special Issue on Superintelligence—Informatica
RyanCarey
3 May 2017 5:08 UTC
5
points
0
comments
1
min read
LW
link
(www.informatica.si)
Online Learning 3: Adversarial bandit learning with catastrophes
RyanCarey
14 Nov 2016 22:58 UTC
3
points
0
comments
10
min read
LW
link
Online Learning 2: Bandit learning with catastrophes
RyanCarey
29 Oct 2016 16:53 UTC
0
points
5
comments
4
min read
LW
link
Online Learning 1: Bias-detecting online learners
RyanCarey
29 Oct 2016 16:45 UTC
6
points
7
comments
3
min read
LW
link
Improving long-run civilisational robustness
RyanCarey
10 May 2016 11:15 UTC
14
points
43
comments
3
min read
LW
link
Back to top
Next