MIRI’s reading list on corrigbility seems out dated, and I can’t find a centralised list Does anyone have, or know of, one?
As a side note, has MIRI stopped updating their reading list? It seems like that’s the case.
EDIT:
Links given in the comment section to do with corrigibility. I’ll try and update this with some summaries as I read them.
https://www.greaterwrong.com/posts/5bd75cc58225bf0670375041/a-first-look-at-the-hard-problem-of-corrigibility
https://arbital.com/p/corrigibility/
https://arbital.com/p/updated_deference/
https://arxiv.org/pdf/1611.08219.pdf
Current theme: default
Less Wrong (text)
Less Wrong (link)
Arrow keys: Next/previous image
Escape or click: Hide zoomed image
Space bar: Reset image size & position
Scroll to zoom in/out
(When zoomed in, drag to pan; double-click to close)
Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).
]
Keys shown in grey (e.g., ?) do not require any modifier keys.
?
Esc
h
f
a
m
v
c
r
q
t
u
o
,
.
/
s
n
e
;
Enter
[
\
k
i
l
=
-
0
′
1
2
3
4
5
6
7
8
9
→
↓
←
↑
Space
x
z
`
g
Question: MIRI Corrigbility Agenda
MIRI’s reading list on corrigbility seems out dated, and I can’t find a centralised list Does anyone have, or know of, one?
As a side note, has MIRI stopped updating their reading list? It seems like that’s the case.
EDIT:
Links given in the comment section to do with corrigibility. I’ll try and update this with some summaries as I read them.
https://www.greaterwrong.com/posts/5bd75cc58225bf0670375041/a-first-look-at-the-hard-problem-of-corrigibility
https://arbital.com/p/corrigibility/
https://arbital.com/p/updated_deference/
https://arxiv.org/pdf/1611.08219.pdf