Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Stuart_Armstrong comments on
In theory: does building the subagent have an “impact”?
Stuart_Armstrong
17 Feb 2020 14:15 UTC
2
points
It’s the delta of that with
Q
R
(
s
t
+
1
,
a
t
+
1
)
that is penalised, not large changes on its own.
Back to top
It’s the delta of that with QR(st+1,at+1) that is penalised, not large changes on its own.