“Decision Transformer” (Tool AIs are secret Agent AIs)

gwern9 Jun 2021 1:06 UTC

LW: 37 AF: 17

4 comments1 min readLW link

AI Machine Learning (ML)

Link post

What links here?

gwern's comment on Ngo and Yudkowsky on alignment difficulty by Eliezer Yudkowsky (17 Nov 2021 2:15 UTC; 5 points)

John Schulman 9 Jun 2021 15:46 UTC
LW: 11 AF: 7
0
AF
Basically agree—I think that a model trained by maximum likelihood on offline data is less goal-directed than one that’s trained by an iterative process where you reinforce its own samples (aka online RL), but still somewhat goal directed. It needs to simulate a goal-directed agent to do a good job at maximum likelihood. OTOH it’s mostly concerned with covering all possibilities, so the goal directed reasoning isn’t emphasized. But with multiple iterations, the model can improve quality (-> more goal directedness) at the expense of coverage/diversity.
gwern 9 Jun 2021 1:07 UTC
LW: 8 AF: 2
0
AF
Rewards need not be written in natural language as crudely as “REWARD: +10 UTILONS”. Something to think about as you continue to write text online.

And what of the dead? I own that I thought of myself, at times, almost as dead. Are they not locked below ground in chambers smaller than mine was, in their millions of millions? There is no category of human activity in which the dead do not outnumber the living many times over. Most beautiful children are dead. Most soldiers, most cowards. The fairest women and the most learned men – all are dead. Their bodies repose in caskets, in sarcophagi, beneath arches of rude stone, everywhere under the earth. Their spirits haunt our minds, ears pressed to the bones of our foreheads. Who can say how intently they listen as we speak, or for what word?
evhub 9 Jun 2021 2:03 UTC
LW: 6 AF: 4
0
AF
(Moderation note: added to the Alignment Forum from LessWrong.)
mtaran 10 Jun 2021 5:36 UTC
1 point
0
Nice video reviewing this paper at https://youtu.be/-buULmf7dec

In my experience it’s reasonably easy to listen to such videos while doing chores etc.