TurnTrout comments on TurnTrout’s shortform feed

TurnTrout 14 Dec 2022 7:39 UTC
LW: 2 AF: 2
0
AF
It can still be robustly derived as an instrumental subgoal during general-planning/problem-solving, though?
- Garrett Baker 14 Dec 2022 8:32 UTC
  LW: 1 AF: 1
  0
  AF Parent
  This is true, but indicates a radically different stage in training in which we should find deception compared to deception being an intrinsic value. It also possibly expands the kinds of reinforcement schedules we may want to use compared to the worlds where deception crops up at the earliest opportunity (though pseudo-deception may occur, where behaviors correlated with successful deception are reinforced possibly?).
  - TurnTrout 15 Dec 2022 3:53 UTC
    LW: 2 AF: 2
    0
    AF Parent
    Oh, huh, I had cached the impression that deception would be derived, not intrinsic-value status. Interesting.