Garrett Baker comments on TurnTrout’s shortform feed

Garrett Baker 14 Dec 2022 8:32 UTC
LW: 1 AF: 1
0
AF
This is true, but indicates a radically different stage in training in which we should find deception compared to deception being an intrinsic value. It also possibly expands the kinds of reinforcement schedules we may want to use compared to the worlds where deception crops up at the earliest opportunity (though pseudo-deception may occur, where behaviors correlated with successful deception are reinforced possibly?).
- TurnTrout 15 Dec 2022 3:53 UTC
  LW: 2 AF: 2
  0
  AF Parent
  Oh, huh, I had cached the impression that deception would be derived, not intrinsic-value status. Interesting.