paulfchristiano comments on Counterfactual Oracles = online supervised learning with random selection of training episodes

paulfchristiano 10 Sep 2019 15:28 UTC
LW: 7 AF: 4
AF
Episodic learning algorithms will still penalize this behavior if it appears on the training distribution, so it seems reasonable to call this an inner alignment problem.
- Ofer 10 Sep 2019 18:52 UTC
  LW: 1 AF: 1
  AF Parent
  Ah, I agree (edited my comment above accordingly).