Gordon Seidoh Worley comments on Imitation learning considered unsafe?

Gordon Seidoh Worley 7 Jan 2019 18:50 UTC
2 points
If I’m taking your point correctly, it seems you’re concerned about Goodharting in imitation learning. I agree, this seems a major issue, and I think people are aware of it and thinking about ways to address it.
- David Scott Krueger (formerly: capybaralet) 9 Jan 2019 18:27 UTC
  1 point
  Parent
  I don’t think I’d put it that way (although I’m not saying it’s inaccurate). See my comments RE “safety via myopia” and “inner optimizers”.