Wei Dai comments on Contest: $1,000 for good questions to ask to an Oracle AI

Wei Dai 9 Aug 2019 4:34 UTC
LW: 8 AF: 3
AF
It looks like my entry is pretty close to the ideas of Human-in-the-counterfactual-loop and imitation learning and apprenticeship learning. Questions:
1. Stuart, does it count against my entry that it’s not actually a very novel idea? (If so, I might want to think about other ideas to submit.)
2. What is the exact relationship between all these ideas? What are the pros and cons of doing human imitation using this kind of counterfactual/online-learning setup, versus other training methods such as GAN (see Safe training procedures for human-imitators for one proposal)? It seems like there are lots of posts and comments about human imitations spread over LW, Arbital, Paul’s blog and maybe other places, and it would be really cool if someone (with more knowledge in this area than I do) could write a review/distillation post summarizing what we know about it so far.
What links here?
- What specific dangers arise when asking GPT-N to write an Alignment Forum post? by Matthew Barnett (28 Jul 2020 2:56 UTC; 45 points)
- Stuart_Armstrong 9 Aug 2019 17:33 UTC
  LW: 2 AF: 1
  AF Parent
  1. I encourage you to submit other ideas anyway, since your ideas are good.
  2. Not sure yet about how all these things relate; will maybe think of that more later.