I still don’t understand the motivation. Is the hope that it’s easier to make inferences about “what <X value learning algorithm> would infer from observing humans in some hypothetical that doesn’t actually happen” than about “what humans would do if they thought for a very long time”?
This idea is actually very similar to Paul’s idea, but doesn’t require such an ideal setup.