I still don’t understand the motivation. Is the hope that “what <X value learning algorithm> would infer from observing humans in some hypothetical that doesn’t actually happen” is easier to make inferences about than “what humans would do if they thought for a very long time”?