For example, if I ask my system to produce a cartoon drawing, it doesn’t seem very notable if I get a cartoon as a result rather than a photorealistic image, even if it could have produced the latter.
Consider instead the scenario where I show a model a photo of a face, and the model produces a photo of the side of that face. An interesting question is “is there a 3D representation of the face in the model?” It could be getting the right answer that way, or it could be getting there some other way.
Similarly, when it models a ‘dumb’ character, is it calculating the right answer and then deliberately introducing an error? Or is it just doing something dumb, which incidentally turns out to be wrong?
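One way to get evidence on that first question (sketched here as an illustration, not a prescribed method): train a linear probe that tries to read a pose variable, say head yaw, out of the model’s internal activations. Everything below is hypothetical scaffolding: `activations` stands for a matrix of per-image features extracted from some layer, and `yaws` for known rotation labels on those images.

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import train_test_split

def probe_for_pose(activations: np.ndarray, yaws: np.ndarray) -> float:
    """Fit a linear probe from internal activations to head yaw.

    A high held-out R^2 is evidence (not proof) that a pose-like
    variable is linearly readable from the model's internals; a low
    score doesn't rule out a nonlinear encoding.
    """
    X_train, X_test, y_train, y_test = train_test_split(
        activations, yaws, test_size=0.2, random_state=0
    )
    probe = Ridge(alpha=1.0).fit(X_train, y_train)
    return probe.score(X_test, y_test)  # R^2 on held-out images
```

Even a positive probe result only shows the information is present, not that the model actually uses it to generate the profile view; answering that would take causal interventions on the activations.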
Like, when you look at this example:
How did it come up with 19 and 20? What would it take to make tools that could answer that question?
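I’m not sure what tooling would fully answer that, but one building block seems clear (a sketch assuming a PyTorch model; the model and inputs are placeholders): record every intermediate activation from the forward pass that produced the answer, so you can compare runs, or patch states from one run into another and see which ones the “19 and 20” actually depend on.

```python
import torch

def capture_activations(model: torch.nn.Module, inputs) -> dict:
    """Run one forward pass and record every submodule's output.

    Comparing (or swapping) these recorded states between two runs is
    the basis of activation patching, one way to localize where a
    particular answer comes from.
    """
    acts, hooks = {}, []
    for name, module in model.named_modules():
        # Default arg pins `name` at definition time for each closure.
        def save(mod, inp, out, name=name):
            acts[name] = out
        hooks.append(module.register_forward_hook(save))
    try:
        with torch.no_grad():
            model(inputs)
    finally:
        for h in hooks:
            h.remove()
    return acts
```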
This framing makes sense to me. Thanks!