For example, if I ask my system to produce a cartoon drawing, it doesn’t seem very notable if I get a cartoon as a result rather than a photorealistic image, even if it could have produced the latter.
Consider instead the scenario where I show a model a photo of a face, and the model produces a photo of the side of that face. An interesting question is “is there a 3D representation of the face in the model?” It could be getting the right answer that way, or it could be getting there some other way.
Similarly, when it models a ‘dumb’ character, is it calculating the right answer and then deliberately introducing an error? Or is it just doing something dumb, which incidentally turns out to be wrong?
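One way to get evidence on that first question (sketched here as an illustration, not a prescribed method): train a linear probe that tries to read a pose variable, say head yaw, out of the model’s internal activations. Everything below is hypothetical scaffolding: `activations` stands for a matrix of per-image features extracted from some layer, and `yaws` for known rotation labels on those images.

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import train_test_split

def probe_for_pose(activations: np.ndarray, yaws: np.ndarray) -> float:
    """Fit a linear probe from internal activations to head yaw.

    A high held-out R^2 is evidence (not proof) that a pose-like
    variable is linearly readable from the model's internals; a low
    score doesn't rule out a nonlinear encoding.
    """
    X_train, X_test, y_train, y_test = train_test_split(
        activations, yaws, test_size=0.2, random_state=0
    )
    probe = Ridge(alpha=1.0).fit(X_train, y_train)
    return probe.score(X_test, y_test)  # R^2 on held-out images
```

Even a positive probe result only shows the information is present, not that the model actually uses it to generate the profile view; answering that would take causal interventions on the activations.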
Like, when you look at this example:
How did it come up with 19 and 20? What would it take to make tools that could answer that question?
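I’m not sure what tooling would fully answer that, but one building block seems clear (a sketch assuming a PyTorch model; the model and inputs are placeholders): record every intermediate activation from the forward pass that produced the answer, so you can compare runs, or patch states from one run into another and see which ones the “19 and 20” actually depend on.

```python
import torch

def capture_activations(model: torch.nn.Module, inputs) -> dict:
    """Run one forward pass and record every submodule's output.

    Comparing (or swapping) these recorded states between two runs is
    the basis of activation patching, one way to localize where a
    particular answer comes from.
    """
    acts, hooks = {}, []
    for name, module in model.named_modules():
        # Default arg pins `name` at definition time for each closure.
        def save(mod, inp, out, name=name):
            acts[name] = out
        hooks.append(module.register_forward_hook(save))
    try:
        with torch.no_grad():
            model(inputs)
    finally:
        for h in hooks:
            h.remove()
    return acts
```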
This framing makes sense to me. Thanks!