I know they’re just cartoons and I get the gist, but the graphs labelled “naive scenario” and “actual performance” are a little confusing.
The X axis seems to be measuring performance, with benchmarks like “high schooler” and “college student”, but in that case, what’s the Y axis? Is it the number of tasks that the model performs at that particular level? Something like that?
I think it would be helpful if you labeled the Y axis, even with just a vague label.
I know they’re just cartoons and I get the gist, but the graphs labelled “naive scenario” and “actual performance” are a little confusing.
The X axis seems to be measuring performance, with benchmarks like “high schooler” and “college student”, but in that case, what’s the Y axis? Is it the number of tasks that the model performs at that particular level? Something like that?
I think it would be helpful if you labeled the Y axis, even with just a vague label.
I was thinking of this as a histogram- probability that the model solves the task at that level of quality