But that references a number of standardized tests, some of which I suspect you have also taken. Here are a could of links to practice test that might have good matches for you to try.
Is there any information on how long the LLM spent on taking the tests? Any idea? I’d like to know the comparison with human times. (I realize it can depend on hardware, etc but would just like some general idea.)
Not sure if you’ve seen this or not: https://mashable.com/article/openai-gpt-4-exam-scores
But that references a number of standardized tests, some of which I suspect you have also taken. Here are a could of links to practice test that might have good matches for you to try.
https://www.tests.com/Free-Practice-Tests
https://www.khanacademy.org/college-careers-more/college-admissions/making-high-school-count/standardized-tests/a/full-length-sats-to-take-online
[Rewrite as I don’t think the first comment was actually helpful.]
Thank you! I didn’t see your first version of this, but your current version is helpful for the human-specific tests that they’re benchmarked on :)
Is there any information on how long the LLM spent on taking the tests? Any idea? I’d like to know the comparison with human times. (I realize it can depend on hardware, etc but would just like some general idea.)