LLMs are solving MCAT, the bar test, SAT etc like they’re nothing. At this point their performance is super human. However they’ll often trip on super simple common sense questions, they’ll struggle with creative thinking.

Is this literally proof that standard tests are not a good measure of intelligence?

  • @[email protected]
    link
    fedilink
    99 months ago

    LLMs have a good time with standardized tests like SAT precisely because they’re standardized, i.e. there’s enough information on the internet for them to parrot on them

    Try something more complex and free-form and where a human might have to work a little more to break it down into actual little subtasks with their intelligence - and then solve it, LLMs in the best case scenario will just say they don’t know how to do it, and in the worst case scenario they’ll hallucinate some actual bullshit.