“Notably, O3-MINI, despite being one of the best reasoning models, frequently skipped essential proof steps by labeling them as “trivial”, even when their validity was crucial.”

  • Pennomi
    link
    fedilink
    English
    arrow-up
    1
    arrow-down
    1
    ·
    1 year ago

    Yes, fair enough. I went looking for an intellectual conversation in a place that is decidedly not so. I generally forget that places like this exist.

    I’ll just show myself out. Thank you.