Using Reddit’s popular ChangeMyView community as a source of baseline data, OpenAI had previously found that 2022’s ChatGPT-3.5 was significantly less persuasive than random humans, ranking in just the 38th percentile on this measure. But that performance jumped to the 77th percentile with September’s release of the o1-mini reasoning model and up to percentiles in the high 80s for the full-fledged o1 model.

So are you smarter than a Redditor?

  • @rottingleaf
    link
    English
    1
    edit-2
    12 hours ago

    It already is, at least on Armenia and Azerbaijan. EDIT: I mean, the bots were crude-ish, but they don’t have to get better. Harder goals - better bots.