Willison: “No model has beaten GPT-4 on a range of widely used benchmarks like this.”

    • @BluesF
      link
      English
      67 months ago

      “Near-human” is marketing speak for “not as good as a human and there is no measurable scale to say how close it is so we will say it is closed”

    • P03 Locke
      link
      fedilink
      English
      37 months ago

      How big of a paycheck did the “journalist” get paid on this one?