Willison: “No model has beaten GPT-4 on a range of widely used benchmarks like this.”

    • @BluesF
      link
      English
      69 months ago

      “Near-human” is marketing speak for “not as good as a human and there is no measurable scale to say how close it is so we will say it is closed”

    • P03 Locke
      link
      fedilink
      English
      39 months ago

      How big of a paycheck did the “journalist” get paid on this one?

  • @nnullzz
    link
    English
    19 months ago

    Just gave Claude a try and although there are similarities with the other AI and how it “feels”, it responds with a certain depth and “open-mindedness” that I don’t think I’ve experienced with the other ones. Planning on playing around with it for a couple days to see the range of its helpfulness.

    • Dizzy Devil Ducky
      link
      fedilink
      English
      19 months ago

      I don’t have access to whatever their latest public model is and don’t know if the one on their website has updated in the past few months, but it’s by far my favorite AI model for generated text. Out of the stories I’ve had it generate, it’s by far the best compared to Perplexity and ChatGPT, at least for my standards.

  • @[email protected]
    link
    fedilink
    English
    1
    edit-2
    9 months ago

    The devs and shills for Claude also claimed it to be able to analyze a full document and give results, but it can’t, it just lies to you and says it can and then posts no download link for the results it said it wrote out.