A new report from plagiarism detector Copyleaks found that 60% of OpenAI’s GPT-3.5 outputs contained some form of plagiarism.

Why it matters: Content creators from authors and songwriters to The New York Times are arguing in court that generative AI trained on copyrighted material ends up spitting out exact copies.

  • FaceDeer
    link
    fedilink
    339 months ago

    Not to mention that a response “containing” plagiarism is a pretty poorly defined criterion. The system being used here is proprietary so we don’t even know how it works.

    I went and looked at how low theater and such were and it’s dramatic:

    The lowest similarity scores appeared in theater (0.9%), humanities (2.8%) and English language (5.4%).

    • BattleGrown
      link
      English
      99 months ago

      Yeah, anyone who has written a thesis knows those tools are bullshit. My handwritten 140 page master’s thesis had a similarity index of 11%.