Millions of articles from The New York Times were used to train chatbots that now compete with it, the lawsuit said.

  • @kromem
    link
    English
    611 months ago

    Spaceballs extracts almost all of its value from Star Wars without paying for it.

    You absolutely can extract value from things when the way in which you do it is fair use.

    Which is typically considered to be use that is transformative enough so as to not simply be derivative, or in the public interest.

    And I think you’d have a very difficult time showing LLMs general use to be derivative of any specific part of the training data.

    We’ll see soon, as these court cases resolve.

    And if the cases find in favor of the plaintiffs, “not charging” isn’t going to work out. You can’t copy material and not charge for it and get away with it. If there’s prior law that training is infringement, it’s unlikely the decision will be worded so narrowly that similar cases against companies that don’t charge will be found not to be infringement.

    Keep in mind one of the pending cases is against Meta, whose model is completely free to access and use.

    • @assassin_aragorn
      link
      311 months ago

      Just want to say this is great food for thought. Its going to take me time to mull over it