A bipartisan group of senators introduced a new bill to make it easier to authenticate and detect artificial intelligence-generated content and protect journalists and artists from having their work gobbled up by AI models without their permission.

The Content Origin Protection and Integrity from Edited and Deepfaked Media Act (COPIED Act) would direct the National Institute of Standards and Technology (NIST) to create standards and guidelines that help prove the origin of content and detect synthetic content, like through watermarking. It also directs the agency to create security measures to prevent tampering and requires AI tools for creative or journalistic content to let users attach information about their origin and prohibit that information from being removed. Under the bill, such content also could not be used to train AI models.

Content owners, including broadcasters, artists, and newspapers, could sue companies they believe used their materials without permission or tampered with authentication markers. State attorneys general and the Federal Trade Commission could also enforce the bill, which its backers say prohibits anyone from “removing, disabling, or tampering with content provenance information” outside of an exception for some security research purposes.

(A copy of the bill is in the article; here is the important part, imo:

Prohibits the use of “covered content” (digital representations of copyrighted works) with content provenance to either train an AI-/algorithm-based system or create synthetic content without the express, informed consent and adherence to the terms of use of such content, including compensation)

  • @just_another_person
    -23 points · 2 months ago

    Don’t see an issue with this. People who scrape copyrighted content should pay for it.

    • RubberDuck
      40 points · 2 months ago

      Closing the door behind the ones that already did it means only the current groups that have the data will make money off it.

        • v_krishna
          10 points · 2 months ago

          This regulation (and similar being proposed in California) would not be applied retroactively.

            • @[email protected]
              15 points · 2 months ago

              Since no retroactive measures are mentioned, the companies that already scraped the web won’t be stopped from continuing to use the AI models already trained on that data, but anyone else would be stopped by the law.

              It is like making it illegal to rob banks after someone already robbed all the banks and letting them keep all the money.

              The law could have made it illegal to use models trained on copyrighted materials without permission, instead of targeting the process of collecting the data.

      • @just_another_person
        -18 points · 2 months ago

        Downvote all you want. If your entire business or personal model includes stealing content from other people, then you need to rethink that.

        • RubberDuck
          18 points · 2 months ago

          “stealing” implies the owner does not have it anymore… It is large studio speak.

          And I get what you are trying to say, I just think the copyright system is so broken that this shows it is in need of reform. Because if the qualm is with people doing immoral shit as a business model, there are long lists of corporations that will ask you to hold their beer.

          And the fact that the models were already trained on these materials means the owners of the current models are probably training on generated datasets now, so by the time this actually hits court, the datasets with original copyrighted materials will be obsolete.

          • @[email protected]
            2 points · 2 months ago

            Regarding obsolete models, that’s only partially true. There’s loads of content that is effectively “finished” and won’t be changing, and it will grow obsolete at a fairly slow pace, meaning it’ll stay useful in the models for years once they’re trained.

            Obviously new technology and similar ideas/content that didn’t exist when the model was created won’t be there, but the amount that changes and/or is new each year is relatively small compared to all the historical content.

            • RubberDuck
              8 points · 2 months ago

              Well that’s a well articulated reply.

              I don’t understand why you would take this position. Because the small artists will never be able to avoid being included in training sets, and if they are, what are they going to do against a VC-backed corp like OpenAI? All the while the big copyright “owners” will be excluded. Meaning this only cements the position of the mega corps.

        • @afraid_of_zombies
          3 points · 2 months ago

          Stealing: depriving you of what you own

          Copying: taking a picture of what you made.

          Stealing is not copying. You still have whatever you started with.

    • @afraid_of_zombies
      4 points · 2 months ago

      Yes, it is perfectly appropriate for someone who burned a backup copy of a DVD they paid for to go to prison for ten years.