A bipartisan group of senators introduced a new bill to make it easier to authenticate and detect artificial intelligence-generated content and protect journalists and artists from having their work gobbled up by AI models without their permission.

The Content Origin Protection and Integrity from Edited and Deepfaked Media Act (COPIED Act) would direct the National Institute of Standards and Technology (NIST) to create standards and guidelines that help prove the origin of content and detect synthetic content, like through watermarking. It also directs the agency to create security measures to prevent tampering and requires AI tools for creative or journalistic content to let users attach information about their origin and prohibit that information from being removed. Under the bill, such content also could not be used to train AI models.

Content owners, including broadcasters, artists, and newspapers, could sue companies they believe used their materials without permission or tampered with authentication markers. State attorneys general and the Federal Trade Commission could also enforce the bill, which its backers say prohibits anyone from “removing, disabling, or tampering with content provenance information” outside of an exception for some security research purposes.

(A copy of the bill is in he article, here is the important part imo:

Prohibits the use of “covered content” (digital representations of copyrighted works) with content provenance to either train an AI- /algorithm-based system or create synthetic content without the express, informed consent and adherence to the terms of use of such content, including compensation)

  • RubberDuck
    link
    English
    401 month ago

    Closing the door behind the ones that already did it means only the current groups that have the data will make money of it.

      • v_krishna
        link
        fedilink
        English
        101 month ago

        This regulation (and similar being proposed in California) would not be applied retroactively.

          • @[email protected]
            link
            fedilink
            English
            151 month ago

            Since no retroactive measures are mentioned, the companies that already scraped the web won’t be stopped from continuing to use the AI models already trained on that data, but anyone else would be stopped by the law.

            It is like making it illegal to rob banks after someone already robbed all the banks and letting them keep all the money.

            The law could have made it illegal for use of models trained on the copyrighted materials without permission instead of targeting the process for collecting it.

    • @just_another_person
      link
      English
      -181 month ago

      Downvote all you want. If your entire business or personal model includes stealing content from other people, then you need to rethink that.

      • RubberDuck
        link
        English
        181 month ago

        “stealing” implies the owner does not have it anymore… It is large studio speak.

        And I get what you are trying so say, I just think the copyright system is so broken that this shows it is in need of reform. Because if the qualm is with people doing immoral shit as a business model, there are long lists of corporations that will ask you to hold their beer.

        And the fact that the training of the models already occurred on these materials means that the owners of the current models are probably training on generated datasets meaning that by the time this actually hits court, the datasets with original copyrighted materials will be obsolete.

        • @[email protected]
          link
          fedilink
          English
          21 month ago

          Regarding obsolete models, that’s only partially true. There’s loads of content that are effectively “finished” and won’t be changing, and will grow obsolete at a fairly slow pace. Meaning they’ll be useful in the models once trained for years.

          Obviously new technology and similar ideas/content that didn’t exist when the model was created won’t be there, but the amount that changes and or is new is relatively small each year compared to all the historical content.

          • RubberDuck
            link
            English
            81 month ago

            Well that’s a well articulated reply.

            I don’t understand why you would take this position. Because the small artists will never be able to avoid Beiing included in training sets, and if they are what are they going to do against a VC backed corpnlike OpenAI. All the while the big copyright “owners” will be excluded. Meaning this only cements the position of the mega corps.

      • @afraid_of_zombies
        link
        English
        31 month ago

        Stealing: depriving you of what you own

        Copying: taking a picture of what you made.

        Stealing is not copying. You still have whatever you started with.