• TimeSquirrel · -12 points · 9 months ago

    This argument was settled with electronic music in the 80s/90s. Samples and remixes taken directly from other bits of music to create a new piece aren’t plagiarism.

    • IWantToFuckSpez · 10 points · 9 months ago

      Yeah, that’s because sampling artists pay royalties to the owners of those bits of music. It’s only fair use and royalty-free if the sample is transformed in such a way that you can’t trace it back to the source.

      Some generative AI models literally create images from movies that can be traced back to the source; that’s not fair use.

      • TimeSquirrel · 0 points · 9 months ago

        if the sample is transformed in such a way that you can’t trace it back to the source

        If I pop a quote wholly generated by ChatGPT into Google, chances are very good that I am NOT going to find that exact quote anywhere online. It has been fully transformed from whatever training data it had on the subject.
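
        To make that concrete, here’s a minimal sketch of the kind of check described above: take a model-generated quote and test whether any long word-for-word run from it appears in a local corpus of source text. The file name and the 8-word window size are arbitrary placeholders I picked for illustration, not anything specific to ChatGPT or Google.

        # Sketch: does a generated quote contain any verbatim n-word run from a corpus?
        from pathlib import Path

        def has_verbatim_run(quote: str, corpus: str, n_words: int = 8) -> bool:
            """Return True if any n_words-long word sequence from the quote
            appears verbatim in the corpus text."""
            words = quote.split()
            for i in range(len(words) - n_words + 1):
                window = " ".join(words[i:i + n_words])
                if window in corpus:
                    return True
            return False

        quote = "Paste the model-generated quote here."           # placeholder
        corpus = Path("source_texts.txt").read_text()              # hypothetical local corpus
        print("verbatim overlap found" if has_verbatim_run(quote, corpus) else "no verbatim overlap")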

          • TimeSquirrel · 0 points · 9 months ago

            Source: my own findings. Why not see for yourself? Has anybody here actually used AI, or are you all just shit-talking from the stands?

      • Xhieron · 0 points · 9 months ago

        And you’re absolutely right about that. That’s not the same thing as saying LLMs are incapable of composing anything written in a novel way, but the fact that they will, with very little prodding, readily regurgitate complete works verbatim is definitely a problem. That’s not a remix. That’s publishing the same track and slapping your name on it. Doing it two bars at a time doesn’t make it better.

        It’s so easy to get ChatGPT, for example, to regurgitate its training data that you could do it by accident (at least until the method was published last year). But, the critics cry, you’re using ChatGPT in an unintended way. And indeed, exploiting ChatGPT to reveal its training data is a lot like lobotomizing a patient or torture victim to get them to reveal where they learned something, but that really betrays that these models don’t actually think at all. They don’t contribute anything of their own; they simply have such a large volume of data to reorganize that it’s (by design) impossible to divine which source is being plagiarised at any given token.

        Add to that the fact that every regulatory body confronted with the question of LLM creativity has so far decided that humans, and only humans, are capable of creativity, at least so far as our ordered societies will recognize. By legal definition, ChatGPT cannot transform (term of art) a work. Only a human can do that.

        It doesn’t really matter how an LLM does what it does. You don’t need to open the black box to know that it’s a plagiarism machine, because plagiarism doesn’t depend on methods (or sophisticated mental gymnastics); it depends on content. It doesn’t matter whether you intended the work to be transformative: if you repeated the work verbatim, you plagiarized it. It’s already been demonstrated that an LLM, by definition, will repeat its training data a non-zero portion of the time. In small chunks that’s indistinguishable, arguably, from the way a real mind might handle language, but in large chunks it’s always plagiarism, because an LLM does not think and cannot “remix”. A DJ can make a mashup; an AI, at least as of today, cannot. The question isn’t whether the LLM spits out training data; the question is the extent to which we’re willing to accept some amount of plagiarism in exchange for the utility of the tool.

      • @[email protected] · -1 points · 9 months ago

        If AIs are plagiarism machines, then the situation described above must be an example of DJs plagiarising.

    • @[email protected] · 0 points · 9 months ago

      The samples were intentionally rearranged and mixed with other content in a new and creative way.

      When sampling took off, the copyright situation was sorted out, and the end result is that there are ways to license samples. Some samples are produced like stock footage that can be purchased inexpensively, which is why a lot of songs by different artists include the same samples. Samples of specific songs have to be licensed, so a hip-hop song with a riff from an older famous song had some kind of licensing or it wouldn’t be played on the radio or streaming services. They might have paid one time, or paid an artists’ group for access to a bunch of songs; it’s basically the same kind of thing as covers.

      Samples and covers are not plagiarism if they are licensed and credit their source. Both create something new, but use and credit existing works.

      AI is doing the same sampling and copying, but trying to pretend that it is somehow not sampling and copying, and the companies running AI don’t want to credit the sources or license the content. That is why AI is plagiarism.

    • @[email protected] · 0 points · 9 months ago

      Not even remotely the same. A producer still has to choose what to sample, and what to do with it.

      An AI is just a black box with a “create” button.