Did that AI system use Doja Cat records for training data?

  • @[email protected]
    English
    -8
    10 months ago

    With text, it’s possible to trick the AI into revealing a license, or it may just print the license unprompted (like GitHub’s Copilot). But with music, how is anybody going to detect it?

    Anti Commercial-AI license

    • @efstajas
      English
      7
      10 months ago

      What’s the point of posting that “anti commercial ai license” on every comment? It’d be so easy to filter it out of a training dataset. Or am I missing something?

      • @[email protected]
        English
        3
        edit-2
        10 months ago

        It’s dead easy. Yet GitHub didn’t do it when training Copilot and is now being sued because of it.

        It is also easy to build a database of copyrighted material and check whether revealed training data matches it. The copyright licence doesn’t necessarily need to be attached. It just makes it easier to spot.
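
        For text, that kind of check could look roughly like the sketch below. It is only an illustration: the function names (`shingles`, `build_index`, `find_matches`), the 5-word window and the 0.5 threshold are all made up for this example, and it says nothing about music, which would need audio fingerprinting instead.

```python
import hashlib
import re


def shingles(text, n=5):
    """Return the set of hashed word n-grams in text, after light normalisation."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    grams = (" ".join(words[i:i + n]) for i in range(len(words) - n + 1))
    return {hashlib.sha1(g.encode("utf-8")).hexdigest() for g in grams}


def build_index(works, n=5):
    """Map each known work's title to its shingle set (hypothetical 'database')."""
    return {title: shingles(body, n) for title, body in works.items()}


def find_matches(output, index, n=5, threshold=0.5):
    """Return titles whose shingles cover at least `threshold` of the output's shingles."""
    out = shingles(output, n)
    if not out:
        # Output shorter than n words: nothing to compare.
        return []
    return [
        title
        for title, known in index.items()
        if len(out & known) / len(out) >= threshold
    ]
```

        You would call `build_index` once on the works you hold rights to, then run `find_matches` on whatever text the model reveals. Exact n-gram hashing only catches near-verbatim copies, so paraphrased output would slip through.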

        Also, what are you arguing here? That because copyright is easy to ignore, it should be ignored, or that it’s pointless? Is that the advice you’d give anybody else too? “You know what, Disney, everyone ignores copyright, so why not make everything public domain?”

        Anti Commercial-AI license

    • The Octonaut
      English
      4
      10 months ago

      Maybe musicians might end every song with some magic words that stop people using it.