You probably need some technical restrictions as well, but from the legal perspective: is there a license that is like Creative Commons EXCEPT for use cases like use the content for training an LLM by OpenAI or google?

Cc @[email protected]

  • @DocMcStuffin
    link
    61 year ago

    This is what we are going to find out in the next three years. Someone could come up with a license, but we won’t know if it’s viable until it gets tested by the courts. There’s still debate on whether scraping from copyrighted works then using that data to train a LLM (or any other ML model) is an allowed use. Technically, that is transformative of the original works, but it uses vast numbers of copyright works. The New York Times is looking at suing OpenAI over this exact issue.

    In any case, if you did come up with a license, you would also have to enforce it. With no clear legal precedents that would require filling suit against any violator.