caption

a screenshot of the text:

Tech companies argued in comments on the website that the way their models ingested creative content was innovative and legal. The venture capital firm Andreessen Horowitz, which has several investments in A.I. start-ups, warned in its comments that any slowdown for A.I. companies in consuming content “would upset at least a decade’s worth of investment-backed expectations that were premised on the current understanding of the scope of copyright protection in this country.”

underneath the screenshot is the “Oh no! Anyway” meme, featuring two pictures of Jeremy Clarkson saying “Oh no!” and “Anyway”

screenshot (copied from this mastodon post) is of a paragraph of the NYT article “The Sleepy Copyright Office in the Middle of a High-Stakes Clash Over A.I.

  • @[email protected]
    link
    fedilink
    91
    edit-2
    10 months ago

    We need copyright reform. Life of author plus 70 for everything is just nuts.

    This is not an AI problem. This is a companies literally owning our culture problem.

    • @grue
      link
      English
      39
      edit-2
      10 months ago

      We do need copyright reform, but also fuck “AI.” I couldn’t care less about them infringing on proprietary works, but they’re also infringing on copyleft works and for that they deserve to be shut the fuck down.

      Either that, or all the output of their “AI” needs to be copyleft.

      • @SirQuackTheDuck
        link
        24
        edit-2
        10 months ago

        Not just the output. One could construct that training your model on GPL content which would have it create GPL content means that the model itself is now also GPL.

        It’s why my company calls GPL parasitic, use it once and it’s everywhere.

        This is something I consider to be one of the main benefits of this license.

        • @[email protected]
          link
          fedilink
          210 months ago

          So if I read a copyleft text or code once, because I understood and learned from it any text I write in the future also has to be copyleft?

          HOLY SHIT!

      • @Dkarma
        link
        -1510 months ago

        It already is. God you uninformed people are insufferable.

        • @grue
          link
          English
          1310 months ago

          It already is.

          If you mean that the output of AI is already copyleft, then sure, I completely agree! What I meant to write that we “need” is legal acknowledgement of that factual reality.

          The companies running these services certainly don’t seem to think so, however, so they need to be disabused of their misconception.

          I apologize if that was unclear. (Not sure the vitriol was necessary, but whatever.)

          • @Dkarma
            link
            -610 months ago

            There have already been cases decided… That’s enough

    • MustrumR
      link
      fedilink
      3110 months ago

      Going one step deeper, at the source, it’s oligarchy and companies owning the law and in consequence also its enforcement.

  • @LavaPlanet
    link
    5810 months ago

    Piracy / stealing content is ok for big corps Piracy / stealing content punishable by life in prison for us proletarians

    • @Dkarma
      link
      1310 months ago

      This is simply not stealing. Viewing content has never ever ever been stealing.

      There is no view right.

      • 🦄🦄🦄
        link
        fedilink
        1110 months ago

        They are downloading the data so thei LLM can “view” it. How is that different than downloading movies to view them?

        • @Dkarma
          link
          -7
          edit-2
          10 months ago

          They’re not downloading anything tho. That’s the point. At no point are they posessing the content that the AI is viewing.

          This is LESS intrusive than a Google web scraper. No one trying to sue Google for copyright for Google searches.

          • 🦄🦄🦄
            link
            fedilink
            510 months ago

            What? Of course they are downloading, the content still has to reach their networks and computers.

            • @Dkarma
              link
              -2
              edit-2
              10 months ago

              Go look up how ai works. There is no download lol. It’s the exact same principal as web scrapers which have been around for literally decades.

      • Jamyang
        link
        English
        910 months ago

        Tech illiterate guy here. All these Ml models require training data, right? So all these AI companies that develop new ML based chat/video/image apps require data. So where exactly do they? It can’t be that their entire dataset is licensed, isn’t it?

        If so, are there any firms that are using these orgs for data theft? How to know if the model has been trained on your data? Sorry if this is not the right place to ask.

        • @Dkarma
          link
          9
          edit-2
          10 months ago

          You know how you look at a pic on the internet and don’t pay? The AI is basically doing the same thing only it’s collecting the effect of the data points ( like pixels in a picture) more accurately. The input no matter what it is only moves a set of weights. That’s all. It does not copy anything it is trained on.

          Yes it can reproduce with some level of accuracy any work just like a painter or musician could replay a piece they see or hear.

          Again, this is not theft any more than u hearing a Song or viewing a selfie.

          • Jamyang
            link
            English
            110 months ago

            only it’s collecting the effect of the data points ( like pixels in a picture) more accurately

            Isn’t that the entire point of creativity. though? What separates an artist from a bad painter is the positioning of pixels on a 2-Dimensional plane? If the model collects the positions of pixels together with the pixel RGB (color? Don’t know the technical term for it), then the model is effectively stealing the “pixel configuration and makeup” of that artist which can be reproduced by the said model anywhere if similar prompts were passed to it?

            • @Dkarma
              link
              210 months ago

              Focus. We are talking about copyright. Copyright doesn’t cover this at all.

      • @Katana314
        link
        English
        110 months ago

        Could say piracy is just running a program that “views” the content, and then regurgitates its own interpretation of it into local data stores.

        It’s just not very creative, so it’s usually very close.

        • @Dkarma
          link
          1
          edit-2
          10 months ago

          You could say that but you’d be wrong.downloading is a bitwise copy. Training isn’t even close to the same thing.

      • @[email protected]
        link
        fedilink
        19 months ago

        That’s one thing, but I think regurgitating it and claiming it as your own is a completely different thing.

        • @[email protected]
          link
          fedilink
          19 months ago

          Also, I’m pretty sure the argument is more about the unequal enforcement of the law. Copyright should be either enforced fairly or not at all. If AI is allowed to scrape content and regurgitate it, piracy should also be legal.

        • @Dkarma
          link
          09 months ago

          Again that’s not what’s happening here

  • @Aremel
    link
    4410 months ago

    I’m gonna play them a song on the world’s smallest violin.

  • @postnataldrip
    link
    4310 months ago

    That’s what would be called “a swing and a miss”

    It’s almost like speculating has risks

  • @Gradually_Adjusting
    link
    English
    3410 months ago

    “You can’t just decide we were wrong about IP, that would make us broke!”

  • @[email protected]
    link
    fedilink
    English
    2610 months ago

    Can we just put all the media and technology executives in an alley where they can fight it out like the scene from Anchorman?

    • @[email protected]
      link
      fedilink
      310 months ago

      I saw that! Brick killed a guy! Did you throw a trident?

      Yeah, there were horses, and a man on fire, and I killed a guy with a trident!

      Brick, I’ve been meaning to talk to you about that. You should find yourself a safehouse or a relative close by. Lay low for a while, because you’re probably wanted for murder.

  • @[email protected]
    link
    fedilink
    2210 months ago

    Either this kill large AI models (at least commercial). Or it kill some copyright bs in some way. Whatever happens, society wins.

    Second option could also hurt small creator though.

    • @[email protected]
      link
      fedilink
      910 months ago

      I fear this is a giant power grab. What this will lead to is that IP holders, those that own the content that AI needs to train will dictate prices. So all the social media content you kindly gave reddit, facebook, twitter, pictures, all that stuff means you won’t be able to have any free AI software.

      No free / open source AI software means there is a massive power imbalance because now only those who can afford to buy this training data and do it, any they are forced to maximize profits (and naturally inclined anyway).

      Basically they will own the “means of generation” while we won’t.

      • @[email protected]
        link
        fedilink
        510 months ago

        Current large model would all be sued to death, no license with IP owner yet, would kill all existing commercial large models. Except all IP owner are named and license granted retroactive, but sound unlikely.

        Hundred of IP owner company and billion of individual IP owner setting prices will probably behave like streaming: price increase and endless fragmentation. Need a license for every IP owner, paperwork will be extremely massive. License might change, expire, same problem as streaming but every time license expire need to retrain entire model (or you infringe because model keep using data).

        And in the EU you have right to be forgotten, so excluded from models (because in this case not transformative enough, ianal but sound like it count as storing), so every time someone want to be excluded, retrain entire model.

        Do not see where it possible to create large model like this with any amount of money, time, electricity. Maybe some smaller models. Maybe just more specific for one task.

        Also piracy exists, do not care about copyright, will just train and maybe even open source (torrent). Might get caught, might not, might become dark market, idk. Will exist though, like deepfakes.

        • @[email protected]
          link
          fedilink
          210 months ago

          Yeah those are the myriad of complications this will cause. People are worried about AI, I’m too, but we need smart regulation not to use IP laws that only increases power of the ultra-rich. Because if AI will continue to exist, that will severely distort and limit the market to very specific powerful entities. And that is almost certainly going to be worse than completely unregulated.

    • @[email protected]
      link
      fedilink
      610 months ago

      I know plenty of small creators who urge me to pirate their content.

      Because all they want is people to enjoy their content, and piracy helps spread their art.

      So even small creators are against copyright.

  • @[email protected]
    link
    fedilink
    1710 months ago

    I mean, I won’t deny that small bit of skill it took to construct a plausible sounding explanation for why the public should support your investment, because it’s “not illegal (yet)”.

    • @killeronthecorner
      link
      English
      2010 months ago

      “technically this thievery isn’t covered by law”

      "technically this what?”

      "OBJECTION!”

      • @Dkarma
        link
        010 months ago

        You stole my post by looking at it. Pay me.

  • @[email protected]
    link
    fedilink
    610 months ago

    That’s the point about money, if you have enough you can simply sue or bribe in order to not lose money.

  • @psycho_driver
    link
    310 months ago

    “In other news, the new Dacia Sandero looks fabulous.”

  • @db2
    link
    310 months ago

    “warned”

    Or what? I want to see that bluster called.

  • @Harbinger01173430
    link
    210 months ago

    A’ight. Time to self host the entire of the internet in a server and do machine learning with the content I stored. :)

    • @[email protected]
      link
      fedilink
      610 months ago

      As Robert Evan’s put it: “If we can’t steal every book ever written, we’ll go broke!”

    • @[email protected]
      link
      fedilink
      510 months ago

      A scammer made unreasonable promises to investors and is now warning everyone that their victims/investors are going to lose money when the process of making fair laws takes the typical amount of time that it always takes.

    • @[email protected]
      link
      fedilink
      410 months ago

      They made investments and projections on their business based on the current laws and they’ll be sad if the laws change now.