• @cm0002
    link
    English
    11 day ago

    That’s not how training works with LLMs at all

    • nickwitha_k (he/him)
      link
      fedilink
      English
      -21 day ago

      I can’t recall a time when I downloaded an album, took samples of the entire thing, pretended I made it without making any actual alterations, and tried to sell access to it.

      • @cm0002
        link
        English
        41 day ago

        It does make alterations of it, it’s completely shredded up as part of the training process and turned into numbers and statistics mushed with a bunch of other numbers and statistics.

        It’s like baking a cake, you mix in flour, butter, eggs, and bake it. Once mixed and baked you can’t get the flour, butter and eggs back to their original form and the final product is completely different

        If it wasn’t you’d be able to pull full unaltered copies directly from the model files, but that hasn’t been accomplished. The best that people have been able to do is get the AI to recreate something pretty close to the original with very careful and specific prompts. But it’s still a recreation, based on what it “learned”.

        • nickwitha_k (he/him)
          link
          fedilink
          English
          41 day ago

          Yes, my edit was a bit hyperbolic. The point being that current AI/LLM companies have been, at best, encoding data that they do not have permission to use into their models.

          It’s like baking a cake, you mix in flour, butter, eggs, and bake it. Once mixed and baked you can’t get the flour, butter and eggs back to their original form and the final product is completely different

          It’s more like baking a cake with flour, butter, and eggs that you snagged from other people’s grocery baskets after they paid for them. Then, started selling the cakes made from said ingredients.

          Ideally, none of that would matter because knowledge and data want to be free and everyone would benefit. However, we don’t live in such a world. Instead, the technology is being used almost exclusively to extract wealth from people and make the average human being’s life worse, both in the short-term by reducing their ability to support themselves and in the long-term by drastically increasing consumption of fossil fuels and potable water, putting more pressure on the biosphere.