• @[email protected]
    link
    fedilink
    English
    16
    edit-2
    11 months ago

    I think we can give facebook/threads the bad end of the bargin IF we have a data protections.

    You know how powerful copy-left was for open source? I think we can do the same for Lemmy servers. We can have users agree (formally) that the data on a particular server cannot be used for training llvm’s advertisements, marketing profiles, etc, and make it legally binding.

    Even if we don’t federate with them, Meta can still harvest the data so we should add these protections regardless. Maybe there is already something like this and I’m just unaware of it.

    If we do add these protections and we ensure that the largest instance (e.g. Lemmy.world) is community controlled, I think it could work well for bringing more content to Lemmy.

    • Pennomi
      link
      English
      911 months ago

      Yep, on a public forum like this we lose very little on privacy by federating with them. What we do stand to lose is comment and post quality, but that’s trivial to fix by simply blocking threads on a personal level.

    • @AustralianSimon
      link
      English
      511 months ago

      You can scrape Lemmy instances for training data without even running an instance.

      • @[email protected]
        link
        fedilink
        English
        0
        edit-2
        11 months ago

        Yeah, sorry if I’m not great at communicating. That’s exactly what I’m trying to point out when I said:

        Even if we don’t federate with them, Meta can still harvest the data so we should add these protections regardless.

        • @AustralianSimon
          link
          English
          111 months ago

          That’s the thing, anything public is fair game. This is why Reddit is ruining their API.

          • @[email protected]
            link
            fedilink
            English
            011 months ago

            It’s not fair game for for-profit bussinesses training LLM’s. That’s part of why Reddit made the move; so that companies would need to pay Reddit for access to the data for legally training models

            • @AustralianSimon
              link
              English
              111 months ago

              They changed the terms and made the API pay to use for large volumes of use. People using it to train models have already pillaged what they need and you can get the data prior to APIgeddon elsewhere.

              • @[email protected]
                link
                fedilink
                English
                0
                edit-2
                11 months ago

                Sure, but it’s still true that there are legal protections we can add that make it not fair game for Lemmy. At best it would be unfair-game (illegal scraping of Lemmy)

                • @AustralianSimon
                  link
                  English
                  111 months ago

                  A rule for one Lemmy or even the Lemmy app doesn’t mean same rule applies across ActivityPub Federation, if your data federated to my instance, it’s mine too.