The lawsuit alleges OpenAI crawled the web to amass huge amounts of data without people’s permission.

  • @SamB
    link
    English
    301 year ago

    I doubt it’s only about some Reddit posts. The scrapping was done on the whole web, capturing everything it could. So besides stealing data and presenting it as its own, it seems to have collected some even more problematic data which wasn’t properly protected.

    • @zekiz
      link
      English
      231 year ago

      But that really isn’t OpenAI’s fault. Whoever was in charge of securing the patients data really fucked up.

      • krellor
        link
        fedilink
        241 year ago

        Leaving your front door open isn’t prudent but doesn’t grant permission to others to enter and take/copy your belongings or data.

        The security teams may have royally screwed up, but OpenAI has a legal obligation to respect copyright and laws regarding data ownership.

        Likewise, they could have scraped pages that included terms of use, copyright, disclaimers, etc., and failed to honor them.

        All parties can be in the wrong for different reasons.

      • @almar_quigley
        link
        English
        141 year ago

        That’s like saying you didn’t lock your front door so whoever robs you is innocent.

        • @[email protected]
          link
          fedilink
          English
          61 year ago

          I think it’s a little closer to being mad that the Google street car drove by and snapped a picture of the front of your house, tbh.

          • @almar_quigley
            link
            English
            11 year ago

            Except pii and spi are protected under law, just like your possessions.

        • Dran
          link
          English
          61 year ago

          But does leaving your front door open allow one to legally take a picture of the inside from across the street? I’d say scraping is more akin to that than it is theft. Nothing is removed in scraping, just copied

          • @BradleyUffner
            link
            English
            21 year ago

            Bad analogy. This is like leaving your couch out on the sidewalk, then complaining when someone takes a picture of it.

        • @zekiz
          link
          English
          51 year ago

          It’s more like leaving an important letter in the open for everyone to read. It’s certainly your fault for leaving it that open.

        • @MercuryUprising
          link
          English
          21 year ago

          Yeah, but what were all these people whose data was scraped wearing?

      • Apathy Tree
        link
        fedilink
        English
        71 year ago

        It’s certainly their fault that they used it, though.

        If they cared, they could have ensured they weren’t using sensitive or otherwise highly problematic information, but they chose not to. That’s on them.

        • @MercuryUprising
          link
          English
          -31 year ago

          It’s called “disrupting” the established norms. You wouldn’t get it because you’re not on the bleeding edge of a revolutionary platform that’s seeing scalable vertical growth due to its paradigm shift.

      • jdp23
        link
        fedilink
        11 year ago

        They certainly fucked up, but it might well be OpenAI’s post too.

    • @tallwookie
      link
      English
      81 year ago

      if it was unsecured it’s basically public. whomever put that data on a publicly accessible server is at fault

      • @[email protected]
        link
        fedilink
        English
        10
        edit-2
        1 year ago

        That’s not necessarily true. Even if a company makes the mistake of not securing data correctly, those that make use of this data can still be at fault.

        If a company leaves a server wide open, you still can’t legally steal information from it.

        • @tallwookie
          link
          English
          11 year ago

          that’s kind of a grey area - digitally copying something that’s public domain isnt stealing.

        • @[email protected]
          link
          fedilink
          English
          01 year ago

          undefined> If a company leaves a server wide open, you still can’t legally steal information from it.

          I don’t see how this is any different than if Google search included text from a page that shouldn’t be public.