• @[email protected]
    35 · 2 months ago

    They’re too late. There’s going to be way too much AI-generated garbage in their data, and many social media platforms like Reddit and Twitter have already taken measures to curb scrapers.

    • @[email protected]
      18 · 2 months ago

      Like those platforms aren’t already full of AI garbage as well. Training new models will require a cut-off date before the genie was let out of the bottle.

    • Drunemeton
      4 · 2 months ago

      I think that’s the “25-times faster” bit. They seem to be in a hurry to collect as much human-generated data as possible.

      • GHiLA
        4 · 2 months ago

        How does it know what is and isn’t?

        Uh oh.

        • Drunemeton
          5 · 2 months ago

          Yeah…

          Hey! Perhaps they’ll use A.I. to weed out the A.I.-generated bits.

        • JackbyDev
          1 · 2 months ago

          I mean, if I could theoretically take a snapshot of the entire Internet I’d rather do it now than later because there’s just gonna be more AI later.