Google just announced it’s giving website publishers a way to opt out of having their data used to train the company’s AI models while remaining accessible through Google Search.

many sites have moved to block the web crawler that OpenAI uses to scrape data and train ChatGPT. However, there have been concerns over how to block out Google. After all, websites can’t close off Google’s crawlers completely, or else they won’t get indexed in search.

  • @[email protected]
    link
    fedilink
    248 months ago

    Nice of these AI companies to add opt out capability after they’ve trained their models on all available data.

    • @[email protected]
      link
      fedilink
      98 months ago

      This is by design. Helps to stop any competition as they will now struggle to get the same amount of training data.

  • Nina
    cake
    link
    fedilink
    English
    118 months ago

    This overall bullshit of doing opt-out fucking sucks. Opt-IN! Pissing off people and the. going “oh well there’s an opt-out” is so stupid. New experimental features that don’t directly benefit the user should be that they must have to opt-in manually. Off by default.

    • @[email protected]
      link
      fedilink
      English
      08 months ago

      When a company gives you a bunch of free services that work pretty well and everybody uses them, they get a little cocky

  • @SheeEttin
    link
    English
    58 months ago

    This doesn’t change anything. They’re still violating copyright law if they scrape any data.

    For search results, nobody really cared, because it was useful to everyone. Now they’re reprocessing it into output and competing with the original sites. That’s a big no-no.