• umami_wasabi
    link
    fedilink
    English
    1817 hours ago

    How can I do this without Cloudflare?

    • Rikudou_Sage
      link
      fedilink
      English
      2117 hours ago

      Put a page on your website saying that scrapping your website costs [insert amount] and block the bots otherwise.

        • melroy
          link
          fedilink
          412 hours ago

          Also you don’t want to block legit search engines that are not scraping your data for AI.

          • @[email protected]
            link
            fedilink
            English
            412 hours ago

            Again: hard to differentiate all those different bots, because you have to trust that they are what they say they are, and they often are not

              • @vinnymac
                link
                English
                2
                edit-2
                7 hours ago

                It certainly can be a cat and mouse game, but scraping at scale tends to be ahead of the curve of the security teams. Some examples:

                https://brightdata.com/

                https://oxylabs.io/

                Preventing access by requiring an account, with strict access rules can curb the vast majority of scraping, then your only bad actors are the rich venture capitalists.