Long story short, my VPS, which I’m forwarding my servers through Tailscale to, got hammered by thousands of requests per minute from Anthropic’s Claude AI. All of which being from different AWS IPs.

The VPS has a 1TB monthly cap, but it’s still kinda shitty to have huge spikes like the 13GB in just a couple of minutes today.

How do you deal with something like this?
I’m only really running a caddy reverse proxy on the VPS which forwards my home server’s services through Tailscale. "

I’d really like to avoid solutions like Cloudflare, since they f over CGNAT users very frequently and all that. Don’t think a WAF would help with this at all(?), but rate limiting on the reverse proxy might work.

(VPS has fail2ban and I’m using /etc/hosts.deny for manual blocking. There’s a WIP website on my root domain with robots.txt that should be denying AWS bots as well…)

I’m still learning and would really appreciate any suggestions.

    • @breadsmasher
      link
      English
      0
      edit-2
      2 hours ago

      yes i did read OP.

      ed. i see this was downvoted without a response. But il put this out there anyway.

      If you host a public site, which you expect anyone can access, there is very little you can do to exclude an AI scraper specifically.

      Hosting your own site for personal use? IP blocks etc will prevent scraping.

      But how do you identify legitimate users from scrapers? Its very difficult.

      They will use your traffic up either way. Dont want that? You could waste their time (tarpit), or take your hosting away from public access.

      Downvoter. Whats your alternative?