• @bill_1992
    link
    1631 year ago

    Everyone loves the idea of scraping, no one likes maintaining scrapers that break once a week because the CSS or HTML changed.

      • @Anonymousllama
        link
        221 year ago

        This one. One of the best motivators. Sense of satisfaction when you get it working and you feel unstoppable (until the next subtle changes happens anyway)

    • @camr_on
      link
      281 year ago

      I loved scraping until my ip was blocked for botting lol. I know there’s ways around it it’s just work though

      • Pennomi
        link
        English
        421 year ago

        I successfully scraped millions of Amazon product listings simply by routing through TOR and cycling the exit node every 10 seconds.

        • @camr_on
          link
          151 year ago

          That’s a good idea right there, I like that

        • ferret
          link
          fedilink
          English
          51 year ago

          lmao, yeah, get all the exit nodes banned from amazon.

          • Pennomi
            link
            English
            121 year ago

            That’s the neat thing, it wouldn’t because traffic only spikes for 10s on any particular node. It perfectly blends into the background noise.

        • @camr_on
          link
          71 year ago

          I’m coding baby’s first bot over here lol, I could probably do better

    • @dangblingus
      link
      111 year ago

      Or in the case of wikipedia, every table on successive pages for sequential data is formatted differently.

    • @Matriks404
      link
      71 year ago

      Just use AI to make changes ¯_(ツ)_/¯