We have paused all crawling as of Feb 6th, 2025 until we implement robots.txt support. Stats will not update during this period.

  • Semi-Hemi-Lemmygod
    link
    English
    197 hours ago

    Robots.txt is a lot like email in that it was built for a far simpler time.

    It would be better if the server could detect bots and send them down a rabbit hole rather than trusting randos to abide by the rules.

    • SwizzleStick
      link
      fedilink
      English
      116 hours ago

      It would be better if the server could detect bots and send them down a rabbit hole

      Already possible: Nepenthes.

      • Semi-Hemi-Lemmygod
        link
        English
        106 hours ago

        ANY SITE THIS SOFTWARE IS APPLIED TO WILL LIKELY DISAPPEAR FROM ALL SEARCH RESULTS.

        I’m sold

    • poVoq
      link
      fedilink
      English
      126 hours ago

      Because of AI bots ignoring robots.txt (especially when you don’t explicitly mention their user-agent and rather use a * wildcard) more and more people are implementing exactly that and I wouldn’t be surprised if that is what triggered the need to implement robots.txt support for FediDB.