I saw this post and I was curious what was out there.

https://neuromatch.social/@jonny/113444325077647843

Id like to put my lab servers to work archiving US federal data thats likely to get pulled - climate and biomed data seems mostly likely. The most obvious strategy to me seems like setting up mirror torrents on academictorrents. Anyone compiling a list of at-risk data yet?

  • OtterOP
    link
    fedilink
    English
    4610 hours ago

    One option that I’ve heard of in the past

    https://archivebox.io/

    ArchiveBox is a powerful, self-hosted internet archiving solution to collect, save, and view websites offline.

    • @CrazyLikeGollum
      link
      English
      78 hours ago

      That looks useful, I might host that. Does anyone have an RSS feed of at risk data?

    • Admiral Patrick
      link
      fedilink
      English
      11
      edit-2
      10 hours ago

      Going to check that out because…yeah. Just gotta figure out what and where to archive.

    • @M600
      link
      English
      69 hours ago

      This seems pretty cool. I might actually host this.