Hi guys,

I have archivebox running on my server and am very happy with it overall. But one thing bothers me: The archived pages are stored in subfolders whose names do not reflect the content. I would like to have at least a reasonable backup of the archived pages in case the archivebox instance stops working. It would also make it easier to transfer the archived pages to other devices or other people.

My question is therefore whether there is a docker self-hosted web archive solution that offers a similar range of functions to archivebox in terms of the different storage formats, e.g. for videos, but which makes the files easier to search in their folder structure.

Thanks in advance

PS

These are the solutions I have alteady tried:

  • linkwarden: archived sites are numbered 1,2,3…

  • linkding: archived sites have reasonably legible names, but no archiving of video

Which do you recommend for my requirements?

  • @[email protected]
    link
    fedilink
    English
    26 hours ago

    Maybe you could submit an issue to the repo to include a way to change the format of the saved folders.
    (I’m thinking something similar on how immich allows to change some formats)

    I’m seeing in my instance the names seem like some sort of timestamp, not sure if the code uses them in a meaningful way, so probably the solution would be to create symlinks with the name of the site or some other format while keeping the timestamp so the rest of the code can still expect it.