I wonder what the total data storage size is for all the publicly viewable content on reddit. I find it hard to even guess lol. 100TB? 10,000TB?
I would guess it’s A LOT smaller than you’d expect, especially if you’re just talking about posts and comments and not any uploaded images. The images themselves are almost certainly many orders of magnitude larger than the conversations.
Btw I did just find this: https://www.reddit.com/r/DataHoarder/comments/pqxs8m/size_of_reddit/
The post is a few years old and quotes data that is a few years older still, but even assuming Reddit has doubled in size since then, there’s only about 10TB of data for text, comments, etc. (i.e. no images).
Now I’m assuming this is compressed btw. (The link in the post is dead so I can’t actually check out the file and see what’s in there).
The compressed archive of reddit from mid-2005 until 2022 is 2 TB: https://academictorrents.com/details/7c0645c94321311bb05bd879ddee4d0eba08aaee
Uncompressed it is likely way larger though.
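For a rough feel of how much larger, one option is to measure a compression ratio on a small sample of comment-like JSON and scale the 2 TB figure by it. This is only a sketch: the sample records below are made up, `zlib` is a stand-in for whatever compressor the archive actually uses, and the function names are hypothetical. The synthetic sample is far more repetitive than real comments, so the ratio it prints will be unrealistically high; you'd want to run this on a real slice of the dump.

```python
import json
import zlib


def compression_ratio(sample: bytes, level: int = 6) -> float:
    """Return uncompressed size divided by compressed size for a sample."""
    return len(sample) / len(zlib.compress(sample, level))


def estimate_uncompressed_tb(compressed_tb: float, ratio: float) -> float:
    """Scale a compressed size by a measured compression ratio."""
    return compressed_tb * ratio


# Made-up records standing in for real comments (assumption, not real data).
sample = "\n".join(
    json.dumps(
        {
            "id": i,
            "author": f"user{i % 50}",
            "subreddit": "DataHoarder",
            "body": "I wonder how big the whole site is. " * (1 + i % 5),
        }
    )
    for i in range(1000)
).encode("utf-8")

ratio = compression_ratio(sample)
print(f"measured ratio on sample: {ratio:.1f}x")
print(f"2 TB compressed would be roughly {estimate_uncompressed_tb(2.0, ratio):.0f} TB uncompressed")
```

The estimate is only as good as the sample, so treat the result as a ballpark rather than a measurement.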
Thanks. I wonder how much it would be uncompressed, but I guess that doesn’t matter, as you can probably keep it compressed on your fediverse instance until a user needs to access it.
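The "keep it compressed until someone reads it" idea could look something like this. It's a minimal in-memory sketch, not how any fediverse software actually stores posts: `CompressedStore` and its methods are hypothetical names, and `zlib` is just the compressor that ships with Python.

```python
import zlib


class CompressedStore:
    """Toy store that keeps text compressed and only decompresses on read."""

    def __init__(self) -> None:
        self._blobs: dict[str, bytes] = {}

    def put(self, key: str, text: str) -> None:
        # Compress once at write time; only the compressed bytes are kept.
        self._blobs[key] = zlib.compress(text.encode("utf-8"))

    def get(self, key: str) -> str:
        # Decompress lazily, only when a user actually requests the item.
        return zlib.decompress(self._blobs[key]).decode("utf-8")

    def stored_bytes(self) -> int:
        # Total size of what is actually held in the store.
        return sum(len(blob) for blob in self._blobs.values())


store = CompressedStore()
comment = "Uncompressed it is likely way larger though. " * 100
store.put("comment-1", comment)

print(len(comment.encode("utf-8")), "bytes raw")
print(store.stored_bytes(), "bytes stored")
print(store.get("comment-1") == comment)
```

The tradeoff is a bit of CPU on every read in exchange for less disk, which is probably fine for old threads that rarely get opened.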