Federation troubleshooting

Ruud · 2 years ago

Federation troubleshooting

tal · 2 years ago

Thanks for your work and sharing results!

I think that kbin and lemmy are going to ultimately have to record per-instance response time and back off on a given instance. Like, if another instance is failing or overloaded, it’s going to have to reduce the frequency with which it attempts to communicate with that instance, to avoid having a ton of workers tied up trying to communicate with that instance.

The Quuuuuill · 2 years ago

I’d probably recommend exponential backoff with a low max retries

@NuclearArmWrestling · 2 years ago

Ideally, multiple instances could band together and create something like a hub that they all push and pull from. It’s a little more centralization, but would likely significantly reduce overall network and CPU consumption.