Experiments in Ceph (with Proxmox)

HTTP_404_NotFound · 2 years ago

30021190 · 2 years ago

So my production setup is 2x10Gb bonded NICs for networking and 2x10Gb bonded NICs for Ceph/cluster traffic. I suspect that when Ceph is being heavily used you may see bottlenecks; however, once you have a host-based failure domain, then in theory your data should be closer to the correct host and that shouldn’t be an issue. At a basic level, though, it just means having 3 copies of the data, one on each host, so it doesn’t save you any storage; it just reduces the risk during a failure.
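To put a number on the "doesn’t save you any storage" point, here is a rough sketch of usable capacity under plain replication. The host sizes are made up for illustration, and it ignores the free-space headroom Ceph wants for recovery.

```python
def usable_capacity_tb(raw_tb_per_host, replicas=3):
    """Usable capacity when every object is stored `replicas` times."""
    total_raw = sum(raw_tb_per_host)
    return total_raw / replicas


# Hypothetical cluster: three hosts with 4 TB of OSDs each.
print(usable_capacity_tb([4, 4, 4]))  # 12 TB raw -> 4.0 TB usable
```

So with 3 copies you get roughly one host's worth of space out of three hosts' worth of disks.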

Thinking about it, you may actually see better results with ZFS and replication jobs, as there are fewer overheads and the ZFS sending is incremental. You’d obviously just lose X minutes of data instead of X seconds with Ceph.
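As a rough sketch of that "X minutes" figure: with snapshot-based replication, the worst case is a failure just before a sync finishes, so you lose everything written since the previous snapshot was taken. The interval and transfer time below are assumed numbers, not measurements.

```python
def worst_case_loss_seconds(sync_interval_s, transfer_s=0.0):
    """Upper bound on lost writes if the source host dies right before
    the in-flight replication job completes: one full interval plus the
    time that job had already spent transferring."""
    return sync_interval_s + transfer_s


# Assumed: a job every 15 minutes, each taking about 30 seconds to send.
print(worst_case_loss_seconds(15 * 60, 30))  # 930 seconds, about 15.5 minutes
```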

HTTP_404_NotFound · 2 years ago

you may actually see better results with ZFS and replication jobs

Oh, I know the performance is drastically better doing that. I did play with it, and it works for the most part. Performance is dramatically better, but with Ceph I have peace of mind knowing that if a host just magically craps itself, the data is already in place and the machine has already fired up on the new host without any issues.

Also, there is something fun about literally tossing over 6 million IOPS worth of SSDs into my cluster, just to barely squeeze 50k IOPS out of Ceph!

I have 5 more “enterprise” NVMes arriving Tuesday, which will complete my Ceph cluster.

Currently, I have 4 of the enterprise SATA SSDs in place, and a single 980 as a placeholder.

Nothing at all to write home about. BUT, I do think the lack of distributed drives is making an impact. My most powerful host doesn’t have any OSDs yet; I’m still waiting on the NVMes to arrive.

During heavy benchmarking, the limitations of the consumer 980 evo became pretty apparent when its latency spiked through the roof.

The addition of the 5 new NVMes should make a pretty dramatic difference. If I can squeeze out 100k IOPS, I will be happy. (Despite… over 6 million IOPS worth of SSDs…)
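For perspective, a quick back-of-the-envelope on how little of the raw drive performance that is, using the figures from this thread (about 6 million raw IOPS, 50k now, 100k hoped for). The helper name is just for illustration.

```python
def ceph_efficiency(cluster_iops, raw_device_iops):
    """Fraction of the drives' combined rated IOPS the cluster delivers."""
    return cluster_iops / raw_device_iops


RAW_IOPS = 6_000_000  # rough sum of the drives' rated IOPS

for label, iops in [("current", 50_000), ("target", 100_000)]:
    print(f"{label}: {ceph_efficiency(iops, RAW_IOPS):.2%}")
# current: 0.83%
# target: 1.67%
```

So even the "happy" number is under 2% of what the SSDs could do on their own.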
