Zpool drive replacement help

MrToast72🍞 · 1 year ago

Zpool drive replacement help

@mea_rah · 1 year ago

Unless the lsblk output is wrong, you have single drive RAID0 configured on all of the other drives. I’m not sure why would anyone do that, I’d expect all of them to be set up in similar fashion as the sdc is. It would explain the different device name.

The speed difference might or might not be same issue. It might be completely separate thing. Like a different drive record size or something like that, so it might be a good idea to troubleshoot each problem separately.

MrToast72🍞 · 1 year ago

I use to have it on a LSI raid card a long time ago before switching to a hba card. I had each drive passed through the lsi card as a single disk raid and then I used zfs to create a pool. I’m guessing this is what caused this now that I think about it.

I have like 9tb on this pool so moving everything off of it and then redoing the pool would be currently impossible so I wonder how I would fix this? Replace one drive at a time with some of my spares and swap them around?

@mea_rah · 1 year ago

Yeah one at a time would work, but it would be quite a bit of writing to rotate all.

As for the performance, are you replacing the failed drive with the same model or did you use a different one?

MrToast72🍞 · 1 year ago

Is it possible to replace a disk with the same disk? Like effectively wiping a disk and replacing it with its self so I don’t have to use up my spare drives as rotation drives and add needless wear to them?

I made sure to replace the dead drive with the exact same drive as the rest. All of them are Seagate 7200rpm 3tb SAS drives.

@calamityjanitor · 1 year ago

I’m not familiar with ZFS on Linux, but what is 9173635512214770897 referencing? The command is usually zpool replace pool device [new_device] So if you physically swapped out the old disk and put in a new one, you only need to specify the new disk. If you leave the old one plugged it you list both (old one first).

I don’t know what best practice is for specifying disks to ZFS on Linux, but arch wiki suggests not using /dev/sdc, but the ID instead https://wiki.archlinux.org/title/ZFS#Identify_disks

Also you don’t need to offline the pool to replace a disk, you can keep using it as it resilvers.

MrToast72🍞 · 1 year ago

The number you’re talking about is in reference to the old disk. I had swapped out the old drive and when I tried to run the command

Pool replace pool device [new_device]

It yelled at me that I needed to reference the old drive as well? So I’m not sure why it didn’t work the way you said it should

@calamityjanitor · 1 year ago

Huh annoying. You can run zdb -C TheMass To get more info about the pool and the disks in it. Might list enough disk detail to give you confidence it’s using the layout you want.

For me identifying disks usually ends up being unplugging them one by one and checking which shows OFFLINE. Could be worth the trouble to know for sure its specifying and using the disks.

In any case a good time to setup a backup for anything you can’t replace.

MrToast72🍞 · 1 year ago

Yeah that’s what I’m thinking of doing now, backup everything important just in case and continue to work on it.

Thankfully I can identify any disk rather easily with the command you mentioned (used it before to grab a drives serial number which is printed on the drive its self).