#319 HDD failing in storinator2 (nfs/iscsid) storage box
Closed: Fixed 3 years ago by arrfab. Opened 3 years ago by arrfab.

We have a disk (reported by monitoring) that is failing in the software raid/md device in storinator2 storage box.
That box is used for some NFS exports but also iscsid targets.
One disk is currently failing (not completely failed) so causing remote iscsi initiators to freeze and put fs in read-only mode


Metadata Update from @arrfab:
- Issue assigned to arrfab

3 years ago

Metadata Update from @arrfab:
- Issue tagged with: Business-As-Usual, centos-common-infra, high-gain, medium-trouble

3 years ago

Currently decided to just put the md device in degraded mode, and removed the faulty device (/dev/sdf) from the array.
That workaround solves the remote clients issues but now md device is in degraded mode and no protection anymore.

Let's see what can be done to either replace the HDD (need budget ? ) or just destroy that storinator2 md device completely, rebuild a md array on remaining disk with different settings (raid10) after having moved LUNs on other storage box

There is currently a smartctl running test (extensive one) on HDD to really confirm the issue

"It's Dead Jim" so HDD was removed from array, all services moved to other place (when possible) and array destroyed and recreated with less disks (from raid5 to raid10).
We'll create other ticket for new actions to take but closing this one now (no service is still impacted by this hdd issue anymore)

Metadata Update from @arrfab:
- Issue close_status updated to: Fixed
- Issue status updated to: Closed (was: Open)

3 years ago

Log in to comment on this ticket.

Metadata
Boards 1