Monitoring reports a failing disk in the software RAID (md) device on the storinator2 storage box. That box serves some NFS exports as well as iSCSI targets. The disk is failing but has not failed completely, which causes remote iSCSI initiators to freeze and put their filesystems in read-only mode.
Metadata Update from @arrfab: - Issue assigned to arrfab
Metadata Update from @arrfab: - Issue tagged with: Business-As-Usual, centos-common-infra, high-gain, medium-trouble
For now, we decided to remove the faulty device (/dev/sdf) from the array and run the md device in degraded mode. This workaround resolves the remote clients' issues, but the md device is now degraded and no longer has any redundancy.
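As a rough illustration of how the degraded state could be spotted from a script, here is a minimal Python sketch that scans text in the `/proc/mdstat` format for arrays with missing members. The function name `degraded_arrays` and the `SAMPLE` text (array names, members, block counts) are hypothetical and not taken from storinator2.

```python
import re

# An array line looks like "md0 : active raid5 sdb[0] sdc[1] ..."
ARRAY_RE = re.compile(r"^(md\S+)\s*:")
# A status line ends with "[expected/active] [UUU_]"
STATUS_RE = re.compile(r"\[(\d+)/(\d+)\]\s+\[([U_]+)\]")


def degraded_arrays(mdstat_text):
    """Return {array_name: (expected, active)} for arrays missing members."""
    result = {}
    current = None
    for line in mdstat_text.splitlines():
        array = ARRAY_RE.match(line)
        if array:
            current = array.group(1)
            continue
        status = STATUS_RE.search(line)
        if status and current:
            expected, active = int(status.group(1)), int(status.group(2))
            # "_" marks a missing or failed slot in the member map
            if active < expected or "_" in status.group(3):
                result[current] = (expected, active)
            current = None
    return result


# Hypothetical sample, NOT the real storinator2 layout
SAMPLE = """\
Personalities : [raid6] [raid5] [raid4]
md0 : active raid5 sdb[0] sdc[1] sdd[2] sde[3]
      1000 blocks super 1.2 level 5, 512k chunk, algorithm 2 [5/4] [UUUU_]

md1 : active raid1 sdg[0] sdh[1]
      500 blocks super 1.2 [2/2] [UU]

unused devices: <none>
"""
```

On a real host the same function could be fed the contents of `/proc/mdstat`. With the sample above, only `md0` is reported, with 5 expected and 4 active members.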
Let's see what can be done: either replace the HDD (needs budget?), or, after moving the LUNs to another storage box, destroy the storinator2 md device completely and rebuild an md array on the remaining disks with different settings (raid10).
An extended smartctl self-test is currently running on the HDD to confirm the issue.
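For reference, a small Python sketch of how such a check might be scripted. It only builds the `smartctl` argument lists (`-t long` starts an extended self-test, `-H` prints the health summary) and parses the overall-health line that `smartctl -H` prints for ATA disks. The helper names are hypothetical, and the exact commands used on storinator2 are not recorded in this ticket.

```python
def long_test_cmd(device):
    """Argument list that starts an extended (long) SMART self-test."""
    return ["smartctl", "-t", "long", device]


def health_cmd(device):
    """Argument list that prints the SMART overall-health assessment."""
    return ["smartctl", "-H", device]


def health_passed(smartctl_output):
    """Return True/False from the ATA overall-health line, or None if absent."""
    for line in smartctl_output.splitlines():
        if "overall-health self-assessment test result" in line:
            return line.rsplit(":", 1)[1].strip() == "PASSED"
    return None
```

The argument lists can be passed to `subprocess.run(..., capture_output=True, text=True)` as root, for example `long_test_cmd("/dev/sdf")`. SCSI/SAS disks report health in a different format, which this sketch does not handle.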
smartctl
"It's dead, Jim": the HDD was removed from the array, all services were moved elsewhere (where possible), and the array was destroyed and recreated with fewer disks (from raid5 to raid10). We'll create another ticket for further actions, but closing this one now, as no service is impacted by this HDD issue anymore.
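To make the capacity trade-off of that raid5 to raid10 move concrete, here is a small Python sketch of the usual usable-capacity arithmetic. It assumes equal-sized disks and the default md raid10 layout with two copies of each block. The disk counts and sizes in the usage note are hypothetical, since the ticket does not state them.

```python
def usable_capacity(level, disks, disk_size_tb):
    """Approximate usable capacity in TB for an md array of equal-sized disks."""
    if level == "raid5":
        if disks < 3:
            raise ValueError("raid5 needs at least 3 disks")
        # One disk's worth of space is consumed by parity
        return (disks - 1) * disk_size_tb
    if level == "raid10":
        if disks < 2:
            raise ValueError("raid10 needs at least 2 disks")
        # Assumes 2 copies of every block (md default "near 2" layout)
        return disks * disk_size_tb / 2
    raise ValueError(f"unsupported level: {level}")
```

For example, with hypothetical 4 TB disks, `usable_capacity("raid5", 5, 4.0)` gives 16.0 TB, while `usable_capacity("raid10", 4, 4.0)` gives 8.0 TB: less space, but no parity computation on writes.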
Metadata Update from @arrfab: - Issue close_status updated to: Fixed - Issue status updated to: Closed (was: Open)