Issue #10099: bvmhost-x86-01.stg.iad2.fedoraproject.org lost a disk - fedora-infrastructure


Closed: Fixed 2 years ago by mobrien. Opened 2 years ago by kevin.

[303001.940819] sd 0:2:4:0: [sde] tag#221 CDB: Write(10) 2a 00 00 2e 2f e0 00 00 01 00
[303001.940822] blk_update_request: I/O error, dev sde, sector 3026912 op 0x1:(WRITE) flags 0x800 phys_seg 1 prio class 0
[303001.952803] md: super_written gets error=-5
[303001.958591] md/raid1:md1: Disk failure on sde2, disabling device.
                md/raid1:md1: Operation continuing on 9 devices.
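
For anyone who wants to spot this kind of event automatically, here is a minimal Python sketch that picks the md "Disk failure" line out of kernel log text. The function name `find_md_failures` and the regular expression are illustrative assumptions based only on the message format shown above; they are not part of any existing Fedora infrastructure tooling.

```python
import re

# Pattern modelled on the kernel message seen in the log above, e.g.
# "md/raid1:md1: Disk failure on sde2, disabling device."
# (Assumption: other md failure messages follow the same shape.)
FAILURE_RE = re.compile(
    r"md/(?P<level>raid\d+):(?P<array>md\d+): "
    r"Disk failure on (?P<member>\w+), disabling device"
)


def find_md_failures(log_text):
    """Return (array, raid_level, member) tuples, one per md disk-failure line."""
    return [
        (m.group("array"), m.group("level"), m.group("member"))
        for m in FAILURE_RE.finditer(log_text)
    ]
```

Feeding it the output of `dmesg` (or a saved copy of the log) would give a list of failed array members to follow up on.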

It may be worth rebooting it to see if the disk comes back and can be re-added to the RAID.
But the disk has likely failed, in which case we need Dell to send a replacement and a tech to install it.
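
To check after a reboot whether the array is still running short a member, something like the following sketch could be used. It assumes the usual `/proc/mdstat` layout, where each array's status line contains `[expected/active]` followed by a `[UU_U]`-style map; the helper name `degraded_arrays` is hypothetical.

```python
import re

# Matches the "[10/9] [UUUU_UUUUU]" part of an mdstat status line:
# first number = devices the array expects, second = devices currently active.
STATUS_RE = re.compile(r"\[(\d+)/(\d+)\]\s+\[([U_]+)\]")
HEADER_RE = re.compile(r"^(md\w+)\s*:")


def degraded_arrays(mdstat_text):
    """Return {array_name: (expected, active)} for arrays missing members."""
    result = {}
    current = None
    for line in mdstat_text.splitlines():
        header = HEADER_RE.match(line)
        if header:
            # Start of a new array stanza, e.g. "md1 : active raid1 ..."
            current = header.group(1)
            continue
        status = STATUS_RE.search(line)
        if status and current:
            expected, active = int(status.group(1)), int(status.group(2))
            if active < expected:
                result[current] = (expected, active)
    return result
```

On the host, this would be called with the contents of `/proc/mdstat`; an empty result would mean no array is reporting missing members.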

@mobrien do you want to take this one, or should I?

Metadata Update from @mohanboddu:
- Issue priority set to: Waiting on Assignee (was: Needs Review)
- Issue tagged with: medium-gain, medium-trouble, ops

2 years ago

kevin commented 2 years ago

I rebooted this on Friday and got the disk into the 'ready' state. It needs another reboot to try to re-add the disk on the RAID controller. I'll try to do this today.

Metadata Update from @kevin:
- Issue assigned to kevin

2 years ago

kevin commented 2 years ago

I think we need to reboot and try to re-add the disk.

@mobrien would you like to do this, or should I?

Perhaps we could schedule a time and walk pmoura through it?

kevin commented 2 years ago

I finally looked at this. I tried to re-add the disk but...

Disk 4 in Backplane 1 of Integrated RAID Controller 1 is not functioning correctly.

So, I think we need to call Dell and get a replacement. ;(

@mobrien do you want to do that? Or perhaps we could train up another person or two on this?