#10099 bvmhost-x86-01.stg.iad2.fedoraproject.org lost a disk
Closed: Fixed 2 years ago by mobrien. Opened 2 years ago by kevin.

[303001.940819] sd 0:2:4:0: [sde] tag#221 CDB: Write(10) 2a 00 00 2e 2f e0 00 00 01 00
[303001.940822] blk_update_request: I/O error, dev sde, sector 3026912 op 0x1:(WRITE) flags 0x800 phys_seg 1 prio class 0
[303001.952803] md: super_written gets error=-5
[303001.958591] md/raid1:md1: Disk failure on sde2, disabling device.
                md/raid1:md1: Operation continuing on 9 devices.

It may be worth rebooting it to see if the disk comes back and can be re-added to the raid.
But likely the disk has failed and we need to get dell to send a replacement and tech to replace it.

@mobrien you want to take this one? or want me to?


Metadata Update from @mohanboddu:
- Issue priority set to: Waiting on Assignee (was: Needs Review)
- Issue tagged with: medium-gain, medium-trouble, ops

2 years ago

I rebooted this friday and got the disk into 'ready' state. It needs another reboot to try and re-add that disk on the raid controller. I'll try and do this today.

Metadata Update from @kevin:
- Issue assigned to kevin

2 years ago

I think we need to reboot and try and re-add the disk.

@mobrien would you like to do this? or want me to?

perhaps we could schedule a time and walk pmoura through it?

I finally looked at this. I tried to re-add the disk but...

Disk 4 in Backplane 1 of Integrated RAID Controller 1 is not functioning correctly.

So, I think we need to call dell and get a replacement. ;(

@mobrien you want to do that? or perhaps we could train up another person or two on this ?

download-rdu01 also hasa disk issue

Also virthost-cc-rdu01.fedoraproject.org

[backlog refinement]
There are now 3 machines that needs a disk replacement.

All from this host will be migrated to a new host as this is out of warranty and will no longer be needed

Metadata Update from @mobrien:
- Issue close_status updated to: Fixed
- Issue status updated to: Closed (was: Open)

2 years ago

Login to comment on this ticket.

Metadata
Boards 1
ops Status: Backlog