#854 The C9S x86_64 pool in CentOS CI is in a questionable state
Closed: Fixed 2 years ago by arrfab. Opened 2 years ago by mrc0mmand.

Hi!

I noticed that the vast majority of C9S x86_64 machines in the Duffy pool are in the Failed/Hardware_failure state, and only two of them remain operational:

$ ./agent-control.py --list=all |& grep 9-stream
INFO: 151 n24.crusty 172.19.2.24     crusty   6786 Failed           None     None None 9-stream   x86_64        0  2230 None    
INFO: 178 n51.crusty 172.19.2.51     crusty   6284 Deployed         4fedea80 None None 9-stream   x86_64        1  2500 None    
INFO: 181 n54.crusty 172.19.2.54     crusty   6284 Deployed         e43de48b None None 9-stream   x86_64        1  2530 None    
INFO: 191 n64.crusty 172.19.2.64     crusty   6786 Failed           None     None None 9-stream   x86_64        0  2630 None    
INFO: 206 n15.dusty  172.19.2.79     dusty    6786 Failed           None     None None 9-stream   x86_64        0  2140 None    
INFO: 256 n1.gusty   172.19.2.129    gusty    6786 Failed           None     None None 9-stream   x86_64        0  2000 None    
INFO: 257 n2.gusty   172.19.2.130    gusty    6786 Failed           None     None None 9-stream   x86_64        0  2010 None    
INFO: 258 n3.gusty   172.19.2.131    gusty    6786 Failed           None     None None 9-stream   x86_64        0  2020 None    
INFO: 259 n4.gusty   172.19.2.132    gusty    6786 Failed           None     None None 9-stream   x86_64        0  2030 None    
INFO: 260 n5.gusty   172.19.2.133    gusty    6786 Failed           None     None None 9-stream   x86_64        0  2040 None    
INFO: 261 n6.gusty   172.19.2.134    gusty    6786 Failed           None     None None 9-stream   x86_64        0  2050 None    
INFO: 263 n8.gusty   172.19.2.136    gusty    6786 Failed           None     None None 9-stream   x86_64        0  2070 None    
INFO: 264 n9.gusty   172.19.2.137    gusty    6786 Failed           None     None None 9-stream   x86_64        0  2080 None    
INFO: 265 n10.gusty  172.19.2.138    gusty    6786 Failed           None     None None 9-stream   x86_64        0  2090 None    
INFO: 266 n11.gusty  172.19.2.139    gusty    6786 Failed           None     None None 9-stream   x86_64        0  2100 None    
INFO: 269 n14.gusty  172.19.2.142    gusty    6786 Failed           None     None None 9-stream   x86_64        0  2130 None    
INFO: 270 n15.gusty  172.19.2.143    gusty    6786 Failed           None     None None 9-stream   x86_64        0  2140 None    
INFO: 277 n22.gusty  172.19.2.150    gusty    5734 Hardware_failure None     None None 9-stream   x86_64        1  2210 None    
INFO: 297 n42.gusty  172.19.2.170    gusty    6786 Failed           None     None None 9-stream   x86_64        0  2410 None    
INFO: 298 n43.gusty  172.19.2.171    gusty    6786 Failed           None     None None 9-stream   x86_64        0  2420 None    
INFO: 314 n59.gusty  172.19.2.187    gusty    5698 Hardware_failure None     None None 9-stream   x86_64        1  2580 None    
INFO: 356 n1.aah2    172.19.3.129    aah2       36 Active           None     None None 9-stream   aarch64       0     0 medium  
INFO: 367 n3.aah3    172.19.3.140    aah3       46 Ready            None     None None 9-stream   aarch64       1     0 medium  
INFO: 368 n4.aah3    172.19.3.141    aah3       45 Active           None     None None 9-stream   aarch64       0     0 medium  
INFO: 371 n7.aah3    172.19.3.144    aah3        9 Ready            None     None None 9-stream   aarch64       1     0 medium  
INFO: 372 n8.aah3    172.19.3.145    aah3        9 Ready            None     None None 9-stream   aarch64       1     0 medium  
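
For a quick per-state tally of a listing like this one, a minimal Python sketch could look like the following. The column positions are an assumption inferred only from the output above (they are not taken from agent-control.py itself), and `tally_states` is a hypothetical helper name. A row whose comment column contained spaces would shift the columns and be miscounted.

```python
from collections import Counter

# Assumed column positions (0-based, after splitting on whitespace),
# inferred only from the listing above: 6 = state, 10 = release, 11 = arch.
STATE_COL, RELEASE_COL, ARCH_COL = 6, 10, 11

# A few rows copied from the listing above, with whitespace collapsed.
SAMPLE = [
    "INFO: 151 n24.crusty 172.19.2.24 crusty 6786 Failed None None None 9-stream x86_64 0 2230 None",
    "INFO: 178 n51.crusty 172.19.2.51 crusty 6284 Deployed 4fedea80 None None 9-stream x86_64 1 2500 None",
    "INFO: 277 n22.gusty 172.19.2.150 gusty 5734 Hardware_failure None None None 9-stream x86_64 1 2210 None",
    "INFO: 356 n1.aah2 172.19.3.129 aah2 36 Active None None None 9-stream aarch64 0 0 medium",
]


def tally_states(lines, release="9-stream", arch="x86_64"):
    """Count nodes per state for one release/architecture pair."""
    counts = Counter()
    for line in lines:
        fields = line.split()
        if len(fields) <= ARCH_COL:
            continue  # skip lines that do not look like node rows
        if fields[RELEASE_COL] == release and fields[ARCH_COL] == arch:
            counts[fields[STATE_COL]] += 1
    return counts


if __name__ == "__main__":
    # Print states from most to least common for the x86_64 sample rows.
    for state, count in tally_states(SAMPLE).most_common():
        print(f"{state}: {count}")
```

Feeding the full listing to `tally_states` instead of `SAMPLE` would give the breakdown for the whole pool, and passing `arch="aarch64"` would do the same for the aarch64 nodes.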

I know the gusty chassis is out of commission, but would it be possible to bump the number of available C9S machines a bit?

Thanks!


There was a mail about the migration to the new version of Duffy, which should solve this issue.

Metadata Update from @zlopez:
- Issue priority set to: Waiting on Reporter (was: Needs Review)

2 years ago

Metadata Update from @arrfab:
- Issue tagged with: centos-ci-infra

2 years ago

Closing, per discussion with @mrc0mmand on the #centos-ci IRC channel.

Metadata Update from @arrfab:
- Issue close_status updated to: Fixed
- Issue status updated to: Closed (was: Open)

2 years ago


Metadata
Boards 1
CentOS CI Infra Status: Backlog