#992 Replace EC2 node in new OCP CI cluster in AWS
Closed: Fixed with Explanation a year ago by arrfab. Opened a year ago by arrfab.

We got a mail notification from AWS that there was an underlying issue on one the compute/worker nodes in the new ocp cluster.
We need to cordon/drain the node, evacuate pods, follow aws doc to have node stopped/restarted on other infra, and then add it back (uncordon) to ocp cluster after verification


Metadata Update from @arrfab:
- Issue assigned to arrfab

a year ago

Metadata Update from @arrfab:
- Issue tagged with: centos-ci-infra, high-gain, medium-trouble

a year ago

Node was temporary removed so all openshift pods are scheduled/migrated to remaining workers.
When trying to follow as procedure, it doesn't come back due to InsufficientInstanceCapacity error.
I'll have a look at that and then add node back in ocp cluster when it's possible to provision/restart it

it seems there is now enough capacity in region/availability zone to have instance back and running.
Added back to openshift so ocp.cloud.ci.centos.org ocp cluster running now as normal

Metadata Update from @arrfab:
- Issue close_status updated to: Fixed with Explanation
- Issue status updated to: Closed (was: Open)

a year ago

Login to comment on this ticket.

Metadata
Boards 1
CentOS CI Infra Status: Backlog