Issue #992: Replace EC2 node in new OCP CI cluster in AWS - centos-infra

#992 Replace EC2 node in new OCP CI cluster in AWS

Closed: Fixed with Explanation a year ago by arrfab. Opened a year ago by arrfab.

We got a mail notification from AWS that there was an underlying issue on one of the compute/worker nodes in the new OCP cluster.
We need to cordon and drain the node (evacuating its pods), follow the AWS documentation to have the instance stopped and restarted on other infrastructure, and then, after verification, add it back (uncordon) to the OCP cluster.
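
For reference, the sequence described above could be sketched roughly as follows. This is only an illustration, not the exact procedure that was run for this ticket: the node name and instance ID are hypothetical placeholders, the instance is assumed to be EBS-backed (so that a full stop/start, unlike a reboot, can land it on different underlying hardware), and the drain flags are assumptions that depend on the `oc` version (older clients use `--delete-local-data` instead of `--delete-emptydir-data`).

```python
import shlex
import subprocess


def replacement_steps(node, instance_id):
    """Return the ordered CLI commands for moving one worker to other hardware."""
    return [
        # Mark the node unschedulable, then evict its pods onto the other workers.
        ["oc", "adm", "cordon", node],
        ["oc", "adm", "drain", node, "--ignore-daemonsets", "--delete-emptydir-data"],
        # Stop the instance completely, then start it again.
        ["aws", "ec2", "stop-instances", "--instance-ids", instance_id],
        ["aws", "ec2", "wait", "instance-stopped", "--instance-ids", instance_id],
        ["aws", "ec2", "start-instances", "--instance-ids", instance_id],
        ["aws", "ec2", "wait", "instance-running", "--instance-ids", instance_id],
        # Check the node status by hand before letting pods back onto it.
        ["oc", "get", "node", node],
        ["oc", "adm", "uncordon", node],
    ]


def run_steps(steps, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in steps:
        print(shlex.join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Hypothetical node name and instance ID, printed only (dry run).
    run_steps(replacement_steps("worker-1.example", "i-0123456789abcdef0"))
```

The dry run is the default so the commands can be reviewed first. The `oc get node` step only prints the status, so checking that the node is `Ready` before the uncordon remains a manual step.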

Metadata Update from @arrfab:
- Issue assigned to arrfab

a year ago

Metadata Update from @arrfab:
- Issue tagged with: centos-ci-infra, high-gain, medium-trouble

a year ago

arrfab commented a year ago

The node was temporarily removed, so all OpenShift pods have been scheduled/migrated to the remaining workers.
When trying to follow the AWS procedure, the instance doesn't come back, due to an InsufficientInstanceCapacity error.
I'll have a look at that and then add the node back to the OCP cluster once it's possible to provision/restart it.
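
One possible way to handle the InsufficientInstanceCapacity case is simply to retry the start request after a pause until capacity is available again. The sketch below assumes the AWS CLI is used and that the error code appears in its stderr output. The function name, attempt count and delay are hypothetical choices, not what was actually done here.

```python
import subprocess
import time


def start_with_retry(instance_id, attempts=10, delay=300,
                     run=subprocess.run, sleep=time.sleep):
    """Try to start the instance, retrying while EC2 reports no capacity.

    Returns True once a start request is accepted, or False if every attempt
    failed with InsufficientInstanceCapacity. Any other failure raises.
    """
    cmd = ["aws", "ec2", "start-instances", "--instance-ids", instance_id]
    for attempt in range(1, attempts + 1):
        result = run(cmd, capture_output=True, text=True)
        if result.returncode == 0:
            return True
        # Assumption: the CLI prints the error code on stderr.
        if "InsufficientInstanceCapacity" not in result.stderr:
            raise RuntimeError(result.stderr.strip())
        if attempt < attempts:
            sleep(delay)  # wait before asking for capacity again
    return False
```

The `run` and `sleep` parameters exist only so the loop can be exercised without calling AWS. In normal use they keep their defaults.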

arrfab commented a year ago

It seems there is now enough capacity in the region/availability zone to get the instance back up and running.
The node was added back to OpenShift, so the ocp.cloud.ci.centos.org OCP cluster is now running as normal.

Metadata Update from @arrfab:
- Issue close_status updated to: Fixed with Explanation
- Issue status updated to: Closed (was: Open)

a year ago

Metadata

- Assignee: arrfab
- Tags: centos-ci-infra, high-gain, medium-trouble
- Blocking: None
- Depending on: None
- Priority: 🔥 Urgent 🔥
- Boards: CentOS CI Infra (Status: Backlog)
