#6505 koji_wellness nagios plugin
Closed: Fixed 4 years ago by cverna. Opened 6 years ago by kevin.

We would like a nagios plugin script that would check all the following:

If any of these fail, return a error state to nagios with a message on why.


Hello, is anyone working on this issue? I would like to work on it as my first contribution :)

Not that I know of. Please take it and work on it.

Metadata Update from @nb:
- Issue assigned to lrossetti

6 years ago

Metadata Update from @kevin:
- Issue priority set to: Waiting on Asignee
- Issue tagged with: monitoring

6 years ago

Attaching the patch file.

I was able to test it on my local nagios instance but I didn't use the ansible playbooks so I may need some help with regards to e2e testing.

6505.patch

I appologize for how long it's taken to get back to this. :frowning:

This plugin looks pretty great!

In calling the check though we need to probibly use koji.fedoraproject.org instead of koji01.phx2.fedoraproject.org so we can make sure and test the endpoint on the proxies that everyone hits (and could go to koji01 or koji02).

Otherwise it looks great. You want to change that in the patch before I apply, or shall I just do so after I apply it?

I applied this, but we need more adjustment for the hostname change I'm afraid.

koji.fedoraproject.org isn't a valid host on the internal nagios. :(

I reverted it back until you or I have more time to look.

@kevin what do you think if we add a default var to store a list of koji hostnames?

Well, I don't think it will work for all of them... but I guess yeah, we could for those that it would work fine. (all of the names koji has that is).

Metadata Update from @cverna:
- Assignee reset
- Issue tagged with: backlog

4 years ago

Metadata Update from @cverna:
- Issue assigned to cverna

4 years ago

So I went ahead and just changed the hostname in the service definition (https://infrastructure.fedoraproject.org/cgit/ansible.git/commit/?id=82208b3d42de914ddf23333fc44fd6230608b23a)

That way the plugin still checks for koji.fedoraproject.org hitting the proxy (http requests) but nagios reports any problem with this service on the koji01.fp.o host.

This is now deployed and running. I ll close this ticket but we can reopen it if the above solution does not work.

Metadata Update from @cverna:
- Issue close_status updated to: Fixed
- Issue status updated to: Closed (was: Open)

4 years ago

Login to comment on this ticket.

Metadata
Attachments 3
Attached 6 years ago View Comment
Attached 5 years ago View Comment
Attached 5 years ago View Comment