httpd on certgetter was stopped for more than 2 weeks (from Feb 27 to Mar 13), and we did not get any Nagios alerts about it. This eventually led to an outage that would have been prevented if we had monitoring set up.
Hi,
first patch in fedora-infrastructure!
From 191cd95874e5f2f0141032cc211f2b405255df6d Mon Sep 17 00:00:00 2001
From: Alessandro Lorenzi <alorenzi@alorenzi.eu>
Date: Sat, 16 Mar 2019 22:10:12 +0100
Subject: [PATCH] Monitoring: add service certgetter http

Refs: #7635
---
 roles/nagios_server/files/nagios/services/certgetter.cfg | 6 ++++++
 1 file changed, 6 insertions(+)
 create mode 100644 roles/nagios_server/files/nagios/services/certgetter.cfg

diff --git a/roles/nagios_server/files/nagios/services/certgetter.cfg b/roles/nagios_server/files/nagios/services/certgetter.cfg
new file mode 100644
index 000000000..2b143ed23
--- /dev/null
+++ b/roles/nagios_server/files/nagios/services/certgetter.cfg
@@ -0,0 +1,6 @@
+define service {
+  host_name             certgetter01.phx2.fedoraproject.org
+  service_description   certgetter-http
+  check_command         check_http!certgetter01.phx2.fedoraproject.org
+  use                   defaulttemplate
+}
-- 
2.17.1
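For readers unfamiliar with what an HTTP check like this catches, here is a rough Python sketch of the idea: probe the host over HTTP and turn the result into a Nagios-style exit code. It is not the `check_http` plugin's actual implementation. The function names `classify` and `check_http`, the default port and timeout, and the status-to-severity mapping are all assumptions made for illustration.

```python
import http.client

# Nagios plugin exit codes: 0 = OK, 1 = WARNING, 2 = CRITICAL.
OK, WARNING, CRITICAL = 0, 1, 2
LABELS = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL"}


def classify(status):
    """Map an HTTP status code to an exit code (assumed, simplified mapping)."""
    if 200 <= status < 400:
        return OK
    if 400 <= status < 500:
        return WARNING
    return CRITICAL


def check_http(host, port=80, timeout=10.0):
    """Hypothetical probe: GET / on host and return (exit_code, message)."""
    try:
        conn = http.client.HTTPConnection(host, port, timeout=timeout)
        try:
            conn.request("GET", "/")
            status = conn.getresponse().status
        finally:
            conn.close()
    except (OSError, http.client.HTTPException) as exc:
        # A stopped httpd shows up here, typically as a refused connection.
        return CRITICAL, f"HTTP CRITICAL - {exc}"
    code = classify(status)
    return code, f"HTTP {LABELS[code]} - status {status}"


if __name__ == "__main__":
    import sys

    exit_code, message = check_http("certgetter01.phx2.fedoraproject.org")
    print(message)
    sys.exit(exit_code)
```

With httpd stopped, as in the incident described in this ticket, a probe of this kind fails to connect and reports CRITICAL, which is the alert that was missing.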
@neron awesome! Thanks for the patch...
However, we are in Beta freeze right now and noc01 (where this would be) is frozen. :broken_heart:
You can, however, send a freeze break request to the fedora-infrastructure list to apply this anyhow. Then when/if it gets 2 +1s I can apply it for you...
Or we can just wait until after freeze.
hi, this should be closed by commit d9d24d08d9d26f66791ed0b6934f9e3f3520640a
Yep. Thanks much for the patch!
Metadata Update from @kevin: - Issue close_status updated to: Fixed - Issue status updated to: Closed (was: Open)