#7635 Add monitoring for httpd on certgetter
Closed: Fixed 9 months ago by kevin. Opened 9 months ago by mizdebsk.

httpd on certgetter was stopped for more than two weeks (from Feb 27 to Mar 13), and we did not get any Nagios alerts about it. This eventually led to an outage that could have been prevented if monitoring had been set up.


Hi,

first patch in fedora-infrastructure!

From 191cd95874e5f2f0141032cc211f2b405255df6d Mon Sep 17 00:00:00 2001
From: Alessandro Lorenzi <alorenzi@alorenzi.eu>
Date: Sat, 16 Mar 2019 22:10:12 +0100
Subject: [PATCH] Monitoring: add service certgetter http

Refs: #7635
---
 roles/nagios_server/files/nagios/services/certgetter.cfg | 6 ++++++
 1 file changed, 6 insertions(+)
 create mode 100644 roles/nagios_server/files/nagios/services/certgetter.cfg

diff --git a/roles/nagios_server/files/nagios/services/certgetter.cfg b/roles/nagios_server/files/nagios/services/certgetter.cfg
new file mode 100644
index 000000000..2b143ed23
--- /dev/null
+++ b/roles/nagios_server/files/nagios/services/certgetter.cfg
@@ -0,0 +1,6 @@
+define service {
+  host_name             certgetter01.phx2.fedoraproject.org
+  service_description   certgetter-http
+  check_command         check_http!certgetter01.phx2.fedoraproject.org
+  use                   defaulttemplate
+}
-- 
2.17.1
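
For readers unfamiliar with what a `check_http` service check reports, here is a minimal Python sketch of the idea. It is not the actual Nagios plugin: the function names `classify_status` and `check_http` and the status thresholds are assumptions made for illustration, only roughly modelled on the plugin's usual behaviour. The exit codes 0, 1, and 2 are the standard Nagios plugin states for OK, WARNING, and CRITICAL.

```python
import urllib.request
import urllib.error

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL = 0, 1, 2


def classify_status(status):
    """Map an HTTP status code to a Nagios-style state (illustrative thresholds)."""
    if 200 <= status < 400:
        return OK
    if 400 <= status < 500:
        return WARNING
    return CRITICAL


def check_http(url, timeout=10.0):
    """Probe `url` once and return (exit_code, message)."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            status = resp.status
    except urllib.error.HTTPError as exc:
        # The server answered, but with an error status (4xx/5xx).
        status = exc.code
    except (urllib.error.URLError, OSError) as exc:
        # Connection refused, timeout, DNS failure: what a stopped httpd looks like.
        return CRITICAL, f"HTTP CRITICAL - {exc}"
    code = classify_status(status)
    label = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL"}[code]
    return code, f"HTTP {label} - status {status}"
```

With a check like this running on a schedule, a stopped httpd would surface as a CRITICAL result on the first failed probe rather than going unnoticed for two weeks.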

@neron awesome! Thanks for the patch...

However, we are in Beta freeze right now and noc01 (where this would be) is frozen. :broken_heart:

You can, however, send a freeze break request to the fedora-infrastructure list to apply this anyway.
Then, if it gets two +1s, I can apply it for you...

Or we can just wait until after freeze.

hi, this should be closed by commit d9d24d08d9d26f66791ed0b6934f9e3f3520640a

Yep. Thanks much for the patch!


Metadata Update from @kevin:
- Issue close_status updated to: Fixed
- Issue status updated to: Closed (was: Open)

9 months ago
