Issue #59: Determine monitoring requirements for Taskotron - taskotron

Closed: Fixed. Opened 9 years ago by tflink.

Before we move Taskotron to production, we need to have some monitoring in place so that there are notifications if/when stuff starts going down.

Determine what needs to be monitored and make sure that the various monitoring systems (nagios, most likely) are updated.

tflink commented 9 years ago

The things that come to mind for monitoring are:
HTTP ping for:
* taskotron master landing page
* resultsdb_frontend main page
* resultsdb landing page
ping for:
* taskotron-client
* taskotron-[dev,stg,prod]
* resultsdb-[dev,stg,prod]
* qa-db01.qa?
client status:
* could be done by scraping the client status page from master, querying the JSON API on master, or sending commands to each client

wishlist:
* make sure that trigger is working (not sure how to do that in a sane fashion)
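
For illustration, the HTTP ping checks listed above could be sketched roughly as follows. This is only a minimal sketch, not what was actually deployed: the function names (`http_ping`, `classify`, `check`) are hypothetical, and the mapping of 4xx responses to WARNING is an assumption. The only convention relied on is the standard Nagios plugin exit codes (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN).

```python
import urllib.error
import urllib.request

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3
LABELS = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL", UNKNOWN: "UNKNOWN"}


def classify(status_code):
    """Map an HTTP status code (or None if unreachable) to a Nagios state.

    Assumption: 2xx/3xx is healthy, 4xx is a warning, everything else
    (5xx or no response at all) is critical.
    """
    if status_code is None:
        return CRITICAL
    if 200 <= status_code < 400:
        return OK
    if 400 <= status_code < 500:
        return WARNING
    return CRITICAL


def http_ping(url, timeout=10):
    """Return the HTTP status code for url, or None if it cannot be reached."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status.
        return err.code
    except (urllib.error.URLError, OSError):
        # DNS failure, refused connection, timeout, and similar.
        return None


def check(url):
    """Return (nagios_state, one-line message) for a single landing page."""
    state = classify(http_ping(url))
    return state, "HTTP %s - %s" % (LABELS[state], url)
```

A wrapper script would call `check()` once per landing page (taskotron master, resultsdb_frontend, resultsdb), print the message, and exit with the returned state so that nagios can pick it up.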

tflink commented 9 years ago

There is a nagios plugin in the upstream buildbot repo:
https://github.com/buildbot/buildbot/blob/master/master/contrib/check_buildbot.py
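
Independently of that plugin, the client status check mentioned earlier could be sketched along these lines if the JSON route is taken. This is a rough sketch under assumptions: the payload shape (an object keyed by client name, each entry carrying a boolean `connected` field) is assumed rather than confirmed here, and `client_status` is a hypothetical name. Fetching the JSON from master is left out so that the logic can be exercised offline.

```python
import json

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def client_status(raw_json, expected):
    """Return (nagios_state, message) for the expected client names.

    Assumed payload shape: {"<client name>": {"connected": true, ...}, ...}
    A client that is missing from the payload counts as down.
    """
    try:
        payload = json.loads(raw_json)
    except ValueError:
        return UNKNOWN, "CLIENTS UNKNOWN - could not parse status JSON"
    if not isinstance(payload, dict):
        return UNKNOWN, "CLIENTS UNKNOWN - unexpected status JSON"

    down = sorted(
        name for name in expected
        if not (isinstance(payload.get(name), dict)
                and payload[name].get("connected"))
    )
    if not down:
        return OK, "CLIENTS OK - %d connected" % len(expected)

    # All expected clients down is critical; a partial outage is a warning.
    if len(down) == len(expected):
        return CRITICAL, "CLIENTS CRITICAL - down: %s" % ", ".join(down)
    return WARNING, "CLIENTS WARNING - down: %s" % ", ".join(down)
```

Treating a partial outage as a warning and a total outage as critical is a design choice for this sketch; a stricter setup could treat any disconnected client as critical.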

tflink commented 9 years ago

Monitoring setup ticket filed with Fedora infra:
https://fedorahosted.org/fedora-infrastructure/ticket/4541

tflink commented 9 years ago

The monitoring setup ticket was closed a while back - closing this.

Metadata Update from @tflink:
- Issue tagged with: infrastructure

6 years ago

Metadata
- Assignee: None
- Blocking: #57 Move Taskotron into Production
- Depending on: None
- Priority: Normal
