https://data-analysis.fedoraproject.org/csv-reports/countme/
Hasn't updated in two weeks. It should update every Thursday. Please add some monitoring to check. The file date should be sufficient -- but looking at the week of the last entry in totals.db with sqlite3 would be even better.
Metadata Update from @phsmoura: - Issue priority set to: Waiting on Assignee (was: Needs Review) - Issue tagged with: high-gain, medium-trouble
Metadata Update from @smooge: - Issue assigned to smooge
Looking at the directories, it looks like someone is trying to fix the backlog of data, but I am not sure. There is also a very long running process
countme 1707020 0.0 0.0 12852 2376 pts/0 S+ Sep24 0:07 /bin/sh -e ./mirrors-countme/scripts/countme-regenerate-dbs.sh
Not sure if that is a one and done script which should hve ended a long time ago or one which is always on
Metadata Update from @smooge: - Issue untagged with: high-gain, medium-trouble - Issue assigned to nphilipp (was: smooge) - Issue priority set to: Needs Review (was: Waiting on Assignee)
Metadata Update from @smooge: - Issue priority set to: Waiting on Assignee (was: Needs Review)
Metadata Update from @smooge: - Issue tagged with: high-gain, medium-trouble, ops
BTW, this is fixed, but there may be a gap we are still investigating.
The long running script is the import of all the old data for the new unique IP feature. It's importing into *-new.db files, so shouldn't affect any of the production processes.
So, whats left here? a nagios check on the file age of something in that dir? Should it alert after a day? more? Or is there any better way to monitor this and confirm it's still working ?
Metadata Update from @zlopez: - Issue assigned to dkirwan (was: nphilipp)
Hey @james any thoughts here on how we could monitor that things are working right?
For the output, monitoring the file timestamps on: https://data-analysis.fedoraproject.org/csv-reports/countme/
...should be a start, although if we can get it to do more that'd be nice. Looking into adding things to Fedora zabbix.
We are monitoring mtime's of a bunch of the files now...
https://nagios.fedoraproject.org/nagios/cgi-bin//status.cgi?host=log01.iad2.fedoraproject.org
Metadata Update from @james: - Issue close_status updated to: Fixed - Issue status updated to: Closed (was: Open)
Log in to comment on this ticket.