#551 Adding monitoring endpoint for geoip backend replication
Closed: Fixed 3 years ago by arrfab. Opened 3 years ago by arrfab.

We recently suffered from a race condition in which our pdns nodes (having delegation for some A/AAAA records like mirror.stream.centos.org and others) had issues fetching authoritative backend file to reload zone in memory and so were still "functional" (from zabbix PoV) but serving outdated records as some were recently disabled but still appearing in the geoip round-robin dns setup.

While incident is already identified and fixed, we just need to add simple monitoring notification for when backend file can't be pulled and reloaded on pdns nodes


Metadata Update from @arrfab:
- Issue assigned to arrfab

3 years ago

Metadata Update from @arrfab:
- Issue tagged with: centos-common-infra, centos-stream, high-gain, low-trouble

3 years ago

The following commit implements zabbix trigger on backend refresh issue (tested and deployed)

Metadata Update from @arrfab:
- Issue close_status updated to: Fixed
- Issue status updated to: Closed (was: Open)

3 years ago

Log in to comment on this ticket.

Metadata
Boards 1