We recently suffered from a race condition in which our pdns nodes (having delegation for some A/AAAA records like mirror.stream.centos.org and others) had issues fetching authoritative backend file to reload zone in memory and so were still "functional" (from zabbix PoV) but serving outdated records as some were recently disabled but still appearing in the geoip round-robin dns setup.
While incident is already identified and fixed, we just need to add simple monitoring notification for when backend file can't be pulled and reloaded on pdns nodes
Metadata Update from @arrfab: - Issue assigned to arrfab
Metadata Update from @arrfab: - Issue tagged with: centos-common-infra, centos-stream, high-gain, low-trouble
The following commit implements zabbix trigger on backend refresh issue (tested and deployed)
Metadata Update from @arrfab: - Issue close_status updated to: Fixed - Issue status updated to: Closed (was: Open)
Log in to comment on this ticket.