#1214 Server status reset in failover might cause an infinite loop
Closed: Fixed None Opened 8 years ago by jhrozek.

A server status in fail over is reset from "not working" to "unknown" after an interval, which is currently hardcoded to 30 seconds. This might be a problem when it takes a long time for individual server tries.

For example, the default krb5_child timeout is 15 seconds, so when there is more than 2 servers in the fail over chain, the first server is always reset to the "default" state when the back end completes a full cycle over the servers.

We should modify the fail over (or the be_fo_ wrapper) to only allow one full cycle over the server list.

Fields changed

milestone: NEEDS_TRIAGE => SSSD 1.8.1 (LTM)
owner: somebody => jhrozek
priority: major => blocker

Fields changed

status: new => assigned

Fixed by:
- master
- b300bad
- 606d2d0
- sssd-1-8
- b3cd4ec
- 8e59877

component: SSSD => Failover
patch: 0 => 1
resolution: => fixed
status: assigned => closed

Metadata Update from @jhrozek:
- Issue assigned to jhrozek
- Issue set to the milestone: SSSD 1.8.1 (LTM)

3 years ago

SSSD is moving from Pagure to Github. This means that new issues and pull requests
will be accepted only in SSSD's github repository.

This issue has been cloned to Github and is available here:
- https://github.com/SSSD/sssd/issues/2256

If you want to receive further updates on the issue, please navigate to the github issue
and click on subscribe button.

Thank you for understanding. We apologize for all inconvenience.

Login to comment on this ticket.