Uploaded image for project: 'Policy Framework'
  1. Policy Framework
  2. POLICY-501

Fix issues blocking election handler and add directed interface for opstate

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Done
    • Icon: Medium Medium
    • Beijing Release
    • None
    • None
    • None

      This bug tracks the AT&T bug 355533.  The symptom was that drools pdps that were backing each other up were becoming stuck in a standby state.  The cause was that the election handler thread was being hung by a call to PolicyEngine.manager.deactivate which shuts down the topic endpoints when the drools pdp operational state transitions to disabled. 

      Related problems were that the election handler heartbeat was NOT blocked when the main thread was blocked and the IntegrityMonitor forward progress counter was NOT blocked from incrementing when the election handler thread was blocked.  This prevented the correct failover of the drools pdp to another healthy one.

      This change fixes the two causes of the thread blockage, moves the election handler heartbeat to the main thread and adds an interface (AllSeemsWell) which is called when the election handler has stalled/resumed. The AllSeemsWell interface will block forward progress counter increments when ALLNOTWELL and will resume forward progress counter increments when ALLSEEMSWELL.

      In addition, it reduces the run time of the StandbyStateManagementTest from approximately 8 minutes to approximately 2 minutes.

      Since this changes classes also changed by POLICY-444, this change must be merged before POLICY-444 can be merged.

            km097d km097d
            km097d km097d
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

              Created:
              Updated:
              Resolved: