Data Movement as a Platform / DMAAP-844

DMAAP service fails intermittently and cannot be restored by helm deploy after failure


    • Type: Bug
    • Resolution: Done
    • Priority: Medium
    • Casablanca Release
    • Casablanca Release
    • OOM on Windriver lab

      In OOM deployments, DMAAP fails intermittently. When it fails, subscribers do not receive published events, and service distribution fails.

      Worse, the DMAAP service cannot be reliably restored after a failure by redeploying it with helm, using commands like the following:

      root@oom-rancher:~# helm delete dev-dmaap --purge

      root@oom-rancher:~# kubectl -n onap delete pvc dev-dmaap-dbc-pg-data-dev-dmaap-dbc-pg-0
      persistentvolumeclaim "dev-dmaap-dbc-pg-data-dev-dmaap-dbc-pg-0" deleted
      root@oom-rancher:~# kubectl -n onap delete pvc dev-dmaap-dbc-pg-data-dev-dmaap-dbc-pg-1
      persistentvolumeclaim "dev-dmaap-dbc-pg-data-dev-dmaap-dbc-pg-1" deleted

      root@oom-rancher:~# helm deploy dev local/onap -f /root/integration-override.yaml --namespace onap

      There is currently no known workaround that reliably restores the service, other than reinstalling ONAP.

      The message-router application.log is attached. 
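For anyone trying to reproduce this, the redeploy sequence quoted in the description could be scripted roughly as below. This is only a sketch of the same steps, not a fix: the report states that this sequence does not restore the service reliably. The release name, namespace and override file are taken from the transcript. The `DRY_RUN` switch, the `COMPONENT` variable and the function names `run`, `list_pvcs`, `filter_pvcs` and `redeploy` are hypothetical helpers introduced here. It also assumes that every PVC whose name starts with `dev-dmaap-` belongs to the DMAAP release.

```shell
#!/bin/sh
# Sketch of the redeploy sequence from the description (assumptions noted above).
RELEASE="${RELEASE:-dev}"
COMPONENT="${COMPONENT:-dmaap}"
NAMESPACE="${NAMESPACE:-onap}"
OVERRIDES="${OVERRIDES:-/root/integration-override.yaml}"
DRY_RUN="${DRY_RUN:-1}"   # hypothetical switch: 1 = only print commands, 0 = run them

# Print a command; execute it only when DRY_RUN=0.
run() {
    echo "+ $*"
    if [ "$DRY_RUN" = "0" ]; then
        "$@"
    fi
}

# List all PVCs in the namespace as "<resource>/<name>" lines.
# Read-only, so it is executed even in dry-run mode.
list_pvcs() {
    kubectl -n "$NAMESPACE" get pvc -o name
}

# Keep only names starting with "<release>-<component>-", dropping the
# "<resource>/" prefix that "-o name" prints.
filter_pvcs() {
    sed -n "s|^[^/]*/\(${RELEASE}-${COMPONENT}-.*\)\$|\1|p"
}

# Same order as in the description: delete release, delete PVCs, deploy again.
redeploy() {
    run helm delete "${RELEASE}-${COMPONENT}" --purge
    for pvc in $(list_pvcs | filter_pvcs); do
        run kubectl -n "$NAMESPACE" delete pvc "$pvc"
    done
    run helm deploy "$RELEASE" local/onap -f "$OVERRIDES" --namespace "$NAMESPACE"
}
```

After sourcing the file, calling `redeploy` prints the commands it would run; `DRY_RUN=0` set beforehand makes it execute them. Matching PVCs by prefix avoids naming each `dev-dmaap-dbc-pg-data-...` claim by hand, but it will also delete any other claim that shares the prefix.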

       

            sunil.unnava
            xuyang11