Uploaded image for project: 'Apache YuniKorn'
  1. Apache YuniKorn
  2. YUNIKORN-1997

Yunikorn Status page fails when scheduler is not healthy

    XMLWordPrintableJSON

Details

    Description

      when atleast 1 component of scheduler health check status updates to succeeded:"false", status page stops working with error - "Oops, something went wrong. YuniKorn is not accessible."

      Rest Call shows on both 9889 and 9080 port - 

      {"status_code":503,"message":"Scheduler is not healthy","description":"Scheduler is not healthy"}

      All other rest calls work fine on both ports.

      The 503 most likely causes the code in the web UI to shut down without a possibility to start again.
      I do not see any code that would handle/retry the 503. It also does not make sense for me to return that code in this case

      Attachments

        1. patch.txt
          1.0 kB
          Sidharth Mereddy
        2. Before patch.png
          107 kB
          Sidharth Mereddy
        3. After patch.png
          259 kB
          Sidharth Mereddy

        Issue Links

          Activity

            People

              Cliff Su Cliff Su
              sidharthmereddy Sidharth Mereddy
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: