Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-47232

Spark-UI stages and other tabs not accessible in standalone mode when reverse-proxy is enabled

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 3.4.2, 3.4.0
    • None
    • None

    Description

      When enabled reverse proxy in master and worker configOptions. We're not able to access different tabs available in spark UI e.g.(stages, environment, storage etc.)

      We're deploying spark through bitnami helm chart : https://github.com/bitnami/charts/tree/main/bitnami/spark

      Name and Version

      bitnami/spark - 6.0.0

      What steps will reproduce the bug?

      Kubernetes Version: 1.25
      Spark: 3.4.2
      Helm chart: 6.0.0

      Steps to reproduce:
      After installing the chart Spark Cluster(Master and worker) UI is available at:


      https://spark.staging.abc.com/

      We are able to access running application by click on applicationID under Running Applications link:

      We can access spark UI by clicking Application Detail UI:

      We are taken to jobs tab when we click on Application Detail UI


      URL looks like: https://spark.staging.abc.com/proxy/app-20240208103209-0030/stages/

      When we click any of the tab from spark UI e.g. stages or environment etc, it takes us back to spark cluster UI page
      We noticed that endpoint changes to 


      https://spark.staging.abc.com/stages/ 
      instead of 
      https://spark.staging.abc.com/proxy/app-20240208103209-0030/stages/

       

      Are you using any custom parameters or values?

      Configurations set in values.yaml
      ```
      master:
        configOptions:
          -Dspark.ui.reverseProxy=true
          -Dspark.ui.reverseProxyUrl=https://spark.staging.abc.com

      worker:
        configOptions:
          -Dspark.ui.reverseProxy=true
          -Dspark.ui.reverseProxyUrl=https://spark.staging.abc.com

      service:
        type: ClusterIP
        ports:
          http: 8080
          https: 443
          cluster: 7077

      ingress:

        enabled: true
        pathType: ImplementationSpecific
        apiVersion: ""
        hostname: spark.staging.abc.com
        ingressClassName: "staging"
        path: /
      ```

       

      What is the expected behavior?

      Expected behaviour is that when I click on stages tab, instead of taking me to 
      https://spark.staging.abc.com/stages/ 
      it should take me to following URL:
      https://spark.staging.abc.com/proxy/app-20240208103209-0030/stages/

      What do you see instead?

      current behaviour is it takes me to URL: https://spark.staging.abc.com/stages/ , which shows spark cluster UI with master and worker details

      Attachments

        1. Screenshot 2024-02-09 at 5.05.48 PM.png
          396 kB
          Sharad Mishra
        2. Screenshot 2024-02-09 at 5.04.49 PM.png
          413 kB
          Sharad Mishra
        3. Screenshot 2024-02-09 at 5.09.28 PM.png
          260 kB
          Sharad Mishra
        4. Screenshot 2024-02-09 at 5.13.01 PM.png
          325 kB
          Sharad Mishra

        Activity

          People

            Unassigned Unassigned
            sharadmishra Sharad Mishra
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: