  1. Spark
  2. SPARK-28149

Disable negative DNS caching

Details

    • Type: Improvement
    • Status: In Progress
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: 3.1.0
    • Fix Version/s: None
    • Component/s: Kubernetes, Spark Core
    • Labels: None

    Description

      By default, the JVM caches failed DNS resolutions (negative caching) for 10 seconds.

      The Alpine JDK used in the images for Kubernetes has a default timeout of 5 seconds.

      This means that in clusters with a slow init time (network sidecar pods, slow network start-up) the executor will never run: the first attempt to connect to the driver fails and that failure is cached, so the retries happen in a tight loop without actually attempting a new DNS resolution.
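
To confirm which negative TTL an image actually uses, the setting can be read from the JDK's java.security file, where stock JDK builds set the networkaddress.cache.negative.ttl property. The helper below is only a sketch: the function name negative_dns_ttl is hypothetical, and the path to java.security is an assumption that depends on the JDK version in the image.

```shell
#!/bin/sh
# Hypothetical helper (not part of Spark): print the value of
# networkaddress.cache.negative.ttl found in a java.security file.
negative_dns_ttl() {
  security_file="$1"
  # Keep the last uncommented assignment, since later entries in a
  # properties file override earlier ones.
  value="$(sed -n 's/^[[:space:]]*networkaddress\.cache\.negative\.ttl[[:space:]]*=[[:space:]]*//p' "$security_file" | tail -n 1)"
  # Report failure when the property is not set in this file.
  [ -n "$value" ] || return 1
  echo "$value"
}
```

For example, on a JDK 9 or later layout one might run negative_dns_ttl "$JAVA_HOME/conf/security/java.security" inside the container; on JDK 8 the file usually lives under jre/lib/security instead.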

       

      The proposed implementation is to have entrypoint.sh (which is exclusive to k8s) edit the JVM file that configures DNS caching, disabling negative caching when an environment variable named "DISABLE_DNS_NEGATIVE_CACHING" is defined.
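
A minimal sketch of what such an entrypoint.sh fragment could look like, not the actual patch. It assumes that the file to alter is the JDK's java.security and that setting networkaddress.cache.negative.ttl to 0 (never cache) is the intended way to disable negative caching. The function names and the candidate paths are illustrative assumptions; only DISABLE_DNS_NEGATIVE_CACHING comes from this issue.

```shell
#!/bin/sh
# Hypothetical helper: locate java.security for common JDK layouts.
find_java_security_file() {
  for candidate in \
    "${JAVA_HOME:-}/conf/security/java.security" \
    "${JAVA_HOME:-}/jre/lib/security/java.security" \
    "${JAVA_HOME:-}/lib/security/java.security"; do
    if [ -f "$candidate" ]; then
      echo "$candidate"
      return 0
    fi
  done
  return 1
}

# Hypothetical helper: set networkaddress.cache.negative.ttl=0 in the file.
disable_dns_negative_caching() {
  security_file="$1"
  [ -f "$security_file" ] || return 1
  tmp_file="$(mktemp)" || return 1
  # Drop any existing uncommented setting, then append the new value.
  grep -v '^[[:space:]]*networkaddress\.cache\.negative\.ttl[[:space:]]*=' \
    "$security_file" > "$tmp_file"
  echo 'networkaddress.cache.negative.ttl=0' >> "$tmp_file"
  # Write back in place so the original file permissions are preserved.
  cat "$tmp_file" > "$security_file"
  rm -f "$tmp_file"
}

# Only touch the JVM configuration when the variable is defined and non-empty.
if [ -n "${DISABLE_DNS_NEGATIVE_CACHING:-}" ]; then
  if security_file="$(find_java_security_file)"; then
    disable_dns_negative_caching "$security_file"
  else
    echo "java.security not found; negative DNS caching left unchanged" >&2
  fi
fi
```

Making the change opt-in through the environment variable keeps the default behaviour of existing images unchanged, which matches the proposal above.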

       

            People

              Assignee: Unassigned
              Reporter: Jose Luis Pedrosa (jlpedrosa)
              Votes: 0
              Watchers: 2
