Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-3736

Workers should reconnect to Master if disconnected

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Critical
    • Resolution: Fixed
    • 1.0.2, 1.1.0
    • 1.2.0
    • Spark Core
    • None

    Description

      In standalone mode, when a worker gets disconnected from the master for some reason it never attempts to reconnect. In this situation you have to bounce the worker before it will reconnect to the master.

      The preferred alternative is to follow what Hadoop does – when there's a disconnect, attempt to reconnect at a particular interval until successful (I think it repeats indefinitely every 10sec).

      This has been observed by:

      Attachments

        Issue Links

          Activity

            People

              mccheah Matthew Cheah
              aash Andrew Ash
              Votes:
              1 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: