Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-2503

Lower shuffle output buffer (spark.shuffle.file.buffer.kb)

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • None
    • Shuffle, Spark Core

    Description

      Each shuffle output stream uses a 100KB buffer. This is simply too large. For 10k reducers, 32 cores, that adds up to 32GB in buffer alone.

      The change is simple, but we should make sure there isn't much performance regression when lowering the buffer size.

      Attachments

        Issue Links

          Activity

            People

              rxin Reynold Xin
              rxin Reynold Xin
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: