Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-26828

Coalesce to reduce partitions before writing to hive is not working

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Minor
    • Resolution: Cannot Reproduce
    • 2.3.0
    • None
    • Spark Core
    • None

    Description

      final_store.coalesce(5).write.mode("overwrite").insertInto("database.tablename",overwrite = True), this statement is not merging partitions. I've set 

      .config("spark.default.parallelism", "2000") \
      .config("spark.sql.shuffle.partitions", "2000") \

      however repartition is working but takes 20-25 minutes to insert.

      Attachments

        Activity

          People

            Unassigned Unassigned
            anusha39 Anusha Buchireddygari
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: