Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-12971 Hive Support for Kudu
  3. HIVE-22362

Support key-range splitting by size the HiveKuduInputFormat

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      In order to allow for more parallelism and predictable task sizes we should support Kudu key range splitting to allow more parallel tasks per tablet. Without this the parallelism is limited by the number of tablets to scan.

      The implementation is like similar to the Spark implementation here:
      https://github.com/apache/kudu/commit/22a6faa44364dec3a171ec79c15b814ad9277d8f

      Attachments

        Activity

          People

            Unassigned Unassigned
            granthenke Grant Henke
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: