Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-19113

Bucketing: Make CLUSTERED BY do CLUSTER BY if no explicit sorting is specified

Log workAgile BoardRank to TopRank to BottomBulk Copy AttachmentsBulk Move AttachmentsVotersWatch issueWatchersCreate sub-taskConvert to sub-taskMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    Description

      The user's expectation of

      "create external table bucketed (key int) clustered by (key) into 4 buckets stored as orc;"

      is that the table will cluster the key into 4 buckets, while the file layout does not do any actual clustering of rows.

      In the absence of a "SORTED BY", this can automatically do a "SORTED BY (key)" to cluster the keys within the file as expected.

      Attachments

        1. HIVE-19113.01.patch
          222 kB
          jcamachorodriguez
        2. HIVE-19113.patch
          22 kB
          jcamachorodriguez

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            jcamacho Jesús Camacho Rodríguez Assign to me
            gopalv Gopal Vijayaraghavan
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

              Estimated:
              Original Estimate - Not Specified
              Not Specified
              Remaining:
              Remaining Estimate - 0h
              0h
              Logged:
              Time Spent - 1h 20m
              1h 20m

              Slack

                Issue deployment