Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-15789

Vectorization: limit reduce vectorization to 32Mb chunks

Log workAgile BoardRank to TopRank to BottomBulk Copy AttachmentsBulk Move AttachmentsVotersWatch issueWatchersCreate sub-taskConvert to sub-taskMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 2.3.0
    • Vectorization
    • None
    • Vectorization: limit reduce vectorization to 32Mb chunks (Teddy Choi, via Gopal V)

    Description

      Reduce vectorization accumulates 1024 rows before forwarding it into the reduce processor.

      Add a safety limit for 32Mb of writables, so that shorter sequences can be forwarded into the operator trees.

              rowIdx++;
              if (rowIdx >= BATCH_SIZE) {
                VectorizedBatchUtil.setBatchSize(batch, rowIdx);
                reducer.process(batch, tag);
      

      Attachments

        1. HIVE-15789.2.patch
          4 kB
          Teddy Choi
        2. HIVE-15789.1.patch
          4 kB
          Teddy Choi

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            teddy.choi Teddy Choi Assign to me
            gopalv Gopal Vijayaraghavan
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment