Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-21722

REPL:: logs are missing in hiveStatement.getQueryLog output during parallel execution mode.

Log workAgile BoardRank to TopRank to BottomBulk Copy AttachmentsBulk Move AttachmentsVotersWatch issueWatchersCreate sub-taskConvert to sub-taskMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    Description

      getQueryLog only reads logs from Background thread scope. If parallel execution is set to true, a new thread is created for execution and all the logs added by the new thread are not added to the parent  Background thread scope. In replication scope, replStateLogTasks are started in parallel mode causing the logs to be skipped from getQueryLog scope. 

      There is one more issue, with the conf is not passed while creating replStateLogTask during bootstrap load end. The same issue is there with event load during incremental load. The incremental load end log task is created with the proper config. 

      Attachments

        1. HIVE-21722.02.patch
          12 kB
          mahesh kumar behera
        2. HIVE-21722.01.patch
          10 kB
          mahesh kumar behera

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            maheshk114 mahesh kumar behera Assign to me
            maheshk114 mahesh kumar behera
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

              Estimated:
              Original Estimate - Not Specified
              Not Specified
              Remaining:
              Remaining Estimate - 0h
              0h
              Logged:
              Time Spent - 3h
              3h

              Slack

                Issue deployment