Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-2724

If an unreadable file is encountered during log aggregation then aggregated file in HDFS badly formed

Log workAgile BoardRank to TopRank to BottomAttach filesAttach ScreenshotBulk Copy AttachmentsBulk Move AttachmentsVotersWatch issueWatchersCreate sub-taskConvert to sub-taskMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 2.5.1
    • 2.6.0
    • log-aggregation
    • None
    • Reviewed

    Description

      Look into the log output snippet. It looks like there is an issue during aggregation when an unreadable file is encountered. Likely, this results in bad encoding.

      LogType: command-13.json
      LogLength: 13934
      Log Contents:
      Error aggregating log file. Log file : /grid/0/yarn/log/application_1413865041660_0002/container_1413865041660_0002_01_000004/command-13.json/grid/0/yarn/log/application_1413865041660_0002/container_1413865041660_0002_01_000004/command-13.json (Permission denied)command-3.json13983Error aggregating log file. Log file : /grid/0/yarn/log/application_1413865041660_0002/container_1413865041660_0002_01_000004/command-3.json/grid/0/yarn/log/application_1413865041660_0002/contaierrors-13.txt0660_0002_01_000004/command-3.json (Permission denied)
                    errors-3.txt0gc.log-20141021044514484052014-10-21T04:45:12.046+0000: 5.134: [GC2014-10-21T04:45:12.046+0000: 5.134: [ParNew: 163840K->15575K(184320K), 0.0488700 secs] 163840K->15575K(1028096K), 0.0492510 secs] [Times: user=0.06 sys=0.01, real=0.05 secs]
      2014-10-21T04:45:14.939+0000: 8.027: [GC2014-10-21T04:45:14.939+0000: 8.027: [ParNew: 179415K->11865K(184320K), 0.0941310 secs] 179415K->17228K(1028096K), 0.0943140 secs] [Times: user=0.13 sys=0.04, real=0.09 secs]
      2014-10-21T04:46:42.099+0000: 95.187: [GC2014-10-21T04:46:42.099+0000: 95.187: [ParNew: 175705K->12802K(184320K), 0.0466420 secs] 181068K->18164K(1028096K), 0.0468490 secs] [Times: user=0.06 sys=0.00, real=0.04 secs]
      

      Specifically, look at the text after the exception text. There should be two more entries for log files but none exist. This is likely due to the fact that command-13.json is expected to be of length 13934 but its is not as the file was never read.

      I think, it should have been

      LogType: command-13.json
      LogLength: <Length of the exception text>
      Log Contents:
      Error aggregating log file. Log file : /grid/0/yarn/log/application_1413865041660_0002/container_1413865041660_0002_01_000004/command-13.json/grid/0/yarn/log/application_1413865041660_0002/container_1413865041660_0002_01_000004/command-13.json (Permission denied)command-3.json13983Error aggregating log file. Log file : /grid/0/yarn/log/application_1413865041660_0002/container_1413865041660_0002_01_000004/command-3.json/grid/0/yarn/log/application_1413865041660_0002/contaierrors-13.txt0660_0002_01_000004/command-3.json (Permission denied)
      
      LogType: errors-3.txt
      LogLength:0
      Log Contents:
      
      LogType:gc.log
      LogLength:???
      Log Contents:
      ......-20141021044514484052014-10-21T04:45:12.046+0000: 5.134: [GC2014-10-21T04:45:12.046+0000: 5.134: [ParNew: 163840K- .......
      

      Attachments

        1. YARN-2724.1.patch
          6 kB
          Xuan Gong
        2. YARN-2724.2.patch
          6 kB
          Xuan Gong
        3. YARN-2724.3.patch
          6 kB
          Xuan Gong
        4. YARN-2724.4.patch
          6 kB
          Xuan Gong
        5. YARN-2724.5.patch
          6 kB
          Xuan Gong

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            xgong Xuan Gong Assign to me
            sumitmohanty Sumit Mohanty
            Votes:
            0 Vote for this issue
            Watchers:
            8 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment