Uploaded image for project: 'Tajo (Retired)'
  1. Tajo (Retired)
  2. TAJO-2063

Refactor FileTablespace::commitOutputData.

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Won't Fix
    • None
    • None
    • Storage
    • None

    Description

      FileTablespace::commitOutputData has some problems as following:

      First, it is too long and complexed because it handles various cases in a single method. We need to refactor this method into several small and well-defined methods.

      Second, FileSystem::listStatus is widely used while committing output data. It especially causes a lot of overhead to list partitioned directories in S3. Also it occurs in HDFS too with large partitioned tables. We need to minimize its usage.

      Attachments

        Issue Links

          Activity

            People

              blrunner JaeHwa Jung
              blrunner JaeHwa Jung
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: