Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-12171

Slow CSV output in Impala-shell

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • Clients
    • None
    • ghx-label-12

    Description

      Delimited output is much faster then pretty printing, but can still be the bottleneck when printing large result sets (50-80% of CPU time in impala-shell).

      Was able to significantly improve performance by rewriting adding custom CSV writer to DelimitedOutputFormatter:
      https://gerrit.cloudera.org/#/c/19894/

      Attachments

        Activity

          People

            Unassigned Unassigned
            csringhofer Csaba Ringhofer
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: