IMPALA / IMPALA-9410

Support resolving ORC file columns by names


Details

    Description

      Currently we resolve ORC file columns by index. We should provide a query option, like PARQUET_FALLBACK_SCHEMA_RESOLUTION for Parquet (IMPALA-2835), to resolve ORC file columns by name.

      Note that Hive only writes column names to ORC files starting with Hive 2.x (HIVE-4243). Older versions of Hive write placeholder column names such as _col0, _col1, ..., _col99. So this feature is only needed in deployments with Hive 2+.
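
      The difference between the two strategies can be illustrated with a minimal sketch. This is not Impala's implementation; the function names, the sample schemas, and the case-insensitive name comparison are all assumptions made for illustration.

```python
from typing import Dict, List, Optional


def resolve_by_position(table_cols: List[str], file_cols: List[str]) -> Dict[str, Optional[int]]:
    """Map each table column to the file column at the same index.

    Returns None for table columns beyond the end of the file schema.
    """
    return {
        name: (i if i < len(file_cols) else None)
        for i, name in enumerate(table_cols)
    }


def resolve_by_name(table_cols: List[str], file_cols: List[str]) -> Dict[str, Optional[int]]:
    """Map each table column to the file column with the same name.

    Matching is case-insensitive here (an assumption for this sketch).
    Returns None for table columns with no matching file column.
    """
    index_of = {name.lower(): i for i, name in enumerate(file_cols)}
    return {name: index_of.get(name.lower()) for name in table_cols}


# Hypothetical table schema whose column order differs from the file's.
table_schema = ["id", "name", "price"]
file_schema = ["id", "price", "name"]

# Positional resolution silently reads "price" data for the "name" column.
by_position = resolve_by_position(table_schema, file_schema)

# Name-based resolution finds the right file column for each table column.
by_name = resolve_by_name(table_schema, file_schema)

# A file with placeholder names offers nothing to match on, so every lookup misses.
old_hive_file = ["_col0", "_col1", "_col2"]
by_name_old = resolve_by_name(table_schema, old_hive_file)
```

      In this sketch, name-based resolution returns no match for any column of the file with placeholder names, which is why the feature only helps for files that carry real column names.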

          People

            gfurnstahl Gergely Fürnstáhl
            stigahuang Quanlong Huang