Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-21215

Read Parquet INT64 timestamp

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 4.0.0-alpha-1
    • None
    • None

    Description

      This patch enables Hive to start reading timestamps from Parquet written with the new semantics:

      With Parquet version 1.11, a new timestamp LogicalType with base INT64 and the following metadata is introduced:

      • boolean isAdjustedToUtc: marks whether the timestamp is converted to UTC (aka Instant semantics) or not (LocalDateTime semantics).
      • enum TimeUnit (NANOS, MICROS, MILLIS): granularity of timestamp

      Upon reading, the semantics of these new timestamps will be determined by their metadata, while the semantics of INT96 timestamps will continue to be deduced from the writer metadata.
      This feature will be behind a flag for now.

      Attachments

        1. HIVE-21215.1.patch
          18 kB
          Marta Kuczora
        2. HIVE-21215.2.patch
          19 kB
          Marta Kuczora

        Issue Links

          Activity

            People

              kuczoram Marta Kuczora
              klcopp Karen Coppage
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: