Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-3760

Predicate pushdown for columnar file formats

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.15.0
    • None
    • None

    Description

      From the conversation on dev@pig:

      "Partition pruning for ORC is not addressed in PIG-3558. We will need
      to do partition pruning for both ORC and Parquet in a new ticket.
      Curently there is no interface to deal with this kind of pushdown
      (LoadMetadata.setPartitionFilter push the filter to loader, but remove
      the filter statement, for ORC/Parquet, filter is a hint, and we need
      to do the filter again in Pig even it is pushed to loader), we will
      need to define a new interface for that. You are welcome to initiate
      the work. I know Aniket is also interested in doing that, so be sure
      the talk with him about this work.

      Thanks,
      Daniel

      On Mon, Feb 10, 2014 at 11:42 AM, Andrew Musselman
      <andrew.musselman@gmail.com> wrote:
      > I had a chat with a couple people last week about a feature request for
      > Pig: in a "where" or "filter" clause, when loading an ORC file, to skip
      > directly to the right offset instead of scanning the whole file."

      Attachments

        Issue Links

          Activity

            People

              rohini Rohini Palaniswamy
              andrew.musselman Andrew Musselman
              Votes:
              0 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: