Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-9412

Avoid unnecessary resource transfer from probe side to output for right semi joins

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Trivial
    • Resolution: Unresolved
    • None
    • None
    • Backend

    Description

      In the join nodes, include NestedLoopJoinNode and PartitionedHashJoinNode, we transfer memory from the left child batches to the output, in case the output batch references the input data. This is not necessary for all join mode, e.g. RIGHT SEMI and RIGHT ANTI joins that only return row data from the right child. This results in unnecessary amounts of memory flowing up the plan tree. It's also a little undesirable in that it makes it unclear why the memory is transferred.

      E.g. see https://github.com/apache/impala/blob/0bb056e/be/src/exec/partitioned-hash-join-node.cc#L657

      Attachments

        Activity

          People

            Unassigned Unassigned
            tarmstrong Tim Armstrong
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated: