Uploaded image for project: 'Apache Tez'
  1. Apache Tez
  2. TEZ-4156

Fix Tez to reuse IPC connections

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.10.0
    • None
    • None

    Description

      When tracking DAG progress, TezClientUtils ends up creating new remote user. Because of this new UGI creation, IPC connections are not reused internally.

      https://github.com/apache/tez/blob/master/tez-api/src/main/java/org/apache/tez/client/TezClientUtils.java#L965

      More info from Hadoop side:

      In hadoop's IPC layer, connectionIds are checked based on UserGroupInformation.
      https://github.com/apache/hadoop/blob/trunk/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/ipc/Client.java#L1600

      However, UserGroupInformation comparison is based on ==
      https://github.com/apache/hadoop/blob/trunk/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/security/UserGroupInformation.java#L1789

      Attachments

        1. TEZ-4156.4.patch
          12 kB
          Rajesh Balamohan
        2. TEZ-4156.3.patch
          11 kB
          Rajesh Balamohan
        3. TEZ-4156.2.patch
          10 kB
          Rajesh Balamohan
        4. TEZ-4156.1.patch
          0.9 kB
          Rajesh Balamohan

        Activity

          People

            rajesh.balamohan Rajesh Balamohan
            rajesh.balamohan Rajesh Balamohan
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: