  HBase / HBASE-28213

Evaluate using hbase-shaded-client-byo-hadoop for Spark connector

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Done
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: spark
    • Labels: None

    Description

      Since version 3.2, Spark uses the shaded hadoop-client-api and hadoop-client-runtime JARs.
      We do not currently specify which HBase libraries the connector needs on the Spark client side. The Cloudera docs, at least, specify the classes provided by "hbase mapredcp", which include the full unshaded Hadoop JAR set.
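To make the concern above concrete, the following is a minimal sketch of how one might check a classpath string (for example, the colon-separated output of "hbase mapredcp") for unshaded Hadoop JARs. The function name and the naming heuristic are assumptions made for illustration: an entry is flagged when its file name starts with "hadoop-" and it is not one of the two shaded client artifacts named in this ticket.

```python
import posixpath
from typing import List

# The two shaded Hadoop client artifacts named in the ticket.
SHADED_HADOOP_PREFIXES = ("hadoop-client-api", "hadoop-client-runtime")


def unshaded_hadoop_jars(classpath: str, sep: str = ":") -> List[str]:
    """Return classpath entries that look like unshaded Hadoop JARs.

    Assumption (heuristic, not an HBase or Hadoop rule): a JAR whose file
    name starts with "hadoop-" and is not one of the shaded client
    artifacts is treated as unshaded.
    """
    flagged = []
    for entry in classpath.split(sep):
        name = posixpath.basename(entry)
        is_hadoop_jar = name.startswith("hadoop-") and name.endswith(".jar")
        if is_hadoop_jar and not name.startswith(SHADED_HADOOP_PREFIXES):
            flagged.append(entry)
    return flagged
```

An empty result would suggest that the classpath does not pull in the unshaded Hadoop set. It does not prove that the remaining JARs are sufficient for the connector.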

      Investigate whether hbase-shaded-client-byo-hadoop, together with hadoop-client-api and hadoop-client-runtime, is enough for the connector. If it is, document how to set the Spark classpath.

      Alternatively, if hbase-shaded-client-byo-hadoop is not enough, check whether hbase-shaded-mapreduce plus the two shaded Hadoop client JARs above provides everything needed.
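As an illustration of what "setting the Spark classpath" could look like once a sufficient JAR set is known, here is a small sketch that builds a spark-submit argument list using the standard --jars option (a comma-separated list of JARs). The helper name and the JAR paths in the usage note are hypothetical, and whether either candidate set is actually sufficient is exactly what this ticket asks to investigate.

```python
from typing import List


def spark_submit_jar_args(jars: List[str]) -> List[str]:
    """Build the spark-submit arguments that add extra JARs to the job.

    spark-submit expects --jars followed by a single comma-separated value.
    """
    if not jars:
        raise ValueError("at least one JAR path is required")
    return ["--jars", ",".join(jars)]
```

For example, spark_submit_jar_args(["/path/to/hbase-shaded-client-byo-hadoop.jar"]) would yield the arguments for the first candidate. Swapping in a hbase-shaded-mapreduce JAR path gives the alternative. The Hadoop client JARs are left out here on the assumption that Spark 3.2 and later already provides them.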

            People

              Assignee: Istvan Toth (stoty)
              Reporter: Istvan Toth (stoty)
              Votes: 0
              Watchers: 1

              Dates

                Created:
                Updated:
                Resolved: