  1. HBase
  2. HBASE-20844

Duplicate rows returned while hbase snapshot reads

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: 1.3.1
    • Fix Version/s: None
    • Component/s: mapreduce, snapshots, spark
    • Labels: None
    • Environment: Cluster Details

      Java 1.7
      HBase 1.3.1
      Spark 1.6.1

    Description

      We take a snapshot from code and read its data using MapReduce and Spark. Both approaches return duplicate records.

      On the API side, {{org.apache.hadoop.hbase.mapreduce.TableSnapshotInputFormat}} is used.

      The snapshot was taken while the table was in a region split state.

      We suspect this happens because data is being returned for both the parent and the daughter regions.
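
      The suspected cause can be illustrated with a small toy model. This is only a sketch and does not use any HBase classes: the {{Region}} type, the {{splitParent}} flag, and the {{readRows}} helper are hypothetical names introduced for illustration. It assumes a snapshot that lists a split parent region alongside its two daughters, and shows how reading every listed region returns each row twice, while skipping the split parent returns each row once.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

public class SplitSnapshotSketch {

    /** Hypothetical stand-in for one region captured in a snapshot. */
    public static final class Region {
        final String name;
        final boolean splitParent; // true for a parent region that has already been split
        final List<String> rowKeys;

        Region(String name, boolean splitParent, String... rowKeys) {
            this.name = name;
            this.splitParent = splitParent;
            this.rowKeys = Collections.unmodifiableList(Arrays.asList(rowKeys));
        }
    }

    /** Collects row keys from the given regions, optionally skipping split parents. */
    public static List<String> readRows(List<Region> regions, boolean skipSplitParents) {
        List<String> rows = new ArrayList<>();
        for (Region region : regions) {
            if (skipSplitParents && region.splitParent) {
                continue; // the daughters already cover this key range
            }
            rows.addAll(region.rowKeys);
        }
        return rows;
    }

    /** Assumed snapshot contents: one split parent plus its two daughters. */
    public static List<Region> exampleSnapshot() {
        return Arrays.asList(
            new Region("parent", true, "row1", "row2", "row3", "row4"),
            new Region("daughterA", false, "row1", "row2"),
            new Region("daughterB", false, "row3", "row4"));
    }

    public static void main(String[] args) {
        List<Region> snapshot = exampleSnapshot();
        // Reading every listed region: each row key appears twice.
        System.out.println(readRows(snapshot, false));
        // Skipping the split parent: each row key appears once.
        System.out.println(readRows(snapshot, true));
    }
}
```

      Whether the snapshot reader in HBase 1.3.1 actually behaves like the first call is the open question of this report; the model only shows why reading both parent and daughter regions would produce the duplicates described.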

            People

              Assignee: Unassigned
              Reporter: ShivaKumar SS (shivakumar.ss)
              Votes: 0
              Watchers: 5