HBase / HBASE-22448

Scan is slow for Multiple Column prefixes

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Won't Fix
    • Affects Version/s: 1.4.8, 1.4.9
    • Fix Version/s: None
    • Component/s: Scanners

    Description

      Scanning a single row (around 10 lakh, i.e. 1 million, columns) with 100 column prefixes takes around 4 seconds in hbase-1.2.5, but the same query takes around 50 seconds in hbase-1.4.9.

      Is there any way to optimise this?

       

      P.S.:

      We have applied the patches provided in HBASE-21620 and HBASE-21734. The attached qualifiers.txt file contains the column keys. Use the attached HBaseFileImport.java to populate your table with them, and use scanquery.txt to run the query.
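
      To check that both versions return the same columns, the expected result can be computed locally from the column keys. The following is only a plain-Java sketch under stated assumptions, not HBase code and not the contents of the attached files: the class name PrefixMatchSketch and the sample qualifiers and prefixes are hypothetical, and qualifiers are treated as ASCII strings so that String ordering agrees with byte ordering. It compares a check of every qualifier against every prefix with a seek into the sorted qualifiers for each prefix.

```java
import java.util.Arrays;
import java.util.List;
import java.util.NavigableSet;
import java.util.TreeSet;

// Hypothetical helper for illustration; it does not use the HBase client API.
class PrefixMatchSketch {

    // Baseline: test every qualifier against every prefix
    // (up to qualifiers x prefixes startsWith checks).
    static NavigableSet<String> matchNaive(NavigableSet<String> qualifiers, List<String> prefixes) {
        NavigableSet<String> out = new TreeSet<>();
        for (String q : qualifiers) {
            for (String p : prefixes) {
                if (q.startsWith(p)) {
                    out.add(q);
                    break; // one matching prefix is enough
                }
            }
        }
        return out;
    }

    // Seek-based: for each prefix, jump to the first qualifier >= prefix and
    // stop at the first qualifier that no longer starts with it.
    // Assumes the set uses natural String ordering, so that all qualifiers
    // sharing a prefix are contiguous.
    static NavigableSet<String> matchBySeek(NavigableSet<String> qualifiers, List<String> prefixes) {
        NavigableSet<String> out = new TreeSet<>();
        for (String p : prefixes) {
            for (String q : qualifiers.tailSet(p, true)) {
                if (!q.startsWith(p)) {
                    break;
                }
                out.add(q); // the set removes duplicates from overlapping prefixes
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // Made-up sample data standing in for qualifiers.txt and the 100 prefixes.
        NavigableSet<String> qualifiers = new TreeSet<>(
                Arrays.asList("addr_city", "addr_zip", "name_first", "phone_home"));
        List<String> prefixes = Arrays.asList("addr_", "phone_");
        System.out.println(matchBySeek(qualifiers, prefixes)); // [addr_city, addr_zip, phone_home]
    }
}
```

      Both methods return the same set. The seek-based one only visits qualifiers that match, plus one extra per prefix, which is the behaviour one would hope a prefix filter over a wide row approximates.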

      Attachments

        1. scanquery.txt
          3 kB
          Karthick
        2. qualifiers.txt
          26.29 MB
          Karthick
        3. org.apache.hadoop.hbase.filter.TestSlowColumnPrefix-output.zip
          915 kB
          ramkrishna.s.vasudevan
        4. HBaseFileImport.java
          2 kB
          Karthick
        5. filter-list-with-or-internal-2.png
          77 kB
          Zheng Hu
        6. 0001-benchmark-UT.patch
          5 kB
          Zheng Hu

          People

            Assignee: Zheng Hu (openinx)
            Reporter: Karthick (KarthickRam)
            Votes: 1
            Watchers: 10
