Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-22528

Bloom Filter not showing up in Explain plan

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Critical
    • Resolution: Unresolved
    • 3.1.0
    • None
    • Hive
    • None
    • Test Environment.

    Description

      Hi Team,

      We are using Hive version (Apache Hive (version 3.1.0.3.1.0.0-78) and trying to implement Bloom filter in it. So basically I have created a managed table with table properties defined as:

      'orc.bloom.filter.columns'='*******',  'orc.bloom.filter.fpp'='0.05',  'orc.stripe.size'='268435456',

      and stored it as orc file. While checking the explain plan(running: explain select count(1) from the_table where <condition>) in the current Hive version, I couldn't see anything as "Bloom_Filter" in the Plan provided by the CBO. The table I'm querying data in has  records.

       

      I have a few doubts:

      1. Is Hive 3.1 version not using Bloom filter? If so, I have queried a normal table with same query and condition have seen that it takes more time compared to a table having Bloom filter defined on the column that has condition.
      2. Is there any parameter that needs to be set to get the value/ Bloom filter in the table?
      3. I have come across three parameters, please let me know what does these signify : 
        hive.tez.max.bloom.filter.entries,hive.tez.min.bloom.filter.entries,hive.tez.bloom.filter.factor

      Please let me know if anyone has used Bloom filter. Let me know then the process

      Attachments

        Activity

          People

            Unassigned Unassigned
            Sinha_Subhajit_176 Subhajit Sinha
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: