Solr / SOLR-13108

RelatednessAgg ignores cacheDf, consults filterCache for every bucket/term

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 7.4, 8.0
    • Fix Version/s: None
    • Component/s: Facet Module
    • Labels: None

    Description

      The relatedness aggregation function in the JSON Facet API ignores the cacheDf setting and consults the filterCache for every bucket. This is acceptable for the "Query" facet type, for example, where buckets are explicitly enumerated and are therefore probably of relatively low cardinality. For the "Terms" facet type, however, the bucket count is determined by the corpus, so every distinct term results in a filterCache lookup. When relatedness is used over even modestly high-cardinality fields, this is very likely to blow out the filterCache.

      See also SOLR-9350, an issue with similar consequences.
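
      For illustration, the kind of request that exercises this code path can be sketched as follows. This is only a sketch under stated assumptions: the field name keywords_s, the foreground query category_s:solr, and the facet labels by_term and r are hypothetical, and the helper function is not part of Solr. It builds a "Terms" facet with a relatedness sub-facet and sets cacheDf on the terms facet, which, per this issue, relatedness does not take into account.

```python
import json


def build_relatedness_request(field, fore_query, back_query="*:*",
                              cache_df=None, limit=10):
    """Build a JSON Facet API request body (hypothetical helper).

    The body asks for a terms facet over `field`, with a relatedness()
    statistic computed for every term bucket.
    """
    terms_facet = {
        "type": "terms",
        "field": field,
        "limit": limit,
        # One relatedness computation per bucket; per this issue, each
        # one consults the filterCache regardless of cacheDf.
        "facet": {"r": "relatedness($fore,$back)"},
    }
    if cache_df is not None:
        # Minimum document frequency for using the filterCache.
        terms_facet["cacheDf"] = cache_df
    return {
        "query": "*:*",
        # $fore and $back are resolved from these request params.
        "params": {"fore": fore_query, "back": back_query},
        "facet": {"by_term": terms_facet},
    }


# Hypothetical field and foreground query, for illustration only.
body = build_relatedness_request("keywords_s", "category_s:solr", cache_df=50)
print(json.dumps(body, indent=2))
```

      With a high-cardinality field in place of keywords_s, the number of buckets, and therefore the number of filterCache lookups, grows with the number of distinct terms rather than with anything stated in the request.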

      Attachments

        1. SOLR-13108.patch (6 kB, Michael Gibney)
        2. SOLR-13108-shortcircuit.patch (7 kB, Michael Gibney)

            People

              Assignee: Unassigned
              Reporter: Michael Gibney (magibney)
              Votes: 0
              Watchers: 4
