Uploaded image for project: 'Jackrabbit Oak'
  1. Jackrabbit Oak
  2. OAK-2392

[DocumentMK] Garbage Collect older revisions of binary properties in main document

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • None
    • 1.5.1, 1.6.0
    • mongomk
    • None

    Description

      Current GC logic for DocumentMK only collects certain types of garbage (see OAK-1981) and currently only split documents are removed. While complete full blow gc would take time and yet not fully implemented we should handle those documents which have binary properties and those properties get updated few times (but not very frequently).

      For e.g. performing a reindex for Lucene index would lead to removal of index files nodes and again creation of nodes with same name. In such a case the older revision of binary property would remain in main document and would not be eligible for gc as per current impl.

      As a fix the GC logic should look for document which might have binaries and then remove the older revisions of binary properties. Currently we do scan all such documents for Blob GC.

      So this can be done either as part of Revision GC or Blob GC

      Attachments

        Issue Links

          Activity

            People

              mreutegg Marcel Reutegger
              chetanm Chetan Mehrotra
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: