  1. Jackrabbit Oak
  2. OAK-4201

Add an index of binary references in a tar file

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: Segment Tar 0.0.4
    • Component/s: segment-tar
    • Labels: None

    Description

      Currently, for Blob GC on the segment store, SegmentBlobReferenceRetriever goes through all tar files and extracts the binary references. This has two issues:

      1. The logic has to go through all the segments in all tar files.
      2. Every segment gets loaded into memory once, which affects normal system performance.

      This process can be optimized if we also write a file entry in each tar file (similar to the gph, i.e. graph, and idx, i.e. index, entries) that lists all binary references referred to by any segment present in that tar file. The GC logic would then only have to read this entry and could avoid scanning all the segments.
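
      The proposed entry could look roughly like the following minimal sketch. It is not the format introduced by the attached patch: the class name, the magic number, and the byte layout (entry count, length-prefixed UTF-8 references, then a CRC32 checksum and a magic number as a footer) are all assumptions made for illustration. The writer would collect references while segments are appended to a tar file and store the serialized entry next to the graph and index entries; Blob GC would then read only this entry.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.Set;
import java.util.TreeSet;
import java.util.zip.CRC32;

// Hypothetical sketch of a per-tar binary reference index; not Oak's actual API or format.
final class BinaryReferenceIndexSketch {

    // Assumed magic number marking the end of the entry (illustrative value only).
    private static final int MAGIC = 0x0A425246;

    // Footer size: 4 bytes of CRC32 followed by 4 bytes of magic number.
    private static final int FOOTER_SIZE = 8;

    /** Serializes the references collected for one tar file into a single entry. */
    public static byte[] write(Set<String> references) {
        List<byte[]> encoded = new ArrayList<>();
        int payloadLength = 4; // the leading entry count
        for (String reference : new TreeSet<>(references)) { // sorted for a stable layout
            byte[] bytes = reference.getBytes(StandardCharsets.UTF_8);
            encoded.add(bytes);
            payloadLength += 4 + bytes.length;
        }

        ByteBuffer entry = ByteBuffer.allocate(payloadLength + FOOTER_SIZE);
        entry.putInt(encoded.size());
        for (byte[] bytes : encoded) {
            entry.putInt(bytes.length);
            entry.put(bytes);
        }

        // The checksum covers the payload only, so a reader can detect corruption.
        CRC32 crc = new CRC32();
        crc.update(entry.array(), 0, payloadLength);
        entry.putInt((int) crc.getValue());
        entry.putInt(MAGIC);
        return entry.array();
    }

    /** Reads the references back without touching any segment of the tar file. */
    public static Set<String> read(byte[] entry) {
        if (entry.length < 4 + FOOTER_SIZE) {
            throw new IllegalArgumentException("entry too short");
        }
        ByteBuffer buffer = ByteBuffer.wrap(entry);
        int payloadLength = entry.length - FOOTER_SIZE;
        int storedCrc = buffer.getInt(payloadLength);
        int magic = buffer.getInt(payloadLength + 4);
        if (magic != MAGIC) {
            throw new IllegalArgumentException("invalid magic number");
        }
        CRC32 crc = new CRC32();
        crc.update(entry, 0, payloadLength);
        if ((int) crc.getValue() != storedCrc) {
            throw new IllegalArgumentException("checksum mismatch");
        }

        int count = buffer.getInt();
        Set<String> references = new TreeSet<>();
        for (int i = 0; i < count; i++) {
            byte[] bytes = new byte[buffer.getInt()];
            buffer.get(bytes);
            references.add(new String(bytes, StandardCharsets.UTF_8));
        }
        return references;
    }
}
```

      With such an entry in place, the cost of collecting references grows with the number of references per tar file rather than with the number and size of segments. A tar file written before the entry existed would still need the old segment scan as a fallback.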

      Attachments

        1. OAK-4201-01.patch
          51 kB
          Francesco Mari

            People

              Assignee: Francesco Mari (frm)
              Reporter: Chetan Mehrotra (chetanm)