Uploaded image for project: 'Jackrabbit Oak'
  1. Jackrabbit Oak
  2. OAK-1458

Out-of-process text extraction for better protection agains JVM/memory/CPU problems

Agile BoardAttach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Won't Fix
    • None
    • None
    • lucene, query

    Description

      This is a tracking / collection bug for solving problems with text extraction of
      documents (very large, broken, malicious, etc), causing JVM crashes, memory
      problems, excessive CPU usage.

      The basic TIKA feature to enable this fix is TIKA-416 [1]

      [1] https://issues.apache.org/jira/browse/TIKA-416

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned
            mmarth Michael Marth
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment