Uploaded image for project: 'Jackrabbit Oak'
  1. Jackrabbit Oak
  2. OAK-6377

Text extraction with oak-run and tika requires fake string in the command to work

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 1.7.2, 1.8.0
    • 1.7.3, 1.8.0
    • lucene, run
    • None

    Description

      According to the text-extraction documentation there is currently no need to set a segmentstore for the extract command.

          java -cp tika-app-1.15.jar:oak-run.jar \
          org.apache.jackrabbit.oak.run.Main tika \
          --data-file binary-stats.csv \
          --store-path ./store  \
          --fds-path /path/to/datastore  --extract
      

      The command parser expects a string option so the workaround for this would be to provide a fake string at the end.
      e.g:

      java -cp .......... --extract fakestore
      

      Attachments

        Issue Links

          Activity

            People

              chetanm Chetan Mehrotra
              chibulcu Paul Chibulcuteanu
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: