  1. Stanbol (Retired)
  2. STANBOL-1420

Indexer for the Freebase knowledge base

Details

    • Type: Bug
    • Status: Closed
    • Priority: Blocker
    • Resolution: Cannot Reproduce
    • Affects Version/s: entityhub-0.11.0
    • Fix Version/s: 1.0.0
    • Component/s: Entityhub
    • Environment: Mac OS Yosemite 10.10.3
    • Flags: Important

    Description

      Hi,

      I am working on the HEAD branch of the Stanbol repo and observed the following while building the Freebase Entityhub indexing tool.

      1. The target directory contains the following jars:

      directory: stanbol/entityhub/indexing/freebase/target
      org.apache.stanbol.entityhub.indexing.freebase-1.0.0-SNAPSHOT-sources.jar
      org.apache.stanbol.entityhub.indexing.freebase-1.0.0-SNAPSHOT.jar
      original-org.apache.stanbol.entityhub.indexing.freebase-1.0.0-SNAPSHOT.jar

      2. Per the documentation, copy these jars to the directory where the Freebase data resides,
      e.g. /tmp/freebase/

      3. Initialize the configuration. Per the documentation, the configuration can be initialized with the defaults by calling

      java -jar org.apache.stanbol.entityhub.indexing.freebase*.jar init

      The above results in an error stating

      "no main manifest attribute, in org.apache.stanbol.entityhub.indexing.freebase-1.0.0-SNAPSHOT-sources.jar"

      If I change it to "java -jar org.apache.stanbol.entityhub.indexing.freebase-1.0.0-SNAPSHOT.jar init" it works; however, it might not be generating all the files, as the next step fails with errors.
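
The error in step 3 is consistent with the shell expanding the wildcard to more than one jar, so that java -jar receives the -sources jar (which has no Main-Class entry) as the jar to run. A minimal sketch of one way to avoid this, assuming the jar names listed in step 1; the helper name pick_runnable_jar is hypothetical and not part of Stanbol:

```shell
# Hypothetical helper (not part of Stanbol): print the first freebase indexing
# jar in a directory that is neither a -sources jar nor an original-* jar.
# It filters by file name only and does not inspect the jar manifest.
pick_runnable_jar() {
  dir="$1"
  for jar in "$dir"/*indexing.freebase-*.jar; do
    # If the glob matched nothing, the literal pattern is left; skip it.
    [ -e "$jar" ] || continue
    case "$(basename "$jar")" in
      *-sources.jar|original-*) continue ;;
    esac
    printf '%s\n' "$jar"
    return 0
  done
  return 1
}

# Possible usage (assumes the jars were copied to /tmp/freebase):
#   java -jar "$(pick_runnable_jar /tmp/freebase)" init
```

Naming the runnable jar explicitly, as done in the workaround above, has the same effect; the helper only saves retyping the version string.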

      4. java -jar -Xmx32g org.apache.stanbol.entityhub.indexing.freebase-1.0.0-SNAPSHOT.jar index => This command generates the following error:

      Exception in thread "Thread-2" 00:02:35,705 [Thread-0] INFO solryard.SolrYardIndexingDestination - ... copy Solr Configuration form /private/tmp/freebase/freebase-index/indexing/config/freebase to /private/tmp/freebase/freebase-index/indexing/destination/indexes/default/freebase
      java.lang.IllegalStateException: The file with the Entity Scores is missing
      at org.apache.stanbol.entityhub.indexing.core.source.LineBasedEntityIterator.initialise(LineBasedEntityIterator.java:476)
      at org.apache.stanbol.entityhub.indexing.core.impl.IndexingSourceInitialiser.run(IndexingSourceInitialiser.java:43)
      at java.lang.Thread.run(Thread.java:745)
      Caused by: java.io.FileNotFoundException: /private/tmp/freebase/freebase-index/indexing/resources/incoming_links.txt (No such file or directory)
      at java.io.FileInputStream.open(Native Method)
      at java.io.FileInputStream.<init>(FileInputStream.java:146)
      at org.apache.stanbol.entityhub.indexing.core.source.LineBasedEntityIterator.initialise(LineBasedEntityIterator.java:474)
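
The stack trace shows the indexer failing because incoming_links.txt is absent from the indexing/resources directory. A small pre-flight check along these lines could surface that before a long indexing run is started. This is only a sketch: the function name check_entity_scores is hypothetical, the relative path is taken from the trace above, and the report does not say how the file is meant to be produced.

```shell
# Hypothetical pre-flight check (not part of Stanbol): verify that the entity
# scores file named in the stack trace exists under the working directory.
check_entity_scores() {
  workdir="$1"
  # Relative path as it appears in the FileNotFoundException above.
  scores="$workdir/freebase-index/indexing/resources/incoming_links.txt"
  if [ -f "$scores" ]; then
    printf 'found: %s\n' "$scores"
    return 0
  fi
  printf 'missing: %s\n' "$scores" >&2
  return 1
}

# Possible usage (assumes /tmp/freebase is the working directory):
#   check_entity_scores /tmp/freebase && \
#     java -jar -Xmx32g org.apache.stanbol.entityhub.indexing.freebase-1.0.0-SNAPSHOT.jar index
```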

          People

            Assignee: Unassigned
            Reporter: Rajan Shah (shahrajan)