jackrabbit-commits mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Jackrabbit Wiki] Update of "QuestionsAndAnswers" by BrentEdwards
Date Tue, 11 Mar 2008 19:58:04 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Jackrabbit Wiki" for change notification.

The following page has been changed by BrentEdwards:
http://wiki.apache.org/jackrabbit/QuestionsAndAnswers

The comment on the change is:
Extractor Dependencies not found.

------------------------------------------------------------------------------
  
  This happens with version 1.3.3, in an unpredictable manner. Is it a bug or a feature? Any
suggestions would be useful.
  
+ 
+ === Extractor dependency not found ===
+ 
+ Question: I'm a new Jackrabbit user.  When I run "FirstHop.java" from http://jackrabbit.apache.org/first-hops.html,
I get the following warnings. (The whole log is posted; the person answering should feel free to edit
it down to the most important lines.)
+ 
+ {{{
+  496 [main] INFO org.apache.jackrabbit.core.RepositoryImpl - Starting repository...
+  578 [main] INFO org.apache.jackrabbit.core.fs.local.LocalFileSystem - LocalFileSystem initialized
at path repository/repository
+  1218 [main] INFO org.apache.jackrabbit.core.nodetype.NodeTypeRegistry - no custom node
type definitions found
+  1236 [main] INFO org.apache.jackrabbit.core.fs.local.LocalFileSystem - LocalFileSystem
initialized at path repository/version
+  2945 [main] INFO org.apache.jackrabbit.core.persistence.bundle.util.ConnectionRecoveryManager
- Database: Apache Derby / 10.3.2.1 - (599110)
+  2945 [main] INFO org.apache.jackrabbit.core.persistence.bundle.util.ConnectionRecoveryManager
- Driver: Apache Derby Embedded JDBC Driver / 10.3.2.1 - (599110)
+  4607 [main] INFO org.apache.jackrabbit.core.RepositoryImpl - initializing workspace 'default'...
+  4609 [main] INFO org.apache.jackrabbit.core.fs.local.LocalFileSystem - LocalFileSystem
initialized at path repository/workspaces/default
+  4716 [main] INFO org.apache.jackrabbit.core.persistence.bundle.util.ConnectionRecoveryManager
- Database: Apache Derby / 10.3.2.1 - (599110)
+  4716 [main] INFO org.apache.jackrabbit.core.persistence.bundle.util.ConnectionRecoveryManager
- Driver: Apache Derby Embedded JDBC Driver / 10.3.2.1 - (599110)
+  5515 [main] INFO org.apache.jackrabbit.core.RepositoryImpl - workspace 'default' initialized
+  5860 [main] WARN org.apache.jackrabbit.core.query.lucene.JackrabbitTextExtractor - Extractor
dependency not found: org.apache.jackrabbit.extractor.MsWordTextExtractor
+  java.lang.NoClassDefFoundError
+ 	at org.apache.jackrabbit.extractor.MsWordTextExtractor.class$(MsWordTextExtractor.java:37)
+ 	at org.apache.jackrabbit.extractor.MsWordTextExtractor.<clinit>(MsWordTextExtractor.java:43)
+ 	at java.lang.Class.forName0(Native Method)
+ 	at java.lang.Class.forName(Class.java:164)
+ 	at org.apache.jackrabbit.core.query.lucene.JackrabbitTextExtractor.<init>(JackrabbitTextExtractor.java:113)
+ 	at org.apache.jackrabbit.core.query.lucene.SearchIndex.createTextExtractor(SearchIndex.java:881)
+ 	at org.apache.jackrabbit.core.query.lucene.SearchIndex.doInit(SearchIndex.java:395)
+ 	at org.apache.jackrabbit.core.query.AbstractQueryHandler.init(AbstractQueryHandler.java:48)
+ 	at org.apache.jackrabbit.core.SearchManager.initializeQueryHandler(SearchManager.java:573)
+ 	at org.apache.jackrabbit.core.SearchManager.<init>(SearchManager.java:255)
+ 	at org.apache.jackrabbit.core.RepositoryImpl.getSystemSearchManager(RepositoryImpl.java:625)
+ 	at org.apache.jackrabbit.core.RepositoryImpl.access$300(RepositoryImpl.java:104)
+ 	at org.apache.jackrabbit.core.RepositoryImpl$WorkspaceInfo.getSearchManager(RepositoryImpl.java:1613)
+ 	at org.apache.jackrabbit.core.RepositoryImpl.initWorkspace(RepositoryImpl.java:606)
+ 	at org.apache.jackrabbit.core.RepositoryImpl.initStartupWorkspaces(RepositoryImpl.java:415)
+ 	at org.apache.jackrabbit.core.RepositoryImpl.<init>(RepositoryImpl.java:305)
+ 	at org.apache.jackrabbit.core.RepositoryImpl.create(RepositoryImpl.java:557)
+ 	at org.apache.jackrabbit.core.TransientRepository$2.getRepository(TransientRepository.java:245)
+ 	at org.apache.jackrabbit.core.TransientRepository.startRepository(TransientRepository.java:265)
+ 	at org.apache.jackrabbit.core.TransientRepository.login(TransientRepository.java:333)
+ 	at org.apache.jackrabbit.core.TransientRepository.login(TransientRepository.java:388)
+ 	at org.lockss.jackrabbit.FirstHop.main(FirstHop.java:26)
+  Caused by: java.lang.ClassNotFoundException: org.textmining.text.extraction.WordExtractor
+ 	at java.net.URLClassLoader$1.run(URLClassLoader.java:200)
+ 	at java.security.AccessController.doPrivileged(Native Method)
+ 	at java.net.URLClassLoader.findClass(URLClassLoader.java:188)
+ 	at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
+ 	at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:268)
+ 	at java.lang.ClassLoader.loadClass(ClassLoader.java:251)
+ 	at java.lang.ClassLoader.loadClassInternal(ClassLoader.java:319)
+ 	at java.lang.Class.forName0(Native Method)
+ 	at java.lang.Class.forName(Class.java:164)
+ 	... 22 more
+  5913 [main] WARN org.apache.jackrabbit.core.query.lucene.JackrabbitTextExtractor - Extractor
dependency not found: org.apache.jackrabbit.extractor.PdfTextExtractor
+  java.lang.NoClassDefFoundError: org/pdfbox/pdmodel/PDDocument
+ 	at java.lang.Class.forName0(Native Method)
+ 	at java.lang.Class.forName(Class.java:164)
+ 	at org.apache.jackrabbit.core.query.lucene.JackrabbitTextExtractor.<init>(JackrabbitTextExtractor.java:113)
+ 	at org.apache.jackrabbit.core.query.lucene.SearchIndex.createTextExtractor(SearchIndex.java:881)
+ 	at org.apache.jackrabbit.core.query.lucene.SearchIndex.doInit(SearchIndex.java:395)
+ 	at org.apache.jackrabbit.core.query.AbstractQueryHandler.init(AbstractQueryHandler.java:48)
+ 	at org.apache.jackrabbit.core.SearchManager.initializeQueryHandler(SearchManager.java:573)
+ 	at org.apache.jackrabbit.core.SearchManager.<init>(SearchManager.java:255)
+ 	at org.apache.jackrabbit.core.RepositoryImpl.getSystemSearchManager(RepositoryImpl.java:625)
+ 	at org.apache.jackrabbit.core.RepositoryImpl.access$300(RepositoryImpl.java:104)
+ 	at org.apache.jackrabbit.core.RepositoryImpl$WorkspaceInfo.getSearchManager(RepositoryImpl.java:1613)
+ 	at org.apache.jackrabbit.core.RepositoryImpl.initWorkspace(RepositoryImpl.java:606)
+ 	at org.apache.jackrabbit.core.RepositoryImpl.initStartupWorkspaces(RepositoryImpl.java:415)
+ 	at org.apache.jackrabbit.core.RepositoryImpl.<init>(RepositoryImpl.java:305)
+ 	at org.apache.jackrabbit.core.RepositoryImpl.create(RepositoryImpl.java:557)
+ 	at org.apache.jackrabbit.core.TransientRepository$2.getRepository(TransientRepository.java:245)
+ 	at org.apache.jackrabbit.core.TransientRepository.startRepository(TransientRepository.java:265)
+ 	at org.apache.jackrabbit.core.TransientRepository.login(TransientRepository.java:333)
+ 	at org.apache.jackrabbit.core.TransientRepository.login(TransientRepository.java:388)
+ 	at org.lockss.jackrabbit.FirstHop.main(FirstHop.java:26)
+  6063 [main] INFO org.apache.jackrabbit.core.fs.local.LocalFileSystem - LocalFileSystem
initialized at path repository/repository/index
+  6422 [main] INFO org.apache.jackrabbit.core.query.lucene.SearchIndex - Index initialized:
repository/repository/index Version: 2
+  6429 [main] WARN org.apache.jackrabbit.core.query.lucene.JackrabbitTextExtractor - Extractor
dependency not found: org.apache.jackrabbit.extractor.MsWordTextExtractor
+  java.lang.NoClassDefFoundError
+ 	at java.lang.Class.forName0(Native Method)
+ 	at java.lang.Class.forName(Class.java:164)
+ 	at org.apache.jackrabbit.core.query.lucene.JackrabbitTextExtractor.<init>(JackrabbitTextExtractor.java:113)
+ 	at org.apache.jackrabbit.core.query.lucene.SearchIndex.createTextExtractor(SearchIndex.java:881)
+ 	at org.apache.jackrabbit.core.query.lucene.SearchIndex.doInit(SearchIndex.java:395)
+ 	at org.apache.jackrabbit.core.query.AbstractQueryHandler.init(AbstractQueryHandler.java:48)
+ 	at org.apache.jackrabbit.core.SearchManager.initializeQueryHandler(SearchManager.java:573)
+ 	at org.apache.jackrabbit.core.SearchManager.<init>(SearchManager.java:255)
+ 	at org.apache.jackrabbit.core.RepositoryImpl$WorkspaceInfo.getSearchManager(RepositoryImpl.java:1613)
+ 	at org.apache.jackrabbit.core.RepositoryImpl.initWorkspace(RepositoryImpl.java:606)
+ 	at org.apache.jackrabbit.core.RepositoryImpl.initStartupWorkspaces(RepositoryImpl.java:415)
+ 	at org.apache.jackrabbit.core.RepositoryImpl.<init>(RepositoryImpl.java:305)
+ 	at org.apache.jackrabbit.core.RepositoryImpl.create(RepositoryImpl.java:557)
+ 	at org.apache.jackrabbit.core.TransientRepository$2.getRepository(TransientRepository.java:245)
+ 	at org.apache.jackrabbit.core.TransientRepository.startRepository(TransientRepository.java:265)
+ 	at org.apache.jackrabbit.core.TransientRepository.login(TransientRepository.java:333)
+ 	at org.apache.jackrabbit.core.TransientRepository.login(TransientRepository.java:388)
+ 	at org.lockss.jackrabbit.FirstHop.main(FirstHop.java:26)
+  6438 [main] WARN org.apache.jackrabbit.core.query.lucene.JackrabbitTextExtractor - Extractor
dependency not found: org.apache.jackrabbit.extractor.PdfTextExtractor
+  java.lang.NoClassDefFoundError: org/pdfbox/pdmodel/PDDocument
+ 	at java.lang.Class.forName0(Native Method)
+ 	at java.lang.Class.forName(Class.java:164)
+ 	at org.apache.jackrabbit.core.query.lucene.JackrabbitTextExtractor.<init>(JackrabbitTextExtractor.java:113)
+ 	at org.apache.jackrabbit.core.query.lucene.SearchIndex.createTextExtractor(SearchIndex.java:881)
+ 	at org.apache.jackrabbit.core.query.lucene.SearchIndex.doInit(SearchIndex.java:395)
+ 	at org.apache.jackrabbit.core.query.AbstractQueryHandler.init(AbstractQueryHandler.java:48)
+ 	at org.apache.jackrabbit.core.SearchManager.initializeQueryHandler(SearchManager.java:573)
+ 	at org.apache.jackrabbit.core.SearchManager.<init>(SearchManager.java:255)
+ 	at org.apache.jackrabbit.core.RepositoryImpl$WorkspaceInfo.getSearchManager(RepositoryImpl.java:1613)
+ 	at org.apache.jackrabbit.core.RepositoryImpl.initWorkspace(RepositoryImpl.java:606)
+ 	at org.apache.jackrabbit.core.RepositoryImpl.initStartupWorkspaces(RepositoryImpl.java:415)
+ 	at org.apache.jackrabbit.core.RepositoryImpl.<init>(RepositoryImpl.java:305)
+ 	at org.apache.jackrabbit.core.RepositoryImpl.create(RepositoryImpl.java:557)
+ 	at org.apache.jackrabbit.core.TransientRepository$2.getRepository(TransientRepository.java:245)
+ 	at org.apache.jackrabbit.core.TransientRepository.startRepository(TransientRepository.java:265)
+ 	at org.apache.jackrabbit.core.TransientRepository.login(TransientRepository.java:333)
+ 	at org.apache.jackrabbit.core.TransientRepository.login(TransientRepository.java:388)
+ 	at org.lockss.jackrabbit.FirstHop.main(FirstHop.java:26)
+  6448 [main] INFO org.apache.jackrabbit.core.fs.local.LocalFileSystem - LocalFileSystem
initialized at path repository/workspaces/default/index
+  6456 [main] INFO org.apache.jackrabbit.core.query.lucene.SearchIndex - Index initialized:
repository/workspaces/default/index Version: 2
+  6456 [main] INFO org.apache.jackrabbit.core.RepositoryImpl - Repository started
+  6457 [main] INFO org.apache.jackrabbit.core.TransientRepository - Transient repository
initialized
+  6532 [main] INFO org.apache.jackrabbit.core.TransientRepository - Session opened
+  Logged in as anonymous to a Jackrabbit repository.
+  6550 [main] INFO org.apache.jackrabbit.core.TransientRepository - Session closed
+  6550 [main] INFO org.apache.jackrabbit.core.RepositoryImpl - Shutting down repository...
+  6553 [IndexMerger] INFO org.apache.jackrabbit.core.query.lucene.IndexMerger - IndexMerger
terminated
+  6558 [main] INFO org.apache.jackrabbit.core.query.lucene.SearchIndex - Index closed: repository/repository/index
+  6558 [main] INFO org.apache.jackrabbit.core.RepositoryImpl - shutting down workspace 'default'...
+  6559 [main] INFO org.apache.jackrabbit.core.observation.ObservationDispatcher - Notification
of EventListeners stopped.
+  6559 [IndexMerger] INFO org.apache.jackrabbit.core.query.lucene.IndexMerger - IndexMerger
terminated
+  6606 [main] INFO org.apache.jackrabbit.core.query.lucene.SearchIndex - Index closed: repository/workspaces/default/index
+  6639 [main] INFO org.apache.jackrabbit.core.persistence.bundle.DerbyPersistenceManager
- Database 'repository/workspaces/default/db' shutdown.
+  6643 [main] ERROR org.apache.jackrabbit.core.persistence.bundle.util.ConnectionRecoveryManager
- failed to close connection, reason: No current connection., state/code: 08003/40000
+  6645 [main] INFO org.apache.jackrabbit.core.RepositoryImpl - workspace 'default' has been
shutdown
+  6661 [main] INFO org.apache.jackrabbit.core.persistence.bundle.DerbyPersistenceManager
- Database 'repository/version/db' shutdown.
+  6661 [main] ERROR org.apache.jackrabbit.core.persistence.bundle.util.ConnectionRecoveryManager
- failed to close connection, reason: No current connection.,  state/code: 08003/40000
+  6678 [main] INFO org.apache.jackrabbit.core.RepositoryImpl - Repository has been shutdown
+  6678 [main] INFO org.apache.jackrabbit.core.TransientRepository - Transient repository
shut down
+ }}}
+ 
+ I'm worried about the "Extractor dependency not found" warnings for MsWordTextExtractor and PdfTextExtractor.  What could be causing them?
+ 
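One way to narrow this down is to check whether the classes named in the stack traces above can be loaded at all from the classpath used to run FirstHop. The sketch below does only that. It assumes the warnings simply reflect classes that are absent from the classpath, which the log suggests but does not prove. The class name `ExtractorClasspathCheck` and the helper `isOnClasspath` are made up for illustration; the two class names being probed are copied from the log.

```java
public class ExtractorClasspathCheck {

    /** Returns true if the named class can be located by this class's class loader. */
    static boolean isOnClasspath(String className) {
        try {
            // initialize=false: only locate and load the class, do not run its static initializers
            Class.forName(className, false, ExtractorClasspathCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            // Not found, or found but not loadable (for example NoClassDefFoundError)
            return false;
        }
    }

    public static void main(String[] args) {
        // Class names taken from the "Caused by" and NoClassDefFoundError lines in the log
        String[] names = {
            "org.textmining.text.extraction.WordExtractor",
            "org.pdfbox.pdmodel.PDDocument"
        };
        for (String name : names) {
            System.out.println(name + (isOnClasspath(name) ? " : found" : " : MISSING"));
        }
    }
}
```

Run it with the same classpath as FirstHop. If a class is reported as missing, the corresponding extractor warning is most likely a classpath issue rather than a repository fault.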
