lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Michael Della Bitta <michael.della.bi...@appinions.com>
Subject Re: Solr 4.3.0 Cloud Issue indexing pdf documents
Date Mon, 10 Jun 2013 14:59:59 GMT
Glad that helped. I'm going to go buy a lottery ticket now! :)

Michael Della Bitta

Applications Developer

o: +1 646 532 3062  | c: +1 917 477 7906

appinions inc.

“The Science of Influence Marketing”

18 East 41st Street

New York, NY 10017

t: @appinions <https://twitter.com/Appinions> | g+:
plus.google.com/appinions
w: appinions.com <http://www.appinions.com/>


On Mon, Jun 10, 2013 at 5:56 AM, Mark Wilson <mw8@sanger.ac.uk> wrote:

> Hi Michael
>
> Thanks very much for that, it did indeed solve the problem.
>
> I had it setup on my internal servers, as I have a separate script for
> tomcat startup, but forgot all about it on the Amazon Cloud servers.
>
> For info
>
> I added
> CATALINA_OPTS="-Djava.awt.headless=true"
> export CATALINA_OPTS
>
> to $tomcat_home/bin/setenv.sh
>
> Thanks again
>
> Regards Mark
>
>
> On 07/06/2013 19:29, "Michael Della Bitta"
> <michael.della.bitta@appinions.com> wrote:
>
> > Hi Mark,
> >
> > This is a total shot in the dark, but does
> > passing  -Djava.awt.headless=true when you run the server help at all?
> >
> > More on awt headless mode:
> > http://www.oracle.com/technetwork/articles/javase/headless-136834.html
> >
> > Michael Della Bitta
> >
> > Applications Developer
> >
> > o: +1 646 532 3062  | c: +1 917 477 7906
> >
> > appinions inc.
> >
> > “The Science of Influence Marketing”
> >
> > 18 East 41st Street
> >
> > New York, NY 10017
> >
> > t: @appinions <https://twitter.com/Appinions> | g+:
> > plus.google.com/appinions
> > w: appinions.com <http://www.appinions.com/>
> >
> >
> > On Fri, Jun 7, 2013 at 11:31 AM, Mark Wilson <mw8@sanger.ac.uk> wrote:
> >
> >> Hi
> >>
> >> I am having an issue with adding pdf documents to a SolrCloud index I
> have
> >> setup.
> >>
> >> I can index pdf documents fine using 4.3.0 on my local box, but I have a
> >> SolrCloud instance setup on the Amazon Cloud (Using 2 servers) and I get
> >> Error.
> >>
> >> It seems that it is not loading org.apache.pdfbox.pdmodel.PDPage.
> However,
> >> the jar is in the directory, and referenced in the solrconfig.xml file
> >>
> >>   <lib dir="/www/solr/lib/contrib/extraction/lib" regex=".*\.jar" />
> >>   <lib dir="/www/solr/lib/" regex="solr-cell-\d.*\.jar" />
> >>
> >>   <lib dir="/www/solr/lib/contrib/clustering/lib/" regex=".*\.jar" />
> >>   <lib dir="/www/solr/lib/" regex="solr-clustering-\d.*\.jar" />
> >>
> >>   <lib dir="/www/solr/lib/contrib/langid/lib/" regex=".*\.jar" />
> >>   <lib dir="/www/solr/lib/" regex="solr-langid-\d.*\.jar" />
> >>
> >>   <lib dir="/www/solr/lib/contrib/velocity/lib" regex=".*\.jar" />
> >>   <lib dir="/www/solr/lib/" regex="solr-velocity-\d.*\.jar" />
> >>
> >> When I start Tomcat, I can see that the file has loaded.
> >>
> >> 2705 [coreLoadExecutor-4-thread-3] INFO
> >> org.apache.solr.core.SolrResourceLoader  ­ Adding
> >> 'file:/www/solr/lib/contrib/extraction/lib/pdfbox-1.7.1.jar' to
> classloader
> >>
> >> But when I try to add a document.
> >>
> >> java
> >> -Durl=
> >> http://ec2-blah-blaheu-west-1.compute.amazonaws.com:8080/solr/quosa2-c
> >> ollection/update/extract -Dparams=literal.id=pdf1 -Dtype=text/pdf -jar
> >> post.jar 2008.Genomics.pdf
> >>
> >>
> >> I get this error. I¹m running on an Ubuntu machine.
> >>
> >> Linux ip-10-229-125-163 3.5.0-21-generic #32-Ubuntu SMP Tue Dec 11
> 18:51:59
> >> UTC 2012 x86_64 x86_64 x86_64 GNU/Linux
> >>
> >> Error log.
> >>
> >> 88168 [http-bio-8080-exec-1] INFO
> >> org.apache.solr.update.processor.LogUpdateProcessor  ­
> >> [quosa2-collection_shard1_replica1] webapp=/solr path=/update/extract
> >> params={literal.id=pdf1} {} 0 1534
> >> 88180 [http-bio-8080-exec-1] ERROR
> >> org.apache.solr.servlet.SolrDispatchFilter  ­
> >> null:java.lang.RuntimeException: java.lang.UnsatisfiedLinkError:
> >> /usr/lib/jvm/java-7-oracle/jre/lib/amd64/xawt/libmawt.so:
> libXrender.so.1:
> >> cannot open shared object file: No such file or directory
> >>     at
> >>
> >>
> org.apache.solr.servlet.SolrDispatchFilter.sendError(SolrDispatchFilter.java
> >> :670)
> >>     at
> >>
> >>
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:
> >> 380)
> >>     at
> >>
> >>
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:
> >> 155)
> >>     at
> >>
> >>
> org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(Application
> >> FilterChain.java:243)
> >>     at
> >>
> >>
> org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterCh
> >> ain.java:210)
> >>     at
> >>
> >>
> org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.ja
> >> va:222)
> >>     at
> >>
> >>
> org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.ja
> >> va:123)
> >>     at
> >>
> >>
> org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:171
> >> )
> >>     at
> >>
> >>
> org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:99)
> >>     at
> >>
> org.apache.catalina.valves.AccessLogValve.invoke(AccessLogValve.java:947)
> >>     at
> >>
> >>
> org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java
> >> :118)
> >>     at
> >>
> org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:408)
> >>     at
> >>
> >>
> org.apache.coyote.http11.AbstractHttp11Processor.process(AbstractHttp11Proce
> >> ssor.java:1009)
> >>     at
> >>
> >>
> org.apache.coyote.AbstractProtocol$AbstractConnectionHandler.process(Abstrac
> >> tProtocol.java:589)
> >>     at
> >>
> >>
> org.apache.tomcat.util.net.JIoEndpoint$SocketProcessor.run(JIoEndpoint.java:
> >> 310)
> >>     at
> >>
> >>
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:11
> >> 45)
> >>     at
> >>
> >>
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:6
> >> 15)
> >>     at java.lang.Thread.run(Thread.java:722)
> >> Caused by: java.lang.UnsatisfiedLinkError:
> >> /usr/lib/jvm/java-7-oracle/jre/lib/amd64/xawt/libmawt.so:
> libXrender.so.1:
> >> cannot open shared object file: No such file or directory
> >>     at java.lang.ClassLoader$NativeLibrary.load(Native Method)
> >>     at java.lang.ClassLoader.loadLibrary1(ClassLoader.java:1939)
> >>     at java.lang.ClassLoader.loadLibrary0(ClassLoader.java:1864)
> >>     at java.lang.ClassLoader.loadLibrary(ClassLoader.java:1825)
> >>     at java.lang.Runtime.load0(Runtime.java:792)
> >>     at java.lang.System.load(System.java:1059)
> >>     at java.lang.ClassLoader$NativeLibrary.load(Native Method)
> >>     at java.lang.ClassLoader.loadLibrary1(ClassLoader.java:1939)
> >>     at java.lang.ClassLoader.loadLibrary0(ClassLoader.java:1864)
> >>     at java.lang.ClassLoader.loadLibrary(ClassLoader.java:1846)
> >>     at java.lang.Runtime.loadLibrary0(Runtime.java:845)
> >>     at java.lang.System.loadLibrary(System.java:1084)
> >>     at
> sun.security.action.LoadLibraryAction.run(LoadLibraryAction.java:67)
> >>     at
> sun.security.action.LoadLibraryAction.run(LoadLibraryAction.java:47)
> >>     at java.security.AccessController.doPrivileged(Native Method)
> >>     at java.awt.Toolkit.loadLibraries(Toolkit.java:1648)
> >>     at java.awt.Toolkit.<clinit>(Toolkit.java:1670)
> >>     at java.awt.Color.<clinit>(Color.java:275)
> >>     at org.apache.pdfbox.pdmodel.PDPage.<clinit>(PDPage.java:72)
> >>     at
> org.apache.pdfbox.pdmodel.PDPageNode.getAllKids(PDPageNode.java:212)
> >>     at
> org.apache.pdfbox.pdmodel.PDPageNode.getAllKids(PDPageNode.java:184)
> >>     at
> >>
> >>
> org.apache.pdfbox.pdmodel.PDDocumentCatalog.getAllPages(PDDocumentCatalog.ja
> >> va:211)
> >>     at
> >>
> org.apache.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:328)
> >>     at org.apache.tika.parser.pdf.PDF2XHTML.process(PDF2XHTML.java:72)
> >>     at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:153)
> >>     at
> >> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
> >>     at
> >> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
> >>     at
> >> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
> >>     at
> >>
> >>
> org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(ExtractingD
> >> ocumentLoader.java:219)
> >>     at
> >>
> >>
> org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentSt
> >> reamHandlerBase.java:74)
> >>     at
> >>
> >>
> org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.
> >> java:135)
> >>     at
> >>
> >>
> org.apache.solr.core.RequestHandlers$LazyRequestHandlerWrapper.handleRequest
> >> (RequestHandlers.java:242)
> >>     at org.apache.solr.core.SolrCore.execute(SolrCore.java:1816)
> >>     at
> >>
> >>
> org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:6
> >> 56)
> >>     at
> >>
> >>
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:
> >> 359)
> >>     ... 16 more
> >>
> >> 88234 [http-bio-8080-exec-2] INFO  org.apache.solr.update.UpdateHandler
>  ­
> >> start
> >>
> >>
> commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=fa
> >> lse,softCommit=false,prepareCommit=false}
> >> 88268 [http-bio-8080-exec-2] INFO  org.apache.solr.core.SolrCore  ­
> >> SolrDeletionPolicy.onCommit: commits:num=2
> >>
> >> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@
> >> /www/so
> >> lr/home1/quosa2-collection_shard1_replica1/data/index
> >> lockFactory=org.apache.lucene.store.NativeFSLockFactory@7b2d9e10;
> >> maxCacheMB=48.0
> >>
> maxMergeSizeMB=4.0),segFN=segments_15,generation=41,filenames=[segments_15]
> >>
> >> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@
> >> /www/so
> >> lr/home1/quosa2-collection_shard1_replica1/data/index
> >> lockFactory=org.apache.lucene.store.NativeFSLockFactory@7b2d9e10;
> >> maxCacheMB=48.0
> >>
> maxMergeSizeMB=4.0),segFN=segments_16,generation=42,filenames=[segments_16]
> >> 88268 [http-bio-8080-exec-2] INFO  org.apache.solr.core.SolrCore  ­
> newest
> >> commit = 42[segments_16]
> >> 88269 [http-bio-8080-exec-2] INFO
>  org.apache.solr.search.SolrIndexSearcher
> >> ­ Opening Searcher@6175f8cb main
> >> 88270 [searcherExecutor-6-thread-1] INFO  org.apache.solr.core.SolrCore
>  ­
> >> QuerySenderListener sending requests to Searcher@6175f8cb
> >> main{StandardDirectoryReader(segments_15:9:nrt)}
> >> 88271 [searcherExecutor-6-thread-1] INFO  org.apache.solr.core.SolrCore
>  ­
> >> QuerySenderListener done.
> >> 88273 [searcherExecutor-6-thread-1] INFO  org.apache.solr.core.SolrCore
>  ­
> >> [quosa2-collection_shard1_replica1] Registered new searcher
> >> Searcher@6175f8cb main{StandardDirectoryReader(segments_15:9:nrt)}
> >> 88273 [http-bio-8080-exec-2] INFO  org.apache.solr.update.UpdateHandler
>  ­
> >> end_commit_flush
> >> 88288 [http-bio-8080-exec-2] INFO
> >>  org.apache.solr.update.SolrCmdDistributor
> >> ­ Distrib commit to:[StdNode:
> >> http://10.234.215.24:8080/solr/quosa2-collection_shard1_replica3/,
> >> StdNode:
> >> http://10.229.125.163:8080/solr/quosa2-collection_shard2_replica1/,
> >> StdNode:
> >> http://10.234.215.24:8080/solr/quosa2-collection_shard2_replica3/,
> >> StdNode:
> >> http://10.229.125.163:8080/solr/quosa2-collection_shard3_replica1/,
> >> StdNode:
> >> http://10.234.215.24:8080/solr/quosa2-collection_shard3_replica3/]
> >>
> >>
> params:commit_end_point=true&commit=true&softCommit=false&waitSearcher=true&
> >> expungeDeletes=false
> >> 88321 [http-bio-8080-exec-3] INFO  org.apache.solr.update.UpdateHandler
>  ­
> >> start
> >>
> >>
> commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=fa
> >> lse,softCommit=false,prepareCommit=false}
> >> 88334 [http-bio-8080-exec-4] INFO  org.apache.solr.update.UpdateHandler
>  ­
> >> start
> >>
> >>
> commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=fa
> >> lse,softCommit=false,prepareCommit=false}
> >> 88351 [http-bio-8080-exec-3] INFO  org.apache.solr.core.SolrCore  ­
> >> SolrDeletionPolicy.onCommit: commits:num=2
> >>
> >> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@
> >> /www/so
> >> lr/home1/quosa2-collection_shard2_replica1/data/index
> >> lockFactory=org.apache.lucene.store.NativeFSLockFactory@7d5549ce;
> >> maxCacheMB=48.0
> >>
> maxMergeSizeMB=4.0),segFN=segments_19,generation=45,filenames=[segments_19]
> >>
> >> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@
> >> /www/so
> >> lr/home1/quosa2-collection_shard2_replica1/data/index
> >> lockFactory=org.apache.lucene.store.NativeFSLockFactory@7d5549ce;
> >> maxCacheMB=48.0
> >>
> maxMergeSizeMB=4.0),segFN=segments_1a,generation=46,filenames=[segments_1a]
> >> 88354 [http-bio-8080-exec-3] INFO  org.apache.solr.core.SolrCore  ­
> newest
> >> commit = 46[segments_1a]
> >> 88357 [http-bio-8080-exec-3] INFO
>  org.apache.solr.search.SolrIndexSearcher
> >> ­ Opening Searcher@775f6084 main
> >> 88360 [http-bio-8080-exec-4] INFO  org.apache.solr.core.SolrCore  ­
> >> SolrDeletionPolicy.onCommit: commits:num=2
> >>
> >> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@
> >> /www/so
> >> lr/home1/quosa2-collection_shard3_replica1/data/index
> >> lockFactory=org.apache.lucene.store.NativeFSLockFactory@782256a2;
> >> maxCacheMB=48.0
> >>
> maxMergeSizeMB=4.0),segFN=segments_19,generation=45,filenames=[segments_19]
> >>
> >> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@
> >> /www/so
> >> lr/home1/quosa2-collection_shard3_replica1/data/index
> >> lockFactory=org.apache.lucene.store.NativeFSLockFactory@782256a2;
> >> maxCacheMB=48.0
> >>
> maxMergeSizeMB=4.0),segFN=segments_1a,generation=46,filenames=[segments_1a]
> >> 88362 [http-bio-8080-exec-4] INFO  org.apache.solr.core.SolrCore  ­
> newest
> >> commit = 46[segments_1a]
> >> 88363 [http-bio-8080-exec-4] INFO
>  org.apache.solr.search.SolrIndexSearcher
> >> ­ Opening Searcher@4a2415c5 main
> >> 88364 [searcherExecutor-5-thread-1] INFO  org.apache.solr.core.SolrCore
>  ­
> >> QuerySenderListener sending requests to Searcher@4a2415c5
> >> main{StandardDirectoryReader(segments_19:1:nrt)}
> >> 88365 [searcherExecutor-5-thread-1] INFO  org.apache.solr.core.SolrCore
>  ­
> >> QuerySenderListener done.
> >> 88367 [searcherExecutor-5-thread-1] INFO  org.apache.solr.core.SolrCore
>  ­
> >> [quosa2-collection_shard3_replica1] Registered new searcher
> >> Searcher@4a2415c5 main{StandardDirectoryReader(segments_19:1:nrt)}
> >> 88367 [http-bio-8080-exec-4] INFO  org.apache.solr.update.UpdateHandler
>  ­
> >> end_commit_flush
> >> 88368 [searcherExecutor-7-thread-1] INFO  org.apache.solr.core.SolrCore
>  ­
> >> QuerySenderListener sending requests to Searcher@775f6084
> >> main{StandardDirectoryReader(segments_19:1:nrt)}
> >> 88368 [searcherExecutor-7-thread-1] INFO  org.apache.solr.core.SolrCore
>  ­
> >> QuerySenderListener done.
> >> 88367 [http-bio-8080-exec-4] INFO
> >> org.apache.solr.update.processor.LogUpdateProcessor  ­
> >> [quosa2-collection_shard3_replica1] webapp=/solr path=/update
> >>
> >>
> params={waitSearcher=true&commit=true&wt=javabin&expungeDeletes=false&commit
> >> _end_point=true&version=2&softCommit=false} {commit=} 0 33
> >> 88371 [http-bio-8080-exec-3] INFO  org.apache.solr.update.UpdateHandler
>  ­
> >> end_commit_flush
> >> 88370 [searcherExecutor-7-thread-1] INFO  org.apache.solr.core.SolrCore
>  ­
> >> [quosa2-collection_shard2_replica1] Registered new searcher
> >> Searcher@775f6084 main{StandardDirectoryReader(segments_19:1:nrt)}
> >> 88376 [http-bio-8080-exec-3] INFO
> >> org.apache.solr.update.processor.LogUpdateProcessor  ­
> >> [quosa2-collection_shard2_replica1] webapp=/solr path=/update
> >>
> >>
> params={waitSearcher=true&commit=true&wt=javabin&expungeDeletes=false&commit
> >> _end_point=true&version=2&softCommit=false} {commit=} 0 55
> >> 88379 [http-bio-8080-exec-2] INFO
> >> org.apache.solr.update.processor.LogUpdateProcessor  ­
> >> [quosa2-collection_shard1_replica1] webapp=/solr path=/update/extract
> >> params={commit=true&literal.id=pdf1} {commit=} 0 149
> >>
> >>
> >> When I try to add the file again, I get another error.
> >>
> >> SimplePostTool version 1.5
> >> Posting files to base url
> >>
> http://server:8080/solr/quosa2-collection/update/extract?literal.id=pdf1
> >> using content-type text/pdf..
> >> POSTing file 2008.Genomics.pdf
> >> SimplePostTool: WARNING: Solr returned an error #500 Internal Server
> Error
> >> SimplePostTool: WARNING: IOException while reading response:
> >> java.io.IOException: Server returned HTTP response code: 500 for URL:
> >>
> >>
> http://ec2-54-228-131-56.eu-west-1.compute.amazonaws.com:8080/solr/quosa2-co
> >> llection/update/extract?literal.id=pdf1
> >> 1 files indexed.
> >> COMMITting Solr index changes to
> >>
> >>
> http://ec2-54-228-131-56.eu-west-1.compute.amazonaws.com:8080/solr/quosa2-co
> >> llection/update/extract?literal.id=pdf1..
> >> Time spent: 0:00:00.341
> >>
> >> Error Log
> >>
> >> 731088 [http-bio-8080-exec-8] INFO
> >> org.apache.solr.update.processor.LogUpdateProcessor  ­
> >> [quosa2-collection_shard1_replica1] webapp=/solr path=/update/extract
> >> params={literal.id=pdf1} {} 0 170
> >> 731094 [http-bio-8080-exec-8] ERROR
> >> org.apache.solr.servlet.SolrDispatchFilter  ­
> >> null:java.lang.RuntimeException: java.lang.NoClassDefFoundError: Could
> not
> >> initialize class org.apache.pdfbox.pdmodel.PDPage
> >>     at
> >>
> >>
> org.apache.solr.servlet.SolrDispatchFilter.sendError(SolrDispatchFilter.java
> >> :670)
> >>     at
> >>
> >>
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:
> >> 380)
> >>     at
> >>
> >>
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:
> >> 155)
> >>     at
> >>
> >>
> org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(Application
> >> FilterChain.java:243)
> >>     at
> >>
> >>
> org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterCh
> >> ain.java:210)
> >>     at
> >>
> >>
> org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.ja
> >> va:222)
> >>     at
> >>
> >>
> org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.ja
> >> va:123)
> >>     at
> >>
> >>
> org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:171
> >> )
> >>     at
> >>
> >>
> org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:99)
> >>     at
> >>
> org.apache.catalina.valves.AccessLogValve.invoke(AccessLogValve.java:947)
> >>     at
> >>
> >>
> org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java
> >> :118)
> >>     at
> >>
> org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:408)
> >>     at
> >>
> >>
> org.apache.coyote.http11.AbstractHttp11Processor.process(AbstractHttp11Proce
> >> ssor.java:1009)
> >>     at
> >>
> >>
> org.apache.coyote.AbstractProtocol$AbstractConnectionHandler.process(Abstrac
> >> tProtocol.java:589)
> >>     at
> >>
> >>
> org.apache.tomcat.util.net.JIoEndpoint$SocketProcessor.run(JIoEndpoint.java:
> >> 310)
> >>     at
> >>
> >>
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:11
> >> 45)
> >>     at
> >>
> >>
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:6
> >> 15)
> >>     at java.lang.Thread.run(Thread.java:722)
> >> Caused by: java.lang.NoClassDefFoundError: Could not initialize class
> >> org.apache.pdfbox.pdmodel.PDPage
> >>     at
> org.apache.pdfbox.pdmodel.PDPageNode.getAllKids(PDPageNode.java:212)
> >>     at
> org.apache.pdfbox.pdmodel.PDPageNode.getAllKids(PDPageNode.java:184)
> >>     at
> >>
> >>
> org.apache.pdfbox.pdmodel.PDDocumentCatalog.getAllPages(PDDocumentCatalog.ja
> >> va:211)
> >>     at
> >>
> org.apache.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:328)
> >>     at org.apache.tika.parser.pdf.PDF2XHTML.process(PDF2XHTML.java:72)
> >>     at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:153)
> >>     at
> >> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
> >>     at
> >> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
> >>     at
> >> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
> >>     at
> >>
> >>
> org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(ExtractingD
> >> ocumentLoader.java:219)
> >>     at
> >>
> >>
> org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentSt
> >> reamHandlerBase.java:74)
> >>     at
> >>
> >>
> org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.
> >> java:135)
> >>     at
> >>
> >>
> org.apache.solr.core.RequestHandlers$LazyRequestHandlerWrapper.handleRequest
> >> (RequestHandlers.java:242)
> >>     at org.apache.solr.core.SolrCore.execute(SolrCore.java:1816)
> >>     at
> >>
> >>
> org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:6
> >> 56)
> >>     at
> >>
> >>
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:
> >> 359)
> >>     ... 16 more
> >>
> >> 731110 [http-bio-8080-exec-9] INFO
>  org.apache.solr.update.UpdateHandler  ­
> >> start
> >>
> >>
> commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=fa
> >> lse,softCommit=false,prepareCommit=false}
> >> 731143 [http-bio-8080-exec-9] INFO  org.apache.solr.core.SolrCore  ­
> >> SolrDeletionPolicy.onCommit: commits:num=2
> >>
> >> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@
> >> /www/so
> >> lr/home1/quosa2-collection_shard1_replica1/data/index
> >> lockFactory=org.apache.lucene.store.NativeFSLockFactory@7b2d9e10;
> >> maxCacheMB=48.0
> >>
> maxMergeSizeMB=4.0),segFN=segments_16,generation=42,filenames=[segments_16]
> >>
> >> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@
> >> /www/so
> >> lr/home1/quosa2-collection_shard1_replica1/data/index
> >> lockFactory=org.apache.lucene.store.NativeFSLockFactory@7b2d9e10;
> >> maxCacheMB=48.0
> >>
> maxMergeSizeMB=4.0),segFN=segments_17,generation=43,filenames=[segments_17]
> >> 731144 [http-bio-8080-exec-9] INFO  org.apache.solr.core.SolrCore  ­
> newest
> >> commit = 43[segments_17]
> >> 731145 [http-bio-8080-exec-9] INFO
> >>  org.apache.solr.search.SolrIndexSearcher
> >> ­ Opening Searcher@64148dd8 main
> >> 731146 [searcherExecutor-6-thread-1] INFO
>  org.apache.solr.core.SolrCore  ­
> >> QuerySenderListener sending requests to Searcher@64148dd8
> >> main{StandardDirectoryReader(segments_15:9:nrt)}
> >> 731147 [searcherExecutor-6-thread-1] INFO
>  org.apache.solr.core.SolrCore  ­
> >> QuerySenderListener done.
> >> 731148 [searcherExecutor-6-thread-1] INFO
>  org.apache.solr.core.SolrCore  ­
> >> [quosa2-collection_shard1_replica1] Registered new searcher
> >> Searcher@64148dd8 main{StandardDirectoryReader(segments_15:9:nrt)}
> >> 731149 [http-bio-8080-exec-9] INFO
>  org.apache.solr.update.UpdateHandler  ­
> >> end_commit_flush
> >> 731149 [http-bio-8080-exec-9] INFO
> >> org.apache.solr.update.SolrCmdDistributor  ­ Distrib commit to:[StdNode:
> >> http://10.234.215.24:8080/solr/quosa2-collection_shard1_replica3/,
> >> StdNode:
> >> http://10.229.125.163:8080/solr/quosa2-collection_shard2_replica1/,
> >> StdNode:
> >> http://10.234.215.24:8080/solr/quosa2-collection_shard2_replica3/,
> >> StdNode:
> >> http://10.229.125.163:8080/solr/quosa2-collection_shard3_replica1/,
> >> StdNode:
> >> http://10.234.215.24:8080/solr/quosa2-collection_shard3_replica3/]
> >>
> >>
> params:commit_end_point=true&commit=true&softCommit=false&waitSearcher=true&
> >> expungeDeletes=false
> >> 731170 [http-bio-8080-exec-1] INFO
>  org.apache.solr.update.UpdateHandler  ­
> >> start
> >>
> >>
> commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=fa
> >> lse,softCommit=false,prepareCommit=false}
> >> 731174 [http-bio-8080-exec-10] INFO
>  org.apache.solr.update.UpdateHandler
> >>  ­
> >> start
> >>
> >>
> commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=fa
> >> lse,softCommit=false,prepareCommit=false}
> >> 731191 [http-bio-8080-exec-1] INFO  org.apache.solr.core.SolrCore  ­
> >> SolrDeletionPolicy.onCommit: commits:num=2
> >>
> >> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@
> >> /www/so
> >> lr/home1/quosa2-collection_shard3_replica1/data/index
> >> lockFactory=org.apache.lucene.store.NativeFSLockFactory@782256a2;
> >> maxCacheMB=48.0
> >>
> maxMergeSizeMB=4.0),segFN=segments_1a,generation=46,filenames=[segments_1a]
> >>
> >> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@
> >> /www/so
> >> lr/home1/quosa2-collection_shard3_replica1/data/index
> >> lockFactory=org.apache.lucene.store.NativeFSLockFactory@782256a2;
> >> maxCacheMB=48.0
> >>
> maxMergeSizeMB=4.0),segFN=segments_1b,generation=47,filenames=[segments_1b]
> >> 731192 [http-bio-8080-exec-1] INFO  org.apache.solr.core.SolrCore  ­
> newest
> >> commit = 47[segments_1b]
> >> 731192 [http-bio-8080-exec-1] INFO
> >>  org.apache.solr.search.SolrIndexSearcher
> >> ­ Opening Searcher@49bbdad main
> >> 731193 [searcherExecutor-5-thread-1] INFO
>  org.apache.solr.core.SolrCore  ­
> >> QuerySenderListener sending requests to Searcher@49bbdad
> >> main{StandardDirectoryReader(segments_19:1:nrt)}
> >> 731200 [searcherExecutor-5-thread-1] INFO
>  org.apache.solr.core.SolrCore  ­
> >> QuerySenderListener done.
> >> 731202 [http-bio-8080-exec-10] INFO  org.apache.solr.core.SolrCore  ­
> >> SolrDeletionPolicy.onCommit: commits:num=2
> >>
> >> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@
> >> /www/so
> >> lr/home1/quosa2-collection_shard2_replica1/data/index
> >> lockFactory=org.apache.lucene.store.NativeFSLockFactory@7d5549ce;
> >> maxCacheMB=48.0
> >>
> maxMergeSizeMB=4.0),segFN=segments_1a,generation=46,filenames=[segments_1a]
> >>
> >> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@
> >> /www/so
> >> lr/home1/quosa2-collection_shard2_replica1/data/index
> >> lockFactory=org.apache.lucene.store.NativeFSLockFactory@7d5549ce;
> >> maxCacheMB=48.0
> >>
> maxMergeSizeMB=4.0),segFN=segments_1b,generation=47,filenames=[segments_1b]
> >> 731202 [http-bio-8080-exec-10] INFO  org.apache.solr.core.SolrCore  ­
> >> newest
> >> commit = 47[segments_1b]
> >> 731203 [http-bio-8080-exec-10] INFO
> >> org.apache.solr.search.SolrIndexSearcher  ­ Opening Searcher@5ddf50femain
> >> 731193 [http-bio-8080-exec-1] INFO
>  org.apache.solr.update.UpdateHandler  ­
> >> end_commit_flush
> >> 731205 [searcherExecutor-7-thread-1] INFO
>  org.apache.solr.core.SolrCore  ­
> >> QuerySenderListener sending requests to Searcher@5ddf50fe
> >> main{StandardDirectoryReader(segments_19:1:nrt)}
> >> 731205 [searcherExecutor-7-thread-1] INFO
>  org.apache.solr.core.SolrCore  ­
> >> QuerySenderListener done.
> >> 731207 [searcherExecutor-7-thread-1] INFO
>  org.apache.solr.core.SolrCore  ­
> >> [quosa2-collection_shard2_replica1] Registered new searcher
> >> Searcher@5ddf50fe main{StandardDirectoryReader(segments_19:1:nrt)}
> >> 731204 [searcherExecutor-5-thread-1] INFO
>  org.apache.solr.core.SolrCore  ­
> >> [quosa2-collection_shard3_replica1] Registered new searcher
> >> Searcher@49bbdad
> >> main{StandardDirectoryReader(segments_19:1:nrt)}
> >> 731207 [http-bio-8080-exec-10] INFO
>  org.apache.solr.update.UpdateHandler
> >>  ­
> >> end_commit_flush
> >> 731208 [http-bio-8080-exec-10] INFO
> >> org.apache.solr.update.processor.LogUpdateProcessor  ­
> >> [quosa2-collection_shard2_replica1] webapp=/solr path=/update
> >>
> >>
> params={waitSearcher=true&commit=true&wt=javabin&expungeDeletes=false&commit
> >> _end_point=true&version=2&softCommit=false} {commit=} 0 35
> >> 731210 [http-bio-8080-exec-1] INFO
> >> org.apache.solr.update.processor.LogUpdateProcessor  ­
> >> [quosa2-collection_shard3_replica1] webapp=/solr path=/update
> >>
> >>
> params={waitSearcher=true&commit=true&wt=javabin&expungeDeletes=false&commit
> >> _end_point=true&version=2&softCommit=false} {commit=} 0 41
> >> 731216 [http-bio-8080-exec-9] INFO
> >> org.apache.solr.update.processor.LogUpdateProcessor  ­
> >> [quosa2-collection_shard1_replica1] webapp=/solr path=/update/extract
> >> params={commit=true&literal.id=pdf1} {commit=} 0 106
> >>
> >>
> >> Does anyone have any idea how to resolve this?
> >>
> >> Thanks in advance
> >>
> >> Mark
> >>
> >>
> >>
> >>
> >>
> >> --
> >>  The Wellcome Trust Sanger Institute is operated by Genome Research
> >>  Limited, a charity registered in England with number 1021457 and a
> >>  company registered in England with number 2742969, whose registered
> >>  office is 215 Euston Road, London, NW1 2BE.
> >>
> >>
>
>
>
> --
>  The Wellcome Trust Sanger Institute is operated by Genome Research
>  Limited, a charity registered in England with number 1021457 and a
>  company registered in England with number 2742969, whose registered
>  office is 215 Euston Road, London, NW1 2BE.
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message