lucene-solr-user mailing list archives

From Guido Medina <guido.med...@temetra.com>
Subject Re: ExtractRequestHandler, skipping errors
Date Fri, 18 Oct 2013 12:48:15 GMT
Don't. Commons Compress 1.5 is broken; use either 1.4.1 or a release newer
than 1.5. Our app stopped compressing properly after a Maven update.

Guido.

On 18/10/13 12:40, Roland Everaert wrote:
> I will open a JIRA issue. I suppose I just have to create an account
> first?
>
>
> Regards,
>
>
> Roland.
>
>
> On Fri, Oct 18, 2013 at 12:05 PM, Koji Sekiguchi <koji@r.email.ne.jp> wrote:
>
>> Hi,
>>
>> I don't think the flag can ignore NoSuchMethodError. Something else may
>> be wrong here.
>>
>> ... I've just checked my Solr 4.5 directories and found that the Tika
>> version is 1.4.
>>
>> Tika 1.4 seems to use commons compress 1.5:
>>
>> http://svn.apache.org/viewvc/tika/tags/1.4/tika-parsers/pom.xml?view=markup
>>
>> But I see commons-compress-1.4.1.jar in solr/contrib/extraction/lib/
>> directory.
>>
>> Can you open a JIRA issue?
>>
>> For now, you can get commons compress 1.5 and put it in that directory
>> (don't forget to remove the 1.4.1 jar file).
>>
>> koji
>>
>>
>> (13/10/18 16:37), Roland Everaert wrote:
>>
>>> Hi,
>>>
>>> We already configure the ExtractRequestHandler to ignore Tika exceptions,
>>> but it is Solr that complains. The customer managed to reproduce the
>>> problem. Following is the error from solr.log. The file type that caused
>>> this exception was WMZ. It seems that something is missing in a Solr
>>> class. We use Solr 4.4.
>>>
>>> ERROR - 2013-10-17 18:13:48.902; org.apache.solr.common.SolrException;
>>> null:java.lang.RuntimeException: java.lang.NoSuchMethodError: org.apache.commons.compress.compressors.CompressorStreamFactory.setDecompressConcatenated(Z)V
>>>       at org.apache.solr.servlet.SolrDispatchFilter.sendError(SolrDispatchFilter.java:673)
>>>       at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:383)
>>>       at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:158)
>>>       at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:243)
>>>       at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:210)
>>>       at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:222)
>>>       at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:123)
>>>       at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:171)
>>>       at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:99)
>>>       at org.apache.catalina.valves.AccessLogValve.invoke(AccessLogValve.java:953)
>>>       at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:118)
>>>       at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:408)
>>>       at org.apache.coyote.http11.AbstractHttp11Processor.process(AbstractHttp11Processor.java:1023)
>>>       at org.apache.coyote.AbstractProtocol$AbstractConnectionHandler.process(AbstractProtocol.java:589)
>>>       at org.apache.tomcat.util.net.AprEndpoint$SocketProcessor.run(AprEndpoint.java:1852)
>>>       at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
>>>       at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
>>>       at java.lang.Thread.run(Unknown Source)
>>> Caused by: java.lang.NoSuchMethodError: org.apache.commons.compress.compressors.CompressorStreamFactory.setDecompressConcatenated(Z)V
>>>       at org.apache.tika.parser.pkg.CompressorParser.parse(CompressorParser.java:102)
>>>       at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
>>>       at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
>>>       at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
>>>       at org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(ExtractingDocumentLoader.java:219)
>>>       at org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:74)
>>>       at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:135)
>>>       at org.apache.solr.core.RequestHandlers$LazyRequestHandlerWrapper.handleRequest(RequestHandlers.java:241)
>>>       at org.apache.solr.core.SolrCore.execute(SolrCore.java:1904)
>>>       at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:659)
>>>       at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:362)
>>>       ... 16 more
>>>
>>>
>>>
>>>
>>>
>>> On Thu, Oct 17, 2013 at 5:19 PM, Koji Sekiguchi <koji@r.email.ne.jp>
>>> wrote:
>>>
>>>> Hi Roland,
>>>>
>>>> (13/10/17 20:44), Roland Everaert wrote:
>>>>
>>>>> Hi,
>>>>>
>>>>> I helped a customer deploy Solr + ManifoldCF and everything is going
>>>>> quite smoothly, but every time Solr raises an exception, the
>>>>> ManifoldCF job feeding Solr aborts. I would like to know if it is
>>>>> possible to configure the ExtractRequestHandler to ignore errors, as
>>>>> seems to be possible with DataImportHandler and entity processors.
>>>>>
>>>>> I know that it is possible to configure the ExtractRequestHandler to
>>>>> ignore Tika exceptions (we already do that), but the errors that now
>>>>> stop the ManifoldCF jobs are generated by Solr itself.
>>>>>
>>>>> While it would be interesting to have such an option in Solr, I plan
>>>>> to post to the ManifoldCF mailing list anyway, to find out if it is
>>>>> possible to configure ManifoldCF to be less picky about Solr errors.
>>>>
>>>> ignoreTikaException flag might help you?
>>>> https://issues.apache.org/jira/browse/SOLR-2480
>>>>
>>>> koji
>>>> --
>>>> http://soleami.com/blog/automatically-acquiring-synonym-knowledge-from-wikipedia.html
>>>>
>> --
>> http://www.rondhuit.com/en/
>>
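
The mismatch discussed in this thread (Tika 1.4 calling
setDecompressConcatenated(boolean) while commons-compress-1.4.1.jar is on the
class path) can be checked directly with plain Java reflection. The sketch
below is only an illustration: the class name CompressCheck and its two helper
methods are hypothetical, not part of Solr, Tika, or Commons Compress. It
reports which location CompressorStreamFactory was loaded from and whether the
method named in the NoSuchMethodError is present.

```java
import java.security.CodeSource;

// Hypothetical diagnostic helper; uses only the standard Java reflection API.
public class CompressCheck {

    /** True if the class can be loaded and has a public method with this name and parameter types. */
    static boolean hasMethod(String className, String methodName, Class<?>... paramTypes) {
        try {
            Class.forName(className).getMethod(methodName, paramTypes);
            return true;
        } catch (ClassNotFoundException | NoSuchMethodException | LinkageError e) {
            return false;
        }
    }

    /** Location (usually a jar URL) the class was loaded from, or a placeholder string. */
    static String loadedFrom(String className) {
        try {
            CodeSource src = Class.forName(className).getProtectionDomain().getCodeSource();
            return src == null ? "(bootstrap class path)" : String.valueOf(src.getLocation());
        } catch (ClassNotFoundException | LinkageError e) {
            return "(not on class path)";
        }
    }

    public static void main(String[] args) {
        // Class and method names are taken from the stack trace in this thread.
        String factory = "org.apache.commons.compress.compressors.CompressorStreamFactory";
        System.out.println("loaded from: " + loadedFrom(factory));
        System.out.println("setDecompressConcatenated(boolean) present: "
                + hasMethod(factory, "setDecompressConcatenated", boolean.class));
    }
}
```

To be meaningful, this would need to run with the same jars Solr loads, for
example with the contents of solr/contrib/extraction/lib/ on the class path.
If the method is reported as missing, that is consistent with the
NoSuchMethodError above.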

