jackrabbit-dev mailing list archives

From Angela Schreiber <anch...@day.com>
Subject Re: TikaException while uploading odt file using WebDAV
Date Wed, 16 Jun 2010 13:04:09 GMT
jerome quaglia wrote:
> hi angela, thanks for your support. can you show me the line of code you
> used when you tried to put an odt file? many thanks

i didn't write any extra code for that; i just deployed the war file and
connected with a DAV client that did the PUT for me. if it is in fact
a tika issue that causes the problem, there is no need to dig
into the details of the PUT... but i might be mistaken.
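
for reference, a PUT does not need a DAV library at all. below is a
minimal sketch using only the JDK, not the code from either setup in this
thread. the class name PlainDavPut and the helpers readNonEmpty and put
are made up for illustration, authentication is left out (the server may
well require credentials), and the content type is the registered one for
odt files. since the file is reported as empty on the server, the sketch
also refuses to send an empty payload, which is one thing worth ruling out
on the client side.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.URI;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper class, for illustration only.
class PlainDavPut {

    // Registered MIME type for OpenDocument text files.
    static final String ODT_TYPE = "application/vnd.oasis.opendocument.text";

    // Reads the whole file and fails early if there is nothing to upload.
    static byte[] readNonEmpty(Path file) throws IOException {
        byte[] data = Files.readAllBytes(file);
        if (data.length == 0) {
            throw new IOException("refusing to upload empty file: " + file);
        }
        return data;
    }

    // Sends the bytes with a plain HTTP PUT and returns the status code.
    // Assumption: no authentication is needed; add an Authorization
    // header if the server asks for credentials.
    static int put(String url, byte[] data, String contentType) throws IOException {
        HttpURLConnection con =
                (HttpURLConnection) URI.create(url).toURL().openConnection();
        con.setRequestMethod("PUT");
        con.setDoOutput(true);
        con.setRequestProperty("Content-Type", contentType);
        // Announce the exact body length instead of buffering or chunking.
        con.setFixedLengthStreamingMode(data.length);
        try (OutputStream out = con.getOutputStream()) {
            out.write(data);
        }
        try {
            return con.getResponseCode();
        } finally {
            con.disconnect();
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 2) {
            System.err.println("usage: PlainDavPut <local-file> <target-url>");
            return;
        }
        byte[] data = readNonEmpty(Path.of(args[0]));
        System.out.println("HTTP status: " + put(args[1], data, ODT_TYPE));
    }
}
```

comparing the byte count on the client with the size of the resource on
the server afterwards would show whether the content gets lost before or
after the PUT.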

regards
angela


> regards jerome
> ----- Original Message ----- From: "Angela Schreiber" <anchela@day.com>
> To: <dev@jackrabbit.apache.org>
> Sent: Tuesday, June 08, 2010 10:35 AM
> Subject: Re: TikaException while uploading odt file using WebDAV
> 
> 
>>
>> hi jerome
>>
>> i quickly tried it with my current jackrabbit 2.2-snapshot and
>> didn't run into this problem.
>>
>> to me this rather looks like a TIKA problem, and in fact
>> the TIKA dependency was upgraded in the trunk after the jackrabbit 2.1
>> branch was split off [1].
>>
>> jukka is probably the right person to ask for further details
>> about possible TIKA issues... however, TIKA-327 indeed refers to a
>> bug report in the TIKA project [2]
>>
>> regards
>> angela
>>
>> [1] https://issues.apache.org/jira/browse/JCR-2593
>> [2] https://issues.apache.org/jira/browse/TIKA-327
>>
>> jerome quaglia wrote:
>>> hello,
>>>
>>> i am trying to upload an odt file with the put method, but the file is
>>> empty on the webdav server when i open it.
>>> i use jackrabbit standalone server 2.1.0 and the sardine api to put
>>> the file.
>>> the signature of the put method is put(String url, byte[] b).
>>> i develop with eclipse on windows xp.
>>>
>>> the lines of code that produce the error in the server log are:
>>> byte[] davdata=FileUtils.readFileToByteArray(new File("C:\\jboss-5.1.0.GA\\server\\default\\deploy\\ecard.war\\fusions\\modelefusioner"+this.dossnum+".odt"));
>>>
>>> webdav.put("http://127.0.0.1:8090/repository/default/"+"modelefusioner"+this.dossnum+".odt", davdata);
>>>
>>>
>>>
>>>
>>>
>>> THERE IS THE FOLLOWING ERROR
>>> LazyTextExtractorField: Failed to extract text from a binary property
>>> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException from org.apache.tika.parser.xml.DcXMLParser@77c7bee2
>>>     at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:130)
>>>     at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:101)
>>>     at org.apache.jackrabbit.core.query.lucene.JackrabbitParser.parse(JackrabbitParser.java:189)
>>>     at org.apache.jackrabbit.core.query.lucene.LazyTextExtractorField$ParsingTask.run(LazyTextExtractorField.java:174)
>>>     at java.util.concurrent.Executors$RunnableAdapter.call(Unknown Source)
>>>     at java.util.concurrent.FutureTask$Sync.innerRun(Unknown Source)
>>>     at java.util.concurrent.FutureTask.run(Unknown Source)
>>>     at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(Unknown Source)
>>>     at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(Unknown Source)
>>>     at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(Unknown Source)
>>>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
>>>     at java.lang.Thread.run(Unknown Source)
>>> Caused by: org.xml.sax.SAXParseException: Premature end of file.
>>>     at org.apache.xerces.util.ErrorHandlerWrapper.createSAXParseException(Unknown Source)
>>>     at org.apache.xerces.util.ErrorHandlerWrapper.fatalError(Unknown Source)
>>>     at org.apache.xerces.impl.XMLErrorReporter.reportError(Unknown Source)
>>>     at org.apache.xerces.impl.XMLErrorReporter.reportError(Unknown Source)
>>>     at org.apache.xerces.impl.XMLVersionDetector.determineDocVersion(Unknown Source)
>>>     at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
>>>     at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
>>>     at org.apache.xerces.parsers.XMLParser.parse(Unknown Source)
>>>     at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
>>>     at org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown Source)
>>>     at org.apache.xerces.jaxp.SAXParserImpl.parse(Unknown Source)
>>>     at javax.xml.parsers.SAXParser.parse(Unknown Source)
>>>     at org.apache.tika.parser.xml.XMLParser.parse(XMLParser.java:72)
>>>     at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:120)
>>>     ... 11 more
>>> 04.06.2010 21:37:44 *WARN * LazyTextExtractorField: Failed to extract text from a binary property
>>> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException from org.apache.tika.parser.xml.DcXMLParser@77c7bee2
>>> Caused by: org.xml.sax.SAXParseException: Premature end of file.
>>> 04.06.2010 21:49:38 *WARN * LazyTextExtractorField: Failed to extract text from a binary property
>>> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException from org.apache.tika.parser.xml.DcXMLParser@77c7bee2
>>> Caused by: org.xml.sax.SAXParseException: Premature end of file.
>>> THANKS FOR SUPPORT
>>> sorry, i am french and don't speak english very well. thanks a lot