jackrabbit-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Thomas Müller" <thomas.muel...@day.com>
Subject Re: Problems indexing pdf file with DBDatastore
Date Wed, 07 May 2008 12:53:05 GMT
Hi,

This is a bug in TempFileInputStream. I will log a bug and fix the
problem. Thanks for reporting this issue.

Regards,
Thomas

On Wed, May 7, 2008 at 10:25 AM, KÖLL Claus <C.KOELL@tirol.gv.at> wrote:
> hi,
>
>  if i import a pdf file into my repository configured
>  with a dbdatastore i get following exception.
>  this happens only if i configure the datastore with copyWhenReading=true
>
>  if i set it to false no exception occurs.
>  is this a bug or should i set it always to false ?
>
>  java.io.IOException: Stream closed
>         at java.io.BufferedInputStream.getBufIfOpen(BufferedInputStream.java:156)
>         at java.io.BufferedInputStream.read(BufferedInputStream.java:315)
>         at org.apache.jackrabbit.core.data.db.TempFileInputStream.read(TempFileInputStream.java:107)
>         at java.io.BufferedInputStream.read1(BufferedInputStream.java:265)
>         at java.io.BufferedInputStream.read(BufferedInputStream.java:324)
>         at java.io.BufferedInputStream.fill(BufferedInputStream.java:229)
>         at java.io.BufferedInputStream.read(BufferedInputStream.java:246)
>         at java.io.FilterInputStream.read(FilterInputStream.java:89)
>         at java.io.PushbackInputStream.read(PushbackInputStream.java:141)
>         at org.pdfbox.io.PushBackInputStream.peek(PushBackInputStream.java:71)
>         at org.pdfbox.io.PushBackInputStream.isEOF(PushBackInputStream.java:88)
>         at org.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:370)
>         at org.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:176)
>         at org.apache.jackrabbit.extractor.PdfTextExtractor.extractText(PdfTextExtractor.java:69)
>         at org.apache.jackrabbit.extractor.CompositeTextExtractor.extractText(CompositeTextExtractor.java:90)
>         at org.apache.jackrabbit.core.query.lucene.JackrabbitTextExtractor.extractText(JackrabbitTextExtractor.java:195)
>         at org.apache.jackrabbit.core.query.lucene.NodeIndexer.addBinaryValue(NodeIndexer.java:393)
>         at org.apache.jackrabbit.core.query.lucene.NodeIndexer.addValue(NodeIndexer.java:282)
>         at org.apache.jackrabbit.core.query.lucene.NodeIndexer.createDoc(NodeIndexer.java:221)
>         at org.apache.jackrabbit.core.query.lucene.SearchIndex.createDocument(SearchIndex.java:892)
>         at org.apache.jackrabbit.core.query.lucene.SearchIndex$2.next(SearchIndex.java:543)
>         at org.apache.jackrabbit.core.query.lucene.MultiIndex.update(MultiIndex.java:428)
>         at org.apache.jackrabbit.core.query.lucene.SearchIndex.updateNodes(SearchIndex.java:527)
>         at org.apache.jackrabbit.core.SearchManager.onEvent(SearchManager.java:504)
>         at org.apache.jackrabbit.core.observation.EventConsumer.consumeEvents(EventConsumer.java:231)
>         at org.apache.jackrabbit.core.observation.ObservationDispatcher.dispatchEvents(ObservationDispatcher.java:201)
>         at org.apache.jackrabbit.core.observation.EventStateCollection.dispatch(EventStateCollection.java:425)
>         at org.apache.jackrabbit.core.state.SharedItemStateManager$Update.end(SharedItemStateManager.java:737)
>         at org.apache.jackrabbit.core.state.SharedItemStateManager.update(SharedItemStateManager.java:873)
>         at org.apache.jackrabbit.core.state.LocalItemStateManager.update(LocalItemStateManager.java:324)
>         at org.apache.jackrabbit.core.state.XAItemStateManager.update(XAItemStateManager.java:306)
>         at org.apache.jackrabbit.core.state.LocalItemStateManager.update(LocalItemStateManager.java:300)
>         at org.apache.jackrabbit.core.state.SessionItemStateManager.update(SessionItemStateManager.java:317)
>         at org.apache.jackrabbit.core.ItemImpl.save(ItemImpl.java:1247)
>         at org.apache.jackrabbit.core.SessionImpl.save(SessionImpl.java:897)
>
>  BR,
>  claus
>
>

Mime
View raw message