jackrabbit-dev mailing list archives

From KÖLL Claus <C.KO...@TIROL.GV.AT>
Subject Problems indexing pdf file with DBDatastore
Date Wed, 07 May 2008 08:25:05 GMT
Hi,

If I import a PDF file into a repository configured
with a DbDataStore, I get the following exception.
This happens only if I configure the data store with copyWhenReading=true.

If I set it to false, no exception occurs.
Is this a bug, or should I always set it to false?

java.io.IOException: Stream closed
	at java.io.BufferedInputStream.getBufIfOpen(BufferedInputStream.java:156)
	at java.io.BufferedInputStream.read(BufferedInputStream.java:315)
	at org.apache.jackrabbit.core.data.db.TempFileInputStream.read(TempFileInputStream.java:107)
	at java.io.BufferedInputStream.read1(BufferedInputStream.java:265)
	at java.io.BufferedInputStream.read(BufferedInputStream.java:324)
	at java.io.BufferedInputStream.fill(BufferedInputStream.java:229)
	at java.io.BufferedInputStream.read(BufferedInputStream.java:246)
	at java.io.FilterInputStream.read(FilterInputStream.java:89)
	at java.io.PushbackInputStream.read(PushbackInputStream.java:141)
	at org.pdfbox.io.PushBackInputStream.peek(PushBackInputStream.java:71)
	at org.pdfbox.io.PushBackInputStream.isEOF(PushBackInputStream.java:88)
	at org.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:370)
	at org.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:176)
	at org.apache.jackrabbit.extractor.PdfTextExtractor.extractText(PdfTextExtractor.java:69)
	at org.apache.jackrabbit.extractor.CompositeTextExtractor.extractText(CompositeTextExtractor.java:90)
	at org.apache.jackrabbit.core.query.lucene.JackrabbitTextExtractor.extractText(JackrabbitTextExtractor.java:195)
	at org.apache.jackrabbit.core.query.lucene.NodeIndexer.addBinaryValue(NodeIndexer.java:393)
	at org.apache.jackrabbit.core.query.lucene.NodeIndexer.addValue(NodeIndexer.java:282)
	at org.apache.jackrabbit.core.query.lucene.NodeIndexer.createDoc(NodeIndexer.java:221)
	at org.apache.jackrabbit.core.query.lucene.SearchIndex.createDocument(SearchIndex.java:892)
	at org.apache.jackrabbit.core.query.lucene.SearchIndex$2.next(SearchIndex.java:543)
	at org.apache.jackrabbit.core.query.lucene.MultiIndex.update(MultiIndex.java:428)
	at org.apache.jackrabbit.core.query.lucene.SearchIndex.updateNodes(SearchIndex.java:527)
	at org.apache.jackrabbit.core.SearchManager.onEvent(SearchManager.java:504)
	at org.apache.jackrabbit.core.observation.EventConsumer.consumeEvents(EventConsumer.java:231)
	at org.apache.jackrabbit.core.observation.ObservationDispatcher.dispatchEvents(ObservationDispatcher.java:201)
	at org.apache.jackrabbit.core.observation.EventStateCollection.dispatch(EventStateCollection.java:425)
	at org.apache.jackrabbit.core.state.SharedItemStateManager$Update.end(SharedItemStateManager.java:737)
	at org.apache.jackrabbit.core.state.SharedItemStateManager.update(SharedItemStateManager.java:873)
	at org.apache.jackrabbit.core.state.LocalItemStateManager.update(LocalItemStateManager.java:324)
	at org.apache.jackrabbit.core.state.XAItemStateManager.update(XAItemStateManager.java:306)
	at org.apache.jackrabbit.core.state.LocalItemStateManager.update(LocalItemStateManager.java:300)
	at org.apache.jackrabbit.core.state.SessionItemStateManager.update(SessionItemStateManager.java:317)
	at org.apache.jackrabbit.core.ItemImpl.save(ItemImpl.java:1247)
	at org.apache.jackrabbit.core.SessionImpl.save(SessionImpl.java:897)
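
The top frames of the trace (a read on a BufferedInputStream that has already been closed) can be reproduced outside Jackrabbit. The following is only a minimal sketch of one way such an error can arise: a wrapper that closes the underlying stream when it reaches end of file, followed by a caller that reads once more. The class CloseAtEofStream and the method readPastEof are hypothetical names made up for illustration; this is not the actual TempFileInputStream code, and whether this is what happens with copyWhenReading=true is an assumption.

```java
import java.io.BufferedInputStream;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;

class StreamClosedSketch {

    /**
     * Hypothetical wrapper, NOT the real TempFileInputStream:
     * it closes the wrapped stream as soon as EOF is seen and
     * keeps forwarding later reads to the (now closed) stream.
     */
    static class CloseAtEofStream extends InputStream {
        private final InputStream in;

        CloseAtEofStream(InputStream in) {
            this.in = in;
        }

        @Override
        public int read() throws IOException {
            int b = in.read();
            if (b < 0) {
                in.close(); // eager cleanup at end of file
            }
            return b;
        }
    }

    /** Reads to EOF, then reads once more, as a parser peeking for EOF might. */
    static String readPastEof() {
        InputStream in = new CloseAtEofStream(
                new BufferedInputStream(
                        new ByteArrayInputStream(new byte[] {1, 2, 3})));
        try {
            while (in.read() >= 0) {
                // consume all bytes; the last call returns -1 and closes the stream
            }
            in.read(); // extra read after EOF reaches the closed BufferedInputStream
            return "no exception";
        } catch (IOException e) {
            return e.getMessage();
        }
    }

    public static void main(String[] args) {
        System.out.println(readPastEof()); // prints "Stream closed"
    }
}
```

If the wrapper instead returned -1 for every read after EOF, without touching the closed stream again, the extra read would be harmless.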

BR,
claus

