jackrabbit-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From KÖLL Claus <C.KO...@TIROL.GV.AT>
Subject AW: AW: AW: Jackrabbit indexing in a separate thread
Date Mon, 27 Feb 2012 12:38:44 GMT
Hi Anton,

It seems that you index the pdf File as fulltext ?!?

>org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:530)
>     at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:172)
>     at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:878)
>     at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:843)
>     at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:74)
>     at org.apache.tika.parser.ParserDecorator.parse(ParserDecorator.java:91)

I think you have disabled it ?
Indexing huge pdf files will take some time and memory :-)

greets
claus

Mime
View raw message