jackrabbit-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Seidel. Robert" <Robert.Sei...@aeb.de>
Subject AW: Disable full-text document parsing
Date Thu, 23 May 2013 15:41:41 GMT
Hi,

I'm using a simple trick to define an index configuration with just one dummy property for
nt:resource

<?xml version="1.0"?>
<configuration xmlns:nt="http://www.jcp.org/jcr/nt/1.0">
    <index-rule nodeType="nt:resource">
        <property>DONOFULLTEXTINDEXING</property>
    </index-rule>
</configuration>

I don't know if the parsing is still done, but the properties are not saved and the index
does not grow.

Regards, Robert

-----Urspr√ľngliche Nachricht-----
Von: Thomas Auinger [mailto:thomas.auinger@byteconsult.de]
Gesendet: Donnerstag, 23. Mai 2013 16:49
An: users@jackrabbit.apache.org
Betreff: Disable full-text document parsing

Hi

How can I disable parsing of binary content stored in JR? Especially for an existing database.

We don't do a full text search anyway..

My log shows all kinds of funny exceptions, which I don't want, such as:

23.Mai 15:34:56.518 <pool-1> WARN  [   o.a.p.pdmodel.font.PDFontFactory] - Failed to
create Type1C font. Falling back to Type1 font
java.lang.IndexOutOfBoundsException: Index: 2, Size: 2
        at java.util.ArrayList$SubList.rangeCheck(Unknown Source) ~[na:1.7.0_10]
        at java.util.ArrayList$SubList.get(Unknown Source) ~[na:1.7.0_10]
        at org.apache.fontbox.cff.CharStringConverter.drawAlternatingCurve(CharStringConverter.java:306)
~[fontbox-1.3.1.jar:na]
        at org.apache.fontbox.cff.CharStringConverter.handleType1Command(CharStringConverter.java:141)
~[fontbox-1.3.1.jar:na]

or

Caused by: java.io.EOFException: Unexpected end of ZLIB input stream
        at java.util.zip.InflaterInputStream.fill(Unknown Source) ~[na:1.7.0_10]
        at java.util.zip.InflaterInputStream.read(Unknown Source) ~[na:1.7.0_10]
        at java.util.zip.ZipInputStream.read(Unknown Source) ~[na:1.7.0_10]
        at java.io.FilterInputStream.read(Unknown Source) ~[na:1.7.0_10]
        at org.apache.poi.openxml4j.util.ZipInputStreamZipEntrySource$FakeZipEntry.<init>(ZipInputStreamZipEntrySource.java:114)
~[poi-ooxml-3.7.jar:3.7]
        at org.apache.poi.openxml4j.util.ZipInputStreamZipEntrySource.<init>(ZipInputStreamZipEntrySource.java:55)
~[poi-ooxml-3.7.jar:3.7]

or

java.lang.NullPointerException: null
        at org.apache.fontbox.cff.CharStringRenderer.rlineTo(CharStringRenderer.java:291)
~[fontbox-1.3.1.jar:na]
        at org.apache.fontbox.cff.CharStringRenderer.handleCommandType1(CharStringRenderer.java:231)
~[fontbox-1.3.1.jar:na]

or

java.lang.NoSuchFieldError: filesystem
        at org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:185) ~[poi-scratchpad-3.7.jar:3.7]
        at org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:131) ~[poi-scratchpad-3.7.jar:3.7]

or

Caused by: java.lang.NullPointerException: null
        at org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor.handleEmbedded(AbstractOOXMLExtractor.java:135)
~[tika-parsers-0.8.jar:na]
        at org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor.getXHTML(AbstractOOXMLExtractor.java:115)
~[tika-parsers-0.8.jar:na]


All stack-traces originate in

        at org.apache.jackrabbit.core.query.lucene.LazyTextExtractorField$ParsingTask.run(LazyTextExtractorField.java:174)
~[jackrabbit-core-2.2.7.jar:2.2.7]
        at java.util.concurrent.Executors$RunnableAdapter.call(Unknown Source) [na:1.7.0_10]
        at java.util.concurrent.FutureTask$Sync.innerRun(Unknown Source) [na:1.7.0_10]
        at java.util.concurrent.FutureTask.run(Unknown Source) [na:1.7.0_10]
        at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(Unknown
Source) [na:1.7.0_10]
        at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(Unknown
Source) [na:1.7.0_10]
        at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source) [na:1.7.0_10]
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source) [na:1.7.0_10]
        at java.lang.Thread.run(Unknown Source) [na:1.7.0_10]

Many thanks
Thomas
________________________________

AEB treffen Sie im Juni auf diesen Veranstaltungen:
transport logistic | 4.-7. Juni 2013 | M√ľnchen
EXCHAiNGE | 18.-19. Juni 2013 | Frankfurt am Main
Weitere Informationen und Terminreservierung unter: www.aeb.de/events<http://logi4.xiti.com/gopc.url?xts=487638&xtor=AD-5-[aeb%20mails]-[link%20in%20mailsignatur]-[intext]-[e-mail-signatur]-[0]-[]&url=http://www.aeb.de/de/events/index.php>

Mime
View raw message