lucene-java-user mailing list archives

From "Mark Mei" <vmsluc...@gmail.com>
Subject digester/lucene runtime problems
Date Wed, 03 Jan 2007 15:48:36 GMT
Hi,

We have 500k XML documents in a single file. When we ran Digester/Lucene on
it, we got the following error messages. We had previously run a smaller file
with 20k XML documents without any problems. Can anyone help us resolve this problem?
Thank you very much.

Regards,

Mark


Jan 3, 2007 10:17:37 AM org.apache.commons.digester.Digester endElement
SEVERE: End event threw exception
java.lang.reflect.InvocationTargetException
        at sun.reflect.GeneratedMethodAccessor23.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:585)
        at org.apache.commons.beanutils.MethodUtils.invokeMethod(MethodUtils.java:216)
        at org.apache.commons.digester.SetNextRule.end(SetNextRule.java:217)
        at org.apache.commons.digester.Rule.end(Rule.java:253)
        at org.apache.commons.digester.Digester.endElement(Digester.java:1222)
        at org.apache.xerces.parsers.AbstractSAXParser.endElement(Unknown Source)
        at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanEndElement(Unknown Source)
        at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(Unknown Source)
        at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source)
        at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
        at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
        at org.apache.xerces.parsers.XMLParser.parse(Unknown Source)
        at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
        at org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown Source)
        at org.apache.commons.digester.Digester.parse(Digester.java:1728)
        at DOCUMENTParserIndexer.main(DOCUMENTParserIndexer.java:127)
Caused by: java.lang.NullPointerException
        at org.apache.lucene.index.FieldsReader.doc(FieldsReader.java:82)
        at org.apache.lucene.index.SegmentReader.document(SegmentReader.java:284)
        at org.apache.lucene.index.SegmentMerger.mergeFields(SegmentMerger.java:186)
        at org.apache.lucene.index.SegmentMerger.merge(SegmentMerger.java:88)
        at org.apache.lucene.index.IndexWriter.mergeSegments(IndexWriter.java:681)
        at org.apache.lucene.index.IndexWriter.mergeSegments(IndexWriter.java:658)
        at org.apache.lucene.index.IndexWriter.maybeMergeSegments(IndexWriter.java:646)
        at org.apache.lucene.index.IndexWriter.addDocument(IndexWriter.java:453)
        at org.apache.lucene.index.IndexWriter.addDocument(IndexWriter.java:436)
        at DOCUMENTParserIndexer.addDOCUMENT(DOCUMENTParserIndexer.java:59)
        ... 18 more
Exception in thread "main" java.lang.NullPointerException
        at org.apache.commons.digester.Digester.createSAXException(Digester.java:3181)
        at org.apache.commons.digester.Digester.createSAXException(Digester.java:3207)
        at org.apache.commons.digester.Digester.endElement(Digester.java:1225)
        at org.apache.xerces.parsers.AbstractSAXParser.endElement(Unknown Source)
        at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanEndElement(Unknown Source)
        at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(Unknown Source)
        at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source)
        at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
        at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
        at org.apache.xerces.parsers.XMLParser.parse(Unknown Source)
        at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
        at org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown Source)
        at org.apache.commons.digester.Digester.parse(Digester.java:1728)
        at DOCUMENTParserIndexer.main(DOCUMENTParserIndexer.java:127)
