lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mikhail Khludnev <mkhlud...@griddynamics.com>
Subject Re: Importing XML into SOLR, identifying a failed import document
Date Wed, 04 Feb 2015 07:56:25 GMT
giving
https://github.com/apache/lucene-solr/blob/trunk/solr/contrib/dataimporthandler/src/java/org/apache/solr/handler/dataimport/XPathEntityProcessor.java#L309
you need to specify
onError="continue"
and check the log for LOG.warn("Failed for url : "...

Developers, would you mind to fix typo: applying XSL Transformeation  ?


On Tue, Feb 3, 2015 at 9:10 PM, Morris, Paul E. <pmorris@nsf.gov> wrote:

> Hi All,
>
> I'm using SOLR 4.9.0 to import XML using /dataimport from the dashboard
> and a suitably configured xml-data-config.xml file.
>
> Everything works fine, but very occasionally I encounter a bad XML file
> and the XML importhandler fails with the following error, and the index
> rolls-back.
>
> Caused by: org.apache.solr.handler.dataimport.DataImportHandlerException:
> Exception in applying XSL Transformeation Processing Document # 1582437
>
> The error shows the Processing Document #, but I have millions of XML
> files, which contain more than one <Document> and are distributed across
> many recursive folders. It is proving really hard to track down the
> Document that SOLR is rejecting, based just on the document number.
>
> Has anyone else hit this sort of problem? Could the filename of the XML
> file that failed be exposed perhaps?
>
> With thanks,
>
> Paul
>
>
>


-- 
Sincerely yours
Mikhail Khludnev
Principal Engineer,
Grid Dynamics

<http://www.griddynamics.com>
<mkhludnev@griddynamics.com>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message