lucene-dev mailing list archives

From "Emmanuel Espina (Created) (JIRA)" <>
Subject [jira] [Created] (SOLR-3246) UpdateRequestProcessor to extract Solr XML from rich documents
Date Wed, 14 Mar 2012 17:04:41 GMT
UpdateRequestProcessor to extract Solr XML from rich documents

                 Key: SOLR-3246
             Project: Solr
          Issue Type: New Feature
          Components: update
            Reporter: Emmanuel Espina
            Priority: Minor

This would be an update request processor that saves the XML representation of each document
as a file in an external directory. The original idea was to add it to the processing chain
of the ExtractingRequestHandler so that an already parsed version of each document is stored.
Keeping these pre-parsed documents would make reindexing the entire index faster: the Tika
phase is skipped and the XML is sent straight to the standard update processor.
As a side effect, extracting the XML can make debugging rich documents easier.
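
A rough sketch of the core step, for illustration only: turning a document's fields into Solr
update XML (<add><doc><field name="...">) and writing it to a directory. It uses only the JDK
so it can run on its own. The class name SolrXmlDump, its method names and the file-naming
rule are all assumptions made for this example, not part of any patch. In a real processor
this logic would presumably be called from UpdateRequestProcessor.processAdd(AddUpdateCommand)
with the fields taken from the SolrInputDocument.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not an existing Solr class.
public class SolrXmlDump {

    // Escape characters that are special in XML text and attribute values.
    static String escape(String s) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            switch (c) {
                case '&': sb.append("&amp;"); break;
                case '<': sb.append("&lt;"); break;
                case '>': sb.append("&gt;"); break;
                case '"': sb.append("&quot;"); break;
                default: sb.append(c);
            }
        }
        return sb.toString();
    }

    // Build a Solr <add><doc> message; multi-valued fields repeat the <field> element.
    static String toSolrXml(Map<String, List<String>> fields) {
        StringBuilder sb = new StringBuilder("<add>\n  <doc>\n");
        for (Map.Entry<String, List<String>> e : fields.entrySet()) {
            for (String value : e.getValue()) {
                sb.append("    <field name=\"").append(escape(e.getKey())).append("\">")
                  .append(escape(value)).append("</field>\n");
            }
        }
        return sb.append("  </doc>\n</add>\n").toString();
    }

    // Write the XML to <dir>/<id>.xml, replacing characters that are unsafe in file names.
    // Assumption: ids stay unique after sanitizing; a real implementation should check this.
    static Path save(Path dir, String id, String xml) throws IOException {
        Files.createDirectories(dir);
        String safeName = id.replaceAll("[^A-Za-z0-9._-]", "_") + ".xml";
        Path target = dir.resolve(safeName);
        Files.write(target, xml.getBytes(StandardCharsets.UTF_8));
        return target;
    }
}
```

The saved files could later be posted to the regular XML update handler for reindexing,
without running Tika again.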
