lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jayendra Patil (JIRA)" <j...@apache.org>
Subject [jira] Created: (SOLR-2416) Solr Cell & DataImport Tika handler broken - fails to index Zip file contents
Date Wed, 09 Mar 2011 19:49:59 GMT
Solr Cell & DataImport Tika handler broken - fails to index Zip file contents
-----------------------------------------------------------------------------

                 Key: SOLR-2416
                 URL: https://issues.apache.org/jira/browse/SOLR-2416
             Project: Solr
          Issue Type: Bug
          Components: contrib - DataImportHandler, contrib - Solr Cell (Tika extraction)
    Affects Versions: 4.0
            Reporter: Jayendra Patil


Working with the latest Solr Trunk code and seems the Tika handlers for Solr Cell (ExtractingDocumentLoader.java)
and Data Import handler (TikaEntityProcessor.java) fails to index the zip file contents again.
It just indexes the file names again.
This issue was addressed some time back, late last year, but seems to have reappeared with
the latest code.

Jira for the Data Import handler part with the patch and the testcase - https://issues.apache.org/jira/browse/SOLR-2332.


--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message