lucene-dev mailing list archives

From "Grant Ingersoll (JIRA)"
Subject [jira] Commented: (SOLR-1902) Tika no longer properly extracts content in Solr
Date Mon, 03 May 2010 22:35:16 GMT


Grant Ingersoll commented on SOLR-1902:

Further debugging shows that Tika did not load any parsers on startup. That is the difference
from the test runs, and it explains why the tests pass.

> Tika no longer properly extracts content in Solr
> ------------------------------------------------
>                 Key: SOLR-1902
>                 URL:
>             Project: Solr
>          Issue Type: Bug
>          Components: contrib - Solr Cell (Tika extraction)
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
> See
> It appears that since the upgrade to Tika 0.7, Tika is now selecting an EmptyParser when
> uploading docs, which then outputs an empty XHTML representation.  Still, it's strange that
> the tests pass.
