manifoldcf-dev mailing list archives

From Erlend Garåsen (JIRA) <j...@apache.org>
Subject [jira] [Commented] (CONNECTORS-200) Solr connector should treat TikaException the same as a 400 response
Date Fri, 20 May 2011 11:30:47 GMT

    [ https://issues.apache.org/jira/browse/CONNECTORS-200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13036784#comment-13036784 ]

Erlend Garåsen commented on CONNECTORS-200:
-------------------------------------------

I checked out the latest from trunk and did a test crawl with documents I know will return
a TikaException due to the following Tika bug:
https://issues.apache.org/jira/browse/TIKA-418

The job ended successfully and MCF did not try to fetch the affected documents over and over
again even though TikaExceptions were thrown. In other words, it seems to work as it should
now.

> Solr connector should treat TikaException the same as a 400 response
> --------------------------------------------------------------------
>
>                 Key: CONNECTORS-200
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-200
>             Project: ManifoldCF
>          Issue Type: Improvement
>          Components: Lucene/SOLR connector
>    Affects Versions: ManifoldCF 0.1, ManifoldCF 0.2, ManifoldCF 0.3
>            Reporter: Karl Wright
>            Assignee: Karl Wright
>
> Solr connector should treat TikaException the same as a 400 response, which is to skip
> the document.
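
The rule in the issue description can be sketched roughly as follows. This is only an illustration, not the actual ManifoldCF Solr connector code: the class name, the Action enum, and the decide() helper are all hypothetical, and the sketch assumes that a Tika parse failure shows up as an error response whose body mentions "TikaException". How the real connector detects that case is not stated in this message.

```java
public class SolrResponsePolicySketch {

    /** What the caller should do with a document after an indexing attempt. */
    public enum Action { ACCEPT, SKIP, RETRY }

    /**
     * Hypothetical decision helper (not part of ManifoldCF).
     *
     * @param statusCode HTTP status code returned by Solr
     * @param body       response body text, may be null
     */
    public static Action decide(int statusCode, String body) {
        // Any 2xx response means the document was indexed.
        if (statusCode >= 200 && statusCode < 300) {
            return Action.ACCEPT;
        }
        // A 400 means Solr rejected this document; retrying will not help.
        if (statusCode == 400) {
            return Action.SKIP;
        }
        // Assumption: a Tika parse failure is reported in the response body.
        // Treat it like a 400 and skip the document instead of retrying.
        if (body != null && body.contains("TikaException")) {
            return Action.SKIP;
        }
        // Everything else is treated as a possibly transient failure.
        return Action.RETRY;
    }

    public static void main(String[] args) {
        System.out.println(decide(200, "OK"));
        System.out.println(decide(400, "Bad request"));
        System.out.println(decide(500, "org.apache.tika.exception.TikaException: parse failed"));
        System.out.println(decide(503, "Service unavailable"));
    }
}
```

The point of mapping both cases to the same outcome is that a document Tika cannot parse will fail the same way on every attempt, so retrying it only keeps the job from finishing.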

