lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Koji Sekiguchi (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SOLR-2512) uima: add an ability to skip runtime error in AnalysisEngine
Date Thu, 12 May 2011 10:14:48 GMT

    [ https://issues.apache.org/jira/browse/SOLR-2512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13032339#comment-13032339
] 

Koji Sekiguchi commented on SOLR-2512:
--------------------------------------

Hi Tommaso, thank you for updating the patch!

In my patch, I try to log the first 100 chars of the target text in the error message because
an online NLP service I'm using is error-prone when I post a large text. But you are using
SolrInputDocument in the updated patch. I'd like my method rather than logging whole solr
document.

I think that users who set ignoreErrors=true want to know the fact that the error occurs,
but don't want to see whole document in the error message.

> uima: add an ability to skip runtime error in AnalysisEngine
> ------------------------------------------------------------
>
>                 Key: SOLR-2512
>                 URL: https://issues.apache.org/jira/browse/SOLR-2512
>             Project: Solr
>          Issue Type: Improvement
>    Affects Versions: 3.1
>            Reporter: Koji Sekiguchi
>            Priority: Minor
>             Fix For: 3.2, 4.0
>
>         Attachments: SOLR-2512.patch, SOLR-2512.patch
>
>
> Currently, if AnalysisEngine throws an exception during processing a text, whole adding
docs go fail. Because online NLP services are error-prone, users should be able to choose
whether solr skips the text processing (but source text can be indexed) for the document or
throws a runtime exception so that solr can stop adding documents entirely.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message