lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Luca Cavanna (JIRA)" <>
Subject [jira] [Updated] (LUCENE-3976) Improve error messages for unsupported Hunspell formats
Date Mon, 07 May 2012 12:09:48 GMT


Luca Cavanna updated LUCENE-3976:

    Attachment: LUCENE-3976.patch

The patch tries to address unexpected errors while parsing affix files and dictionaries. I
just added an external try catch with a generic "Error while parsing the affix/dictionary
file", in my opinion better than just eventually throwing some unchecked exception. Let me
know if there's something else we can improve meanwhile.
> Improve error messages for unsupported Hunspell formats
> -------------------------------------------------------
>                 Key: LUCENE-3976
>                 URL:
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>            Reporter: Chris Male
>         Attachments: LUCENE-3976.patch, LUCENE-3976.patch
> Our hunspell implementation is never going to be able to support the huge variety of
formats that are out there, especially since our impl is based on papers written on the topic
rather than being a pure port.
> Recently we ran into the following suffix rule:
> {noformat}SFX CA 0 /CaCp{noformat}
> Due to the missing regex conditional, an AOE was being thrown, which made it difficult
to diagnose the problem.
> We should instead try to provide better error messages showing what we were unable to

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:!default.jspa
For more information on JIRA, see:


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message