lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jan Høydahl (JIRA) <>
Subject [jira] [Commented] (LUCENE-3414) Bring Hunspell for Lucene into analysis module
Date Mon, 05 Sep 2011 20:45:09 GMT


Jan Høydahl commented on LUCENE-3414:


We now use Lucene Hunspell for a few customer deployments, and it would be great to have it
the analysis module, since it supports some 70-80 languages out of the box, and gives great
flexibility since you can edit - or augment - the dictionaries to change behaviour and fix
stemming bugs.

As a side benefit I also expect that when the Ooo dictionaries get more use in Lucene, users
will over time be able to extend and improve the dictionaries, and contribute their changes
back, benefiting also Ooo users.

> Bring Hunspell for Lucene into analysis module
> ----------------------------------------------
>                 Key: LUCENE-3414
>                 URL:
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: modules/analysis
>            Reporter: Chris Male
> Some time ago I along with Robert and Uwe, wrote an Stemmer which uses the Hunspell algorithm.
 It has the benefit of supporting dictionaries for a wide array of languages.   
> It seems to still be being used but has fallen out of date.  I think it would benefit
from being inside the analysis module where additional features such as decompounding support,
could be added.

This message is automatically generated by JIRA.
For more information on JIRA, see:


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message