lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chris Male (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (LUCENE-3414) Bring Hunspell for Lucene into analysis module
Date Tue, 06 Sep 2011 04:55:10 GMT

     [ https://issues.apache.org/jira/browse/LUCENE-3414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Chris Male updated LUCENE-3414:
-------------------------------

    Attachment: LUCENE-3414.patch

Patch with a port of the code.

Because most of the dictionaries are L/GPL, I've written my own dumb stupid dictionary for
test purposes.

During testing I discovered a long standing bug to do with recursive application of rules
This has now been fixed.

Code now is also version aware, as required by the CharArray* data structures.

> Bring Hunspell for Lucene into analysis module
> ----------------------------------------------
>
>                 Key: LUCENE-3414
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3414
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: modules/analysis
>            Reporter: Chris Male
>         Attachments: LUCENE-3414.patch
>
>
> Some time ago I along with Robert and Uwe, wrote an Stemmer which uses the Hunspell algorithm.
 It has the benefit of supporting dictionaries for a wide array of languages.   
> It seems to still be being used but has fallen out of date.  I think it would benefit
from being inside the analysis module where additional features such as decompounding support,
could be added.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message