lucene-dev mailing list archives

From "Robert Muir (JIRA)" <j...@apache.org>
Subject [jira] Updated: (LUCENE-2298) Polish Analyzer
Date Sat, 06 Mar 2010 00:49:27 GMT

     [ https://issues.apache.org/jira/browse/LUCENE-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Muir updated LUCENE-2298:
--------------------------------

    Attachment: LUCENE-2298.patch

Attached is an initial patch. I made the following changes to the original code:
* added generics and other Java 5 features
* added JUnit tests
* added a Polish analyzer, with a stopword list from Carrot2
* repackaged the code

You will need to extract the stemming table into the resources/org.apache.lucene.analysis.pl folder,
right where the Polish stopwords are. I will attach the table separately.

> Polish Analyzer
> ---------------
>
>                 Key: LUCENE-2298
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2298
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: contrib/analyzers
>    Affects Versions: 3.1
>            Reporter: Robert Muir
>            Assignee: Robert Muir
>             Fix For: 3.1
>
>         Attachments: LUCENE-2298.patch
>
>
> Andrzej Bialecki has written a Polish stemmer and provided stemming tables for it under the Apache License.
> You can read more about it here: http://www.getopt.org/stempel/
> In fact, the stemmer is general-purpose code, so we could perhaps use it for other languages too.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org

