lucene-dev mailing list archives

From "Robert Muir (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (LUCENE-152) [PATCH] KStem for Lucene
Date Thu, 02 Jun 2011 12:22:47 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13042732#comment-13042732 ]

Robert Muir commented on LUCENE-152:
------------------------------------

Ryan: maybe, I thought of this too myself looking at the patch.

Then again there are probably other kinds of refactoring improvements we could make... honestly
I didn't dig deep enough into this one to see if it can be solved just by 'add Appendable
interface to CharsRef' or to even think if that's the right thing to do.

I don't think we should move it out of the analysis package for now (maybe I shouldn't have
put it in util even in the patch) unless there's something else that actually wants to use
it: I think this would be premature.
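
For readers unfamiliar with the 'add Appendable interface' idea mentioned above, here is a minimal sketch of what implementing java.lang.Appendable on a growable char buffer can look like. The class AppendableChars and its setLength method are hypothetical names used only for illustration. This is not Lucene's CharsRef and not the code in the attached patch, only a rough picture of the shape such a change might take.

```java
import java.util.Arrays;

/**
 * Hypothetical growable char buffer that implements Appendable.
 * Illustration only: this is NOT Lucene's CharsRef or the patch's helper class.
 */
final class AppendableChars implements Appendable, CharSequence {
    private char[] chars = new char[16];
    private int length = 0;

    /** Grow the backing array (at least doubling it) when more room is needed. */
    private void ensureCapacity(int needed) {
        if (needed > chars.length) {
            chars = Arrays.copyOf(chars, Math.max(needed, chars.length * 2));
        }
    }

    @Override
    public AppendableChars append(char c) {
        ensureCapacity(length + 1);
        chars[length++] = c;
        return this;
    }

    @Override
    public AppendableChars append(CharSequence csq) {
        // The Appendable contract treats a null argument as the text "null".
        CharSequence s = (csq == null) ? "null" : csq;
        return append(s, 0, s.length());
    }

    @Override
    public AppendableChars append(CharSequence csq, int start, int end) {
        CharSequence s = (csq == null) ? "null" : csq;
        if (start < 0 || end > s.length() || start > end) {
            throw new IndexOutOfBoundsException();
        }
        ensureCapacity(length + (end - start));
        for (int i = start; i < end; i++) {
            chars[length++] = s.charAt(i);
        }
        return this;
    }

    /** Truncate the buffer to newLength characters (hypothetical helper). */
    public void setLength(int newLength) {
        if (newLength < 0 || newLength > length) {
            throw new IndexOutOfBoundsException();
        }
        length = newLength;
    }

    @Override
    public int length() {
        return length;
    }

    @Override
    public char charAt(int index) {
        if (index < 0 || index >= length) {
            throw new IndexOutOfBoundsException();
        }
        return chars[index];
    }

    @Override
    public CharSequence subSequence(int start, int end) {
        if (start < 0 || end > length || start > end) {
            throw new IndexOutOfBoundsException();
        }
        return new String(chars, start, end - start);
    }

    @Override
    public String toString() {
        return new String(chars, 0, length);
    }
}
```

With something like this, code that edits a word in place could truncate with setLength and then append a replacement suffix without allocating intermediate strings. Whether that is the right refactoring for the patch is exactly the open question in the comment above.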

> [PATCH] KStem for Lucene
> ------------------------
>
>                 Key: LUCENE-152
>                 URL: https://issues.apache.org/jira/browse/LUCENE-152
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: unspecified
>         Environment: Operating System: other
> Platform: Other
>            Reporter: Otis Gospodnetic
>            Priority: Minor
>             Fix For: 3.3, 4.0
>
>         Attachments: LUCENE-152.patch, kstemTestData.zip, lucid_kstem.tgz
>
>
> September 10th 2003 contribution from "Sergio Guzman-Lara" <guzman@cs.umass.edu>
> Original email:
> Hi all,
>   I have ported the kstem stemmer to Java and incorporated it into 
> Lucene. You can get the source code (Kstem.jar) from the following website:
> http://ciir.cs.umass.edu/downloads/
>   Just click on "KStem Java Implementation" (you will need to register 
> your e-mail, for free of course, with the CIIR --Center for Intelligent 
> Information Retrieval, UMass -- and get an access code).
> Content of Kstem.jar:
> java/org/apache/lucene/analysis/KStemData1.java
> java/org/apache/lucene/analysis/KStemData2.java
> java/org/apache/lucene/analysis/KStemData3.java
> java/org/apache/lucene/analysis/KStemData4.java
> java/org/apache/lucene/analysis/KStemData5.java
> java/org/apache/lucene/analysis/KStemData6.java
> java/org/apache/lucene/analysis/KStemData7.java
> java/org/apache/lucene/analysis/KStemData8.java
> java/org/apache/lucene/analysis/KStemFilter.java
> java/org/apache/lucene/analysis/KStemmer.java
> KStemData1.java, ..., KStemData8.java   Contain several lists of words 
> used by Kstem
> KStemmer.java      Implements the Kstem algorithm 
> KStemFilter.java     Extends TokenFilter applying Kstem
> To compile
> unjar the file Kstem.jar to Lucene's "src" directory, and compile it 
> there. 
> What is Kstem?
>   A stemmer designed by Bob Krovetz (for more information see 
> http://ciir.cs.umass.edu/pubfiles/ir-35.pdf). 
> Copyright issues
>   This is open source. The actual license agreement is included at the 
> top of every source file.
>  Any comments/questions/suggestions are welcome,
>   Sergio Guzman-Lara
>   Senior Research Fellow
>   CIIR UMass

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org

