lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Pete Lewis" <p...@uptima.co.uk>
Subject Re: PorterStemfilter
Date Tue, 14 Sep 2004 18:44:46 GMT
Hi David

I like KStem more than Porter / Snowball - but still has limitations
although performs better as it has a dictionary to augment the rules.

Note that KStem will also treat "print" and "printer" as two distinct terms,
probably treating it as verb and noun respectively.

Cheers

Pete Lewis

----- Original Message ----- 
From: "David Spencer" <dave-lucene-user@tropo.com>
To: "Lucene Users List" <lucene-user@jakarta.apache.org>
Sent: Tuesday, September 14, 2004 7:19 PM
Subject: Re: PorterStemfilter


> Honey George wrote:
>
> > Hi,
> >  This might be more of a questing related to the
> > PorterStemmer algorithm rather than with lucene, but
> > if anyone has the knowledge please share.
>
> You might want to also try the Snowball stemmer:
>
> http://jakarta.apache.org/lucene/docs/lucene-sandbox/snowball/
>
> And KStem:
>
> http://ciir.cs.umass.edu/downloads/
> >
> > I am using the PorterStemFilter that some with lucene
> > and it turns out that searching for the word 'printer'
> > does not return a document containing the text
> > 'print'. To narrow down the problem, I have tested the
> > PorterStemFilter in a standalone programs and it turns
> > out that the stem of printer is 'printer' and not
> > 'print'. That is 'printer' is not equal to 'print' +
> > 'er', the whole of the word is stem. Can somebody
> > explain the behavior.
> >
> > Thanks & Regards,
> >    George
> >
> >
> >
> >
> >
> > ___________________________________________________________ALL-NEW
Yahoo! Messenger - all new features - even more fun!
http://uk.messenger.yahoo.com
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> > For additional commands, e-mail: lucene-user-help@jakarta.apache.org
> >
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
>
>


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message