lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Chris Hostetter <hossman_luc...@fucit.org>
Subject Re: Inquiry on Lucene Stemming
Date Sat, 20 Dec 2008 19:12:21 GMT

: Well some client inquiries if it's possible to expand such simple words 
: and does Lucene have an API for this logic? Because all I read was the 
: stemming logic for Lucene was the other way around which is, example 
: "flashing" it will be trimmed to the root word "flash" when searched.

there are two fundemental approaches (that i know of) to stemming: 
reduction and expansion.

reduction can either be algorithmic, or morphologically/dictionary based.  
expansion esentially has to be morphologically based.

The SnowballAnalyzer uses the Snowball stemming algorithm, which is a 
reduction approach to stemming.  if you want an expansion based appraoch, 
you have to have a dictionary.  Lucene doens't provide one of these, but 
that doens't mean you can't use one if you find one -- it's been a while 
since i looked but the only ones i've ever seen were only commercially 
avialable.

http://en.wikipedia.org/wiki/Stemming




-Hoss


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message