lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mathieu Lecarme <math...@garambrogne.net>
Subject Re: reg-ex based stop word removal
Date Wed, 22 Aug 2007 12:00:33 GMT
sandeep chawla a écrit :
> Hi  ,
>
> I am working on a search application . This application requires me to
> implement a stop filter
> using a stop word list. I have implemented a stop filter using lucene's API.
>
> I want to take my application one step further.
>
> I want to remove all the words which match a particular regular expression.
>
> For Ex- if my  expression is a?d. I should be able to remove words
> like  "and", "ad" ,"abd",
> "acd"  etc etc.
>
> I would appreciate any help/suggestions in this matter
>
> thanks
> sandeep
>
>   
implement a TokenFilter
http://lucene.zones.apache.org:8080/hudson/job/Lucene-Nightly/javadoc/org/apache/lucene/analysis/TokenFilter.html

M.

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message