lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Simon Willnauer (JIRA)" <>
Subject [jira] [Commented] (LUCENE-3415) Snowball filter to include original word too
Date Tue, 06 Sep 2011 09:52:12 GMT


Simon Willnauer commented on LUCENE-3415:

instead of modifying snowball filter you could write a filter that buffers the term and emits
it twice. First you simply pass on the term and the second time you set KeywordAttribute#setKeyword(boolean)
to true. This will force the stemmer to ignore this term an pass it along the tokenstream
pipeline without modification. Would that solve your problem? I am not sure we should actually
provide such a filter but others have more insight into this, robert?

> Snowball filter to include original word too
> --------------------------------------------
>                 Key: LUCENE-3415
>                 URL:
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.3
>         Environment: All
>            Reporter: Manish
>              Labels: features
>             Fix For: 3.4, 4.0
> 1. Currently, snowball filter deletes the original word and adds the stemmed word to
the index. So, if i want to do search with / without stemming, i have to keep 2 fields, one
with stemming and one without it. 
> 2. Rather than doing this, if we have configurable item to preserve original, it would
solve more business problem. 
> 3. Using single field, i can do search using stemming / without stemming by changing
the query filters. 
> The same can also be done for phonetic filters too. 

This message is automatically generated by JIRA.
For more information on JIRA, see:


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message