lucene-dev mailing list archives

From "Commit Tag Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (LUCENE-4817) Add KeywordRepeaterFilter to emit tokens twice once as keyword and once not as keyword
Date Fri, 08 Mar 2013 10:38:12 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-4817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13597000#comment-13597000 ]

Commit Tag Bot commented on LUCENE-4817:
----------------------------------------

[trunk commit] Simon Willnauer
http://svn.apache.org/viewvc?view=revision&revision=1454313

LUCENE-4817: Add KeywordRepeaterFilter to emit tokens twice once as keyword and once not as keyword

                
> Add KeywordRepeaterFilter to emit tokens twice once as keyword and once not as keyword
> --------------------------------------------------------------------------------------
>
>                 Key: LUCENE-4817
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4817
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 4.1
>            Reporter: Simon Willnauer
>            Priority: Minor
>             Fix For: 5.0, 4.3
>
>         Attachments: LUCENE-4817.patch, LUCENE-4817.patch
>
>
> If you want both a stemmed and an unstemmed version of a token, one for recall and
> one for precision, today you need two fields in most cases. However, most stemmers
> respect the keyword attribute, so we could add a token filter that emits each token twice:
> once as a keyword and once plain. Users would most likely need to combine this with
> RemoveDuplicatesTokenFilter, but that way we can have the stemmed and unstemmed versions
> in the same field.
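
The idea in the description can be sketched in plain Java. This is a simulation only and does not use the Lucene API: the class KeywordRepeatSketch, its Token type, and the toy stemmer (which just strips a trailing "s") are hypothetical names introduced for illustration. It shows each term being emitted twice at the same position, a stemmer that skips keyword-flagged tokens, and a final pass that drops same-position duplicates when stemming changed nothing.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

/** Plain-Java simulation of the proposal; none of these types are Lucene classes. */
final class KeywordRepeatSketch {

    /** Minimal token: text, keyword flag, and position increment (0 = stacked on previous token). */
    static final class Token {
        final String text;
        final boolean keyword;
        final int posInc;

        Token(String text, boolean keyword, int posInc) {
            this.text = text;
            this.keyword = keyword;
            this.posInc = posInc;
        }
    }

    /** Emit every term twice: first flagged as keyword, then plain at the same position. */
    static List<Token> repeat(List<String> terms) {
        List<Token> out = new ArrayList<>();
        for (String term : terms) {
            out.add(new Token(term, true, 1));
            out.add(new Token(term, false, 0));
        }
        return out;
    }

    /** Toy stemmer (an assumption, not a real algorithm): strips a trailing "s", skipping keywords. */
    static List<Token> stem(List<Token> in) {
        List<Token> out = new ArrayList<>();
        for (Token t : in) {
            boolean strip = !t.keyword && t.text.length() > 3 && t.text.endsWith("s");
            String text = strip ? t.text.substring(0, t.text.length() - 1) : t.text;
            out.add(new Token(text, t.keyword, t.posInc));
        }
        return out;
    }

    /** Drop tokens whose text was already seen at the current position. */
    static List<Token> removeDuplicates(List<Token> in) {
        List<Token> out = new ArrayList<>();
        Set<String> seenAtPosition = new HashSet<>();
        for (Token t : in) {
            if (t.posInc > 0) {
                seenAtPosition.clear(); // a new position starts here
            }
            if (seenAtPosition.add(t.text)) {
                out.add(t);
            }
        }
        return out;
    }

    /** Full chain: repeat, stem, de-duplicate; returns the surviving token texts in order. */
    static List<String> analyze(List<String> terms) {
        List<String> out = new ArrayList<>();
        for (Token t : removeDuplicates(stem(repeat(terms)))) {
            out.add(t.text);
        }
        return out;
    }
}
```

For the input terms "dogs" and "run", the chain keeps both "dogs" and "dog" at the first position, while "run" appears only once because its stemmed and unstemmed forms are identical.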

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org

