Return-Path: X-Original-To: apmail-lucene-dev-archive@www.apache.org Delivered-To: apmail-lucene-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 9DEDADB2F for ; Fri, 8 Mar 2013 11:02:16 +0000 (UTC) Received: (qmail 38811 invoked by uid 500); 8 Mar 2013 11:02:15 -0000 Delivered-To: apmail-lucene-dev-archive@lucene.apache.org Received: (qmail 38725 invoked by uid 500); 8 Mar 2013 11:02:14 -0000 Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@lucene.apache.org Delivered-To: mailing list dev@lucene.apache.org Received: (qmail 38533 invoked by uid 99); 8 Mar 2013 11:02:13 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 08 Mar 2013 11:02:13 +0000 Date: Fri, 8 Mar 2013 11:02:13 +0000 (UTC) From: "Commit Tag Bot (JIRA)" To: dev@lucene.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (LUCENE-4817) Add KeywordRepeaterFilter to emit tokens twice once as keyword and once not as keyword MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/LUCENE-4817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13597017#comment-13597017 ] Commit Tag Bot commented on LUCENE-4817: ---------------------------------------- [branch_4x commit] Simon Willnauer http://svn.apache.org/viewvc?view=revision&revision=1454317 LUCENE-4817: Add KeywordRepeaterFilter to emit tokens twice once as keyword and once not as keyword > Add KeywordRepeaterFilter to emit tokens twice once as keyword and once not as keyword > -------------------------------------------------------------------------------------- > > Key: LUCENE-4817 > URL: https://issues.apache.org/jira/browse/LUCENE-4817 > Project: Lucene - Core > Issue Type: Improvement > Components: modules/analysis > Affects Versions: 4.1 > Reporter: Simon Willnauer > Priority: Minor > Fix For: 5.0, 4.3 > > Attachments: LUCENE-4817.patch, LUCENE-4817.patch > > > if you want to have a stemmed and an unstemmed version of a token one for recall and one for precision you have to do two fields today in most of the cases. Yet, most of the stemmers respect the keyword attribute so we could add a token filter that emits the same token twice once as keyword and once plain. Folks would most likely need to combine this RemoveDuplicatesTokenFilter but that way we can have stemmed and unstemmed version in the same field. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org For additional commands, e-mail: dev-help@lucene.apache.org