Return-Path: X-Original-To: apmail-lucene-dev-archive@www.apache.org Delivered-To: apmail-lucene-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5EED5E6A8 for ; Mon, 11 Mar 2013 12:54:00 +0000 (UTC) Received: (qmail 27181 invoked by uid 500); 11 Mar 2013 12:52:08 -0000 Delivered-To: apmail-lucene-dev-archive@lucene.apache.org Received: (qmail 26320 invoked by uid 500); 11 Mar 2013 12:52:08 -0000 Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@lucene.apache.org Delivered-To: mailing list dev@lucene.apache.org Received: (qmail 22007 invoked by uid 99); 11 Mar 2013 12:49:12 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 11 Mar 2013 12:49:12 +0000 Date: Mon, 11 Mar 2013 12:49:12 +0000 (UTC) From: "Uwe Schindler (JIRA)" To: dev@lucene.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (LUCENE-4822) Add PatternKeywordTokenFilter to marks keywords based on regular expressions MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/LUCENE-4822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13598780#comment-13598780 ] Uwe Schindler commented on LUCENE-4822: --------------------------------------- In addition: CharArraySet allows to do "contains(CharSequence)", so we dont need to pass the array directly. contains(termAtt) is enough to check for existence in the set. If you dont want to remove the parameter of isKeyword, make it CharSequence. > Add PatternKeywordTokenFilter to marks keywords based on regular expressions > ---------------------------------------------------------------------------- > > Key: LUCENE-4822 > URL: https://issues.apache.org/jira/browse/LUCENE-4822 > Project: Lucene - Core > Issue Type: Improvement > Components: modules/analysis > Affects Versions: 4.2 > Reporter: Simon Willnauer > Priority: Minor > Fix For: 5.0, 4.3 > > Attachments: LUCENE-4822.patch > > > today we need to pass in an explicit set of terms that we want to marks as keywords. It might make sense to allow patterns as well to prevent certain suffixes etc. to be keyworded. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org For additional commands, e-mail: dev-help@lucene.apache.org