Return-Path: Delivered-To: apmail-lucene-java-dev-archive@www.apache.org Received: (qmail 89948 invoked from network); 25 May 2007 10:37:43 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 25 May 2007 10:37:43 -0000 Received: (qmail 82874 invoked by uid 500); 25 May 2007 10:37:42 -0000 Delivered-To: apmail-lucene-java-dev-archive@lucene.apache.org Received: (qmail 82836 invoked by uid 500); 25 May 2007 10:37:42 -0000 Mailing-List: contact java-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-dev@lucene.apache.org Delivered-To: mailing list java-dev@lucene.apache.org Received: (qmail 82818 invoked by uid 99); 25 May 2007 10:37:42 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 25 May 2007 03:37:42 -0700 X-ASF-Spam-Status: No, hits=-100.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.4] (HELO brutus.apache.org) (140.211.11.4) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 25 May 2007 03:37:36 -0700 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 27F76714058 for ; Fri, 25 May 2007 03:37:16 -0700 (PDT) Message-ID: <31209477.1180089436123.JavaMail.jira@brutus> Date: Fri, 25 May 2007 03:37:16 -0700 (PDT) From: "Karl Wettin (JIRA)" To: java-dev@lucene.apache.org Subject: [jira] Created: (LUCENE-889) Standard tokenizer with punctuation output MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org Standard tokenizer with punctuation output ------------------------------------------ Key: LUCENE-889 URL: https://issues.apache.org/jira/browse/LUCENE-889 Project: Lucene - Java Issue Type: Improvement Affects Versions: 2.1 Reporter: Karl Wettin Priority: Trivial This patch adds punctuation (comma, period, question mark and exclamation point) tokens as output from the StandardTokenizer, and filters them out in the StandardFilter. (I needed them for text classification reasons.) -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. --------------------------------------------------------------------- To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org For additional commands, e-mail: java-dev-help@lucene.apache.org