From: Karl Wettin
To: java-user@lucene.apache.org
Subject: Re: Indexing puncuation and symbols
Date: Mon, 1 Oct 2007 15:37:18 +0200
In-Reply-To: <4700F72B.1010609@propylon.com>
Message-Id: <0C09F3B2-8C01-4C61-970E-B4673971B526@gmail.com>

On 1 Oct 2007, at 15:33, John Byrne wrote:

> Has anyone written an analyzer that preserves punctuation and
> symbols ("£", "$", "%" etc.) as tokens?

WhitespaceAnalyzer? It splits on whitespace only, so symbols survive, though they stay attached to whatever characters they touch. You could also extend the lexical rules of StandardAnalyzer.

--
karl
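To make the difference between the two suggestions concrete, here is a minimal sketch in plain Java with no Lucene dependency. The class and method names (TokenizerSketch, whitespaceTokens, symbolAwareTokens) are hypothetical and only for illustration. The first method mimics what splitting on whitespace does; the second is an assumption about what an extended lexical rule could look like, emitting each symbol as its own token. It is not StandardAnalyzer's actual grammar.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not part of Lucene.
final class TokenizerSketch {

    // Splits on runs of whitespace and nothing else, so "$5" and "50%"
    // come through as single tokens with the symbol still attached.
    static List<String> whitespaceTokens(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.trim().split("\\s+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Keeps runs of letters and digits together and emits every other
    // non-whitespace character as a separate one-character token.
    static List<String> symbolAwareTokens(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder word = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetterOrDigit(c)) {
                word.append(c);
                continue;
            }
            // Any non-alphanumeric character ends the current word.
            if (word.length() > 0) {
                tokens.add(word.toString());
                word.setLength(0);
            }
            // Whitespace is dropped; symbols become tokens of their own.
            if (!Character.isWhitespace(c)) {
                tokens.add(String.valueOf(c));
            }
        }
        if (word.length() > 0) {
            tokens.add(word.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        String text = "save 50% on $5 items";
        System.out.println(whitespaceTokens(text));
        System.out.println(symbolAwareTokens(text));
    }
}
```

Which behaviour is preferable depends on the queries: with the whitespace approach a search for "$" alone would not match "$5", while the symbol-as-token approach lets "$" and "5" be matched independently.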