From: Jason Rutherglen
To: java-user@lucene.apache.org, solr-user@lucene.apache.org
Date: Thu, 4 Feb 2010 14:16:41 -0800
Subject: Re: Analyzer for stripping non alpha-numeric characters?
Message-ID: <85d3c3b61002041416t30770bd6r3747fcdb5e0a749d@mail.gmail.com>
In-Reply-To: <2D127F11DC79714E9B6A43AC9458147F36661F8E@suex07-mbx-03.ad.syr.edu>

Transferred partially to solr-user...

Steven, thanks for the reply!

I wonder if PatternReplaceFilter can output multiple tokens? I'd like
to progressively strip the non-alphanums, for example output:

apple!&*
apple!&
apple!
apple

On Thu, Feb 4, 2010 at 12:18 PM, Steven A Rowe wrote:
> Hi Jason,
>
> Solr's PatternReplaceFilter(ts, "\\P{Alnum}+$", "", false) should work, chained after an appropriate tokenizer.
>
> Steve
>
> On 02/04/2010 at 12:18 PM, Jason Rutherglen wrote:
>> Is there an analyzer that easily strips non alpha-numeric from the end
>> of a token?
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
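
For reference, here is a minimal sketch of the two behaviours discussed in this thread, using only java.util.regex rather than the Solr filter itself. The class name TrailingStripSketch and the helpers stripTrailing and progressiveVariants are hypothetical names chosen for illustration; only the pattern "\\P{Alnum}+$" comes from the thread. It assumes the token is already a plain String and does not show how the variants would be emitted from a token filter.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

public class TrailingStripSketch {
    // Pattern quoted in the thread: a run of non-alphanumerics at the end of the input.
    private static final Pattern TRAILING_NON_ALNUM = Pattern.compile("\\P{Alnum}+$");

    // Hypothetical helper: remove the whole trailing run in one step.
    static String stripTrailing(String token) {
        return TRAILING_NON_ALNUM.matcher(token).replaceFirst("");
    }

    // Hypothetical helper: the original token, then one variant per trailing
    // character removed, ending at the fully stripped form.
    // Works on UTF-16 chars and never emits an empty string.
    static List<String> progressiveVariants(String token) {
        int stop = Math.max(stripTrailing(token).length(), 1);
        List<String> variants = new ArrayList<>();
        for (int end = token.length(); end >= stop; end--) {
            variants.add(token.substring(0, end));
        }
        return variants;
    }

    public static void main(String[] args) {
        System.out.println(stripTrailing("apple!&*"));
        for (String variant : progressiveVariants("apple!&*")) {
            System.out.println(variant);
        }
    }
}
```

For the token "apple!&*" the second helper yields apple!&*, apple!&, apple! and apple, in that order, which is the progression asked for above. Emitting these as separate tokens from an analysis chain would need a custom filter, which is not shown here.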