Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 75D4010412 for ; Wed, 14 Jan 2015 13:46:56 +0000 (UTC) Received: (qmail 31866 invoked by uid 500); 14 Jan 2015 13:46:53 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 31793 invoked by uid 500); 14 Jan 2015 13:46:53 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 31781 invoked by uid 99); 14 Jan 2015 13:46:52 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 14 Jan 2015 13:46:52 +0000 X-ASF-Spam-Status: No, hits=2.8 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS,URI_HEX X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of jack.krupansky@gmail.com designates 209.85.212.172 as permitted sender) Received: from [209.85.212.172] (HELO mail-wi0-f172.google.com) (209.85.212.172) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 14 Jan 2015 13:46:27 +0000 Received: by mail-wi0-f172.google.com with SMTP id n3so28214870wiv.5 for ; Wed, 14 Jan 2015 05:46:26 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=RHGLfsIB9AHbibpuyXNWVwPtmtsX6EaGLOlJSHhaCjw=; b=LFhYUdbdzI6+4c4ZQPGsmbtJgPjJBYrek3gK34ljVBPe+orZvW1l9b+vxZ9GWNcZW5 xMzlJianJmgj0EJKluULztI7KyNBFpnbMH2UKk2wsoboX07ZxDbV2ZGt7roNBOLzRDun 03t8MJlpDqoA/vY3Z4jgmXCeVkDem9iZkY8oxdE/xm1dPmNWKLkJxCz0zUR6CCVzCFs1 jahH6ZKZTuhobXT3Ad4HSMVqPWTWijsTGkhIq5ICWe+St5DFLFWAIcLULIE4v6dgg3sC uAFgXJpVkV6CFkVhTBxL/NHzl5aHZJ9rvbE6PBWxxfTbAG3qBvRLN426FBygDjVssR3s dkXQ== MIME-Version: 1.0 X-Received: by 10.180.93.167 with SMTP id cv7mr9159533wib.12.1421243186307; Wed, 14 Jan 2015 05:46:26 -0800 (PST) Received: by 10.27.48.204 with HTTP; Wed, 14 Jan 2015 05:46:26 -0800 (PST) In-Reply-To: <1421242648500-4179496.post@n3.nabble.com> References: <1420804050681-4178346.post@n3.nabble.com> <1421158864172-4179173.post@n3.nabble.com> <1421237769138-4179477.post@n3.nabble.com> <1421242648500-4179496.post@n3.nabble.com> Date: Wed, 14 Jan 2015 08:46:26 -0500 Message-ID: Subject: Re: Tokenizer or Filter ? From: Jack Krupansky To: solr-user@lucene.apache.org Content-Type: multipart/alternative; boundary=f46d042697064266f4050c9cf74c X-Virus-Checked: Checked by ClamAV on apache.org --f46d042697064266f4050c9cf74c Content-Type: text/plain; charset=UTF-8 I was suspecting it might do that - the pattern is "greedy" and takes the longest matching pattern. Add a question mark after the asterisk to use stingy mode that matches the shortest pattern. -- Jack Krupansky On Wed, Jan 14, 2015 at 8:37 AM, tomas.kalas wrote: > I just used Solr UI Analyzer for my test, or must i indexed it firstly? > > I used this XML code in my schema: > > positionIncrementGap="100"> > > pattern="<d1>.*</d1>" replacement=""/> > > > > > This is my result: > > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Tokenizer-or-Filter-tp4178346p4179496.html > Sent from the Solr - User mailing list archive at Nabble.com. > --f46d042697064266f4050c9cf74c--