Return-Path: Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: (qmail 81711 invoked from network); 3 Oct 2010 20:09:21 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 3 Oct 2010 20:09:21 -0000 Received: (qmail 57066 invoked by uid 500); 3 Oct 2010 20:09:18 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 56998 invoked by uid 500); 3 Oct 2010 20:09:17 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 56990 invoked by uid 99); 3 Oct 2010 20:09:17 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 03 Oct 2010 20:09:17 +0000 X-ASF-Spam-Status: No, hits=0.7 required=10.0 tests=RCVD_IN_DNSWL_NONE,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [209.191.84.215] (HELO web82102.mail.mud.yahoo.com) (209.191.84.215) by apache.org (qpsmtpd/0.29) with SMTP; Sun, 03 Oct 2010 20:09:09 +0000 Received: (qmail 90265 invoked by uid 60001); 3 Oct 2010 20:08:48 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=sbcglobal.net; s=s1024; t=1286136528; bh=qVZ3OvCoeen8RMTiy4OqVgBzXN7rUNzJbooi+Ns+sq4=; h=Message-ID:X-YMail-OSG:Received:X-Mailer:Date:From:Subject:To:In-Reply-To:MIME-Version:Content-Type:Content-Transfer-Encoding; b=gQ6r1vU2im3ur4g4w9aZ7sNGiThosO6CIdadip/XWlMmiS1uoRTautkvLKYCbr534bnTUtVVIB3EtETgJXHKYYgSNywpNUJEh8CV8497Q3YwJiqfRKdlF9W1j8k4wzWasMNBv2bn/bsMDrjR+r9ji1pYxfeGUHU4WjC42jUpxNc= DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=sbcglobal.net; h=Message-ID:X-YMail-OSG:Received:X-Mailer:Date:From:Subject:To:In-Reply-To:MIME-Version:Content-Type:Content-Transfer-Encoding; b=M0DgC9XkLAHrsoRu1F95XlQj/crXBYSWNb/pAZnLmJ3CflCH0txMtPTwBJPOHUwWQlkBVoUu8P4lrt4e9A4MpOFXN6OGmVXV0RZIGjRFB+lqBnHGe3LBYBt8sPOfDgBCma0fCestE0RI5c3x+jQ8WYRC7fNWjfxm5G166nzbcxk=; Message-ID: <301511.88410.qm@web82102.mail.mud.yahoo.com> X-YMail-OSG: jvCesVQVM1lMVjcyNp1qlZpf4Z_L4mAwgyiWhzQMOjGGDUi BnyRcDRWt27sKy88BU9OHtnA5SP2B6vLnGGOi0Sge8GUETxegSOlBUtiyN3v eynGK33wbqjvO64hYEBwj1vVkccH7.FeLmf6I1Wo.RTZpN7zgupRUKXVjas6 fCWu1Ut0Xp62GTLI1oN88CTO7cxeuN2s4yT.X0m46u9Og3VyCOzvNLXo3pNz nrzIFY8D0W0sN9DeAOT1Nga7CmcVSWtib3dI9UqJui3vBs_wTdek86JQ3SfE C9YCVevmFXT_LwTYorESsuYHkJtbu5A0MbopMx7_v5t5lJov36721RT23cTa XtQ-- Received: from [68.183.64.79] by web82102.mail.mud.yahoo.com via HTTP; Sun, 03 Oct 2010 13:08:48 PDT X-Mailer: YahooMailClassic/11.4.9 YahooMailWebService/0.8.105.279950 Date: Sun, 3 Oct 2010 13:08:48 -0700 (PDT) From: Dennis Gearon Subject: Re: NGramFilterFactory for auto-complete that matches the middle of multi-lingual tags? To: solr-user@lucene.apache.org In-Reply-To: <472828.26348.qm@web52906.mail.re2.yahoo.com> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org What's the difference between the filter/anayzers that have 'factory' in th= eir name, and the ones that don't?=0A=0A=0ADennis Gearon=0A=0ASignature War= ning=0A----------------=0AEARTH has a Right To Life,=0A otherwise we all d= ie.=0A=0ARead 'Hot, Flat, and Crowded'=0ALaugh at http://www.yert.com/film.= php=0A=0A=0A--- On Sun, 10/3/10, Ahmet Arslan wrote:=0A= =0A> From: Ahmet Arslan =0A> Subject: Re: NGramFilterFac= tory for auto-complete that matches the middle of multi-lingual tags?=0A> T= o: solr-user@lucene.apache.org=0A> Date: Sunday, October 3, 2010, 3:26 AM= =0A> > But I thought NGramFilterFactory=0A> would generate substrings=0A> >= that start in the "middle", hence ensuring=0A> autocomplete=0A> > matching= in the middle.=0A> > =0A> > So in the case of "electric guitar", keywordto= kenizer=0A> would=0A> > create one token - "electric guitar"=0A> > =0A> > N= GramFilterFactory would then take that one toke=0A> ("electric=0A> > guitar= ") and generate N-grams out of it. One of the=0A> ngrams=0A> > would be "gu= it" because "guit" is a substring of=0A> "electric=0A> > guitar".=0A> > =0A= > =0A> Ups. You are correct, I am sorry. I mixed it with=0A> *Edge*NGramFil= terFActory.=0A> =0A> =0A> =A0 =A0 =A0 =0A>