Return-Path: Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: (qmail 6562 invoked from network); 6 Oct 2009 00:23:33 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 6 Oct 2009 00:23:33 -0000 Received: (qmail 34771 invoked by uid 500); 6 Oct 2009 00:23:31 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 34713 invoked by uid 500); 6 Oct 2009 00:23:31 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 34702 invoked by uid 99); 6 Oct 2009 00:23:31 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 06 Oct 2009 00:23:31 +0000 X-ASF-Spam-Status: No, hits=3.0 required=10.0 tests=HTML_MESSAGE,MIME_QP_LONG_LINE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of pranganathan@netflix.com designates 208.75.77.145 as permitted sender) Received: from [208.75.77.145] (HELO mx2.netflix.com) (208.75.77.145) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 06 Oct 2009 00:23:22 +0000 Received: from bmfilter01.netflix.com ([10.64.32.82]) by mx2.netflix.com (8.12.11.20060308/8.12.11) with ESMTP id n960JqJu002832 for ; Mon, 5 Oct 2009 17:19:52 -0700 X-AuditID: 0a402051-b7c7bae0000049b0-4a-4aca8d27d02f Received: from message.netflix.com (message.netflix.com [10.64.32.68]) by (Symantec Mail Security) with SMTP id D6.24.18864.72D8ACA4; Mon, 5 Oct 2009 17:19:51 -0700 (PDT) Received: from bigmamma.netflix.com ([10.64.32.75]) by message.netflix.com with Microsoft SMTPSVC(6.0.3790.3959); Mon, 5 Oct 2009 17:19:49 -0700 Received: from 10.2.177.166 ([10.2.177.166]) by bigmamma.netflix.com ([10.64.32.75]) with Microsoft Exchange Server HTTP-DAV ; Tue, 6 Oct 2009 00:19:25 +0000 User-Agent: Microsoft-Entourage/12.20.0.090605 Date: Mon, 05 Oct 2009 17:19:24 -0700 Subject: Re: Question about PatternReplace filter and automatic Synonym generation From: Prasanna Ranganathan To: "solr-user@lucene.apache.org" Message-ID: Thread-Topic: Question about PatternReplace filter and automatic Synonym generation Thread-Index: AcpDimTZH2f49+7Ga02/RLSslONPcACj/OQ0AAAT+LI= In-Reply-To: Mime-version: 1.0 Content-type: multipart/alternative; boundary="B_3337607964_12322635" X-OriginalArrivalTime: 06 Oct 2009 00:19:49.0802 (UTC) FILETIME=[B7AB20A0:01CA461A] X-Brightmail-Tracker: AAAAAA== X-Virus-Checked: Checked by ClamAV on apache.org --B_3337607964_12322635 Content-type: text/plain; charset="US-ASCII" Content-transfer-encoding: 7bit I just saw the reply from Shalin after sending this email. Kindly excuse. On 10/5/09 5:17 PM, "Prasanna Ranganathan" wrote: > > Can someone please give me some pointers to the questions in my earlier > email? And and every help is much appreciated. > > Regards, > > Prasanna. > > > On 10/2/09 11:01 AM, "Prasanna Ranganathan" wrote: > >> >> Does the PatternReplaceFilter have an option where you can keep the original >> token in addition to the modified token? From what I looked at it does not >> seem to but I want to confirm the same. >> >> Alternatively, is there a filter available which takes in a pattern and >> produces additional forms of the token depending on the pattern? The use case >> I am looking at here is using such a filter to automate synonym generation. >> In our application, quite a few of the synonym file entries match a specific >> pattern and having such a filter would make it easier I believe. Pl. do >> correct me in case I am missing some unwanted side-effect with this approach. >> >> Continuing on that line, what is the performance hit in having additional >> index-time filters as opposed to using a synonym file with more entries? How >> does the overhead of using a bigger synonym file as opposed to additional >> filters compare? >> >> Thanks in advance for the help. >> >> Regards, >> >> Prasanna. --B_3337607964_12322635--