Return-Path: X-Original-To: apmail-lucy-user-archive@www.apache.org Delivered-To: apmail-lucy-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 894E99A76 for ; Wed, 13 Jun 2012 23:56:51 +0000 (UTC) Received: (qmail 27625 invoked by uid 500); 13 Jun 2012 23:56:51 -0000 Delivered-To: apmail-lucy-user-archive@lucy.apache.org Received: (qmail 27585 invoked by uid 500); 13 Jun 2012 23:56:51 -0000 Mailing-List: contact user-help@lucy.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@lucy.apache.org Delivered-To: mailing list user@lucy.apache.org Received: (qmail 27573 invoked by uid 99); 13 Jun 2012 23:56:51 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 13 Jun 2012 23:56:51 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=FSL_RCVD_USER,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of svasekar@listenlogic.com designates 209.85.212.181 as permitted sender) Received: from [209.85.212.181] (HELO mail-wi0-f181.google.com) (209.85.212.181) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 13 Jun 2012 23:56:46 +0000 Received: by wibhn14 with SMTP id hn14so1005026wib.4 for ; Wed, 13 Jun 2012 16:56:24 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:x-gm-message-state; bh=Diw8NOi1nSET02o9ngJZ15Fn+HUZcRzNBPd0A7+5I/A=; b=aAOyl4FHEvCbQS3HpFa15xGxEnBMsurNvStb57HJM4pf8nwshco9xQs1Ov75C3pCOj 1XQEmoWvYmDSYtR1acsmmaMAiSL2vrfqWBbOGyXKmo7PRr8ZkO0MbMNhdbtBRHXMv+qT Zn5oJEK2qwgIvYAHSZQTRVMHcJ/omnqoM1zN6lB7xSCTp9doqX7UJ98oynJB8ZbD0oiw rXFWq+TQVWLkN8JZMsVmpKpkikQMThYVdUSxgShBfEbUEjqBsHLRzrid3G1w0z2OTaaJ H4VvlohQGZ6B8ueGfSTQypBUFAsDWbWFCsSipH2B0IyOzVjp7CgMU7LSP777eKjQyBht 9mKg== MIME-Version: 1.0 Received: by 10.180.8.69 with SMTP id p5mr41624740wia.17.1339631784560; Wed, 13 Jun 2012 16:56:24 -0700 (PDT) Received: by 10.194.45.70 with HTTP; Wed, 13 Jun 2012 16:56:24 -0700 (PDT) In-Reply-To: <4FD8F02E.6080906@peknet.com> References: <4FD8F02E.6080906@peknet.com> Date: Wed, 13 Jun 2012 16:56:24 -0700 Message-ID: From: Saurabh Vasekar To: user@lucy.apache.org Content-Type: multipart/alternative; boundary=f46d044283b6a6083e04c2635456 X-Gm-Message-State: ALoCoQkmumnFOBVsaJOAMHAhx8GPEFdZ6Dq5VNI7EDR+xw8fusV/LzhT8WXfMrzH6NFelBeE5shS X-Virus-Checked: Checked by ClamAV on apache.org Subject: Re: [lucy-user] How to add more languages in an analyzer and change path to store indexed documents --f46d044283b6a6083e04c2635456 Content-Type: text/plain; charset=ISO-8859-1 Thanks a lot for your help! On Wed, Jun 13, 2012 at 12:55 PM, Peter Karman wrote: > Saurabh Vasekar wrote on 6/13/12 2:29 PM: > > Hello, > > > > I am a beginner to Lucy. This is the first time I am using a Search > > library. I went through the tutorial at lucy.apache.org. I am confused > over > > the following things mentioned in the tutorial. > > > > The tutorial mentions that we can specify the language in which the > > documents are. Hence while indexing how can I specify multiple languages > in > > the analyzers if my documents are in different languages. > > > > my $polyanalyzer = Lucy::Analysis::PolyAnalyzer->new( > > language => 'en', > > ) > > > > note that you likely don't want to specify multiple languages for a single > index, because the stemming (for example) rules applied will be > confused/confusing. I.e., Lucy doesn't do language *detection* -- it just > performs language-specific analysis based on the kind of documents you > hand to > the analyzer. > > > -- > Peter Karman . http://peknet.com/ . peter@peknet.com > --f46d044283b6a6083e04c2635456--