Return-Path: X-Original-To: apmail-lucenenet-user-archive@www.apache.org Delivered-To: apmail-lucenenet-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 6153D10969 for ; Sat, 7 Mar 2015 17:13:34 +0000 (UTC) Received: (qmail 45179 invoked by uid 500); 7 Mar 2015 17:13:34 -0000 Delivered-To: apmail-lucenenet-user-archive@lucenenet.apache.org Received: (qmail 45126 invoked by uid 500); 7 Mar 2015 17:13:34 -0000 Mailing-List: contact user-help@lucenenet.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@lucenenet.apache.org Delivered-To: mailing list user@lucenenet.apache.org Received: (qmail 45114 invoked by uid 99); 7 Mar 2015 17:13:33 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 07 Mar 2015 17:13:33 +0000 X-ASF-Spam-Status: No, hits=2.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS,TVD_FROM_1 X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of itamar.synhershko@gmail.com designates 209.85.214.182 as permitted sender) Received: from [209.85.214.182] (HELO mail-ob0-f182.google.com) (209.85.214.182) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 07 Mar 2015 17:13:09 +0000 Received: by obbnt9 with SMTP id nt9so1800423obb.12 for ; Sat, 07 Mar 2015 09:11:37 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:date:message-id:subject :from:to:content-type; bh=a6M5ctJ0AuP/NYYfrzE0mxXul3Rz6CnzyZAsZlwijCI=; b=B6l3kDAdkNRAGV5gU3roZiSwwLsAW8yU+yt9tKl7TVlKl5o2krkGlF0gHEKClrHsXG Tx9aX/kGEAD9JqxxkdiciGFcASqdCJjaDazELIT1RXR1xDtkyHpD8sAb1OZ5OdofAh4p AIaD9lM/0wwo+yvxjkuHJWQi7gNKGLEsmPMEFc7eNTkByMYW5vfEokgDqAOLLy85i291 3DSJjE2IiuzG9/h3WIjLO4P1aOuWH9gpzymZJ3S1gMcG2gXhXupYf7FCeG75ZJdU4eZj 9A1MPnsIbb9vtr3kDv18vdWYBlqJykj3wuNHrXgJifqwpSlaFJmK1FUU0KkruOySQc6w iLMQ== MIME-Version: 1.0 X-Received: by 10.182.133.68 with SMTP id pa4mr15442838obb.87.1425748297436; Sat, 07 Mar 2015 09:11:37 -0800 (PST) Sender: itamar.synhershko@gmail.com Received: by 10.182.197.98 with HTTP; Sat, 7 Mar 2015 09:11:37 -0800 (PST) In-Reply-To: References: Date: Sat, 7 Mar 2015 19:11:37 +0200 X-Google-Sender-Auth: q3N9kmt1J4Gbl9roMKUpXN2SV4A Message-ID: Subject: Re: Using French Analyzer From: Itamar Syn-Hershko To: user@lucenenet.apache.org Content-Type: multipart/alternative; boundary=e89a8ff1ce1eced5ab0510b5e45d X-Virus-Checked: Checked by ClamAV on apache.org --e89a8ff1ce1eced5ab0510b5e45d Content-Type: text/plain; charset=UTF-8 I'm going to suggest you wait for 4.8.0, Analyzers provided with that release will work properly. -- Itamar Syn-Hershko http://code972.com | @synhershko Freelance Developer & Consultant Lucene.NET committer and PMC member On Thu, Mar 5, 2015 at 8:36 PM, Shubhanshu Pathak < shubhanshupathak30@gmail.com> wrote: > Dear Group Members, > > I am using Lucene.Net 3.0.3 > > In one of my projects I have to do language based analysis. > > When I was trying to use already in place analyzer for the French language > FrenchAnalyzer, I came to know the fact that internally it uses > FrenchStemFilter. > The documentation of this Filter says that "Don't use me" - > > This stemmer does not implement the Snowball algorithm correctly, > > especially involving case problems. > It is recommended that you consider using the "French" stemmer in the > snowball package instead. > This stemmer will likely be deprecated in a future release. > > This means I should not use this Analyzer. > > Then I tried using on SnowballAnalyzer. It provides me a way to do > linguistic > analysis through > > Analyzer analyzer = new SnowballAnalyzer(Version.LUCENE_30, "French"); > > Now when I look at the code of the SnowballAnalyzer - > > In it's constructor it invokes a method > > SetOverridesTokenStreamMethod(); > > of the base class Analyzer. > This method is already marked as obsolete. > > [Obsolete("This is only present to preserve back-compat of classes that > subclass a core analyzer and override tokenStream but not > reusableTokenStream ")] > protected internal virtual void SetOverridesTokenStreamMethod() > > This means we can not use the SnowballAnalyzer as well for a long run. > > > So kindly let me know how to achieve the linguistic analysis in such cases > apart from building our own Analyzer. > > Thanks & Regards, > Shubhanshu > --e89a8ff1ce1eced5ab0510b5e45d--