Message-ID: <46643553.9050205@syr.edu>
Date: Mon, 04 Jun 2007 11:52:51 -0400
From: Steven Rowe
To: java-dev@lucene.apache.org
Subject: Re: using a french specific analyser without stemming

Bonjour Mathieu,

Mathieu Lecarme wrote:
> For a project with a lot of Lucene search (via Compass), I had some
> troubles with stemming.
> Stemming is nice for enlarge search range, but
> make completion strange.
> So FrenchAnalyzer was not usable. A simpler StandardAnalyzer makes the
> job right, except for some french speciality, like elision. In french
> "the plane" is translated by "l'avion" and not "le avion", and the
> StandardTokenizer, used by StandardFilter can't tokenize it right. So, I
> make a specific filter (ElisionFilter), how can I give it to Lucene?
> With a Jira ticket, with the mailing list?

Here's a good place to start:

http://wiki.apache.org/jakarta-lucene/HowToContribute

FYI, 99% of modifications/additions to Lucene begin life as JIRA issues.

-- 
Steve Rowe
Center for Natural Language Processing
http://www.cnlp.org/tech/lucene.asp
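
The elision handling described in the quoted message can be illustrated with a minimal sketch in plain Java. This is not the ElisionFilter from the thread and does not use Lucene's API: the class name ElisionSketch, the method stripElision, and the list of elided articles are all assumptions made for illustration. It only shows the idea of dropping a leading elided article so that "l'avion" is indexed as "avion".

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

public class ElisionSketch {
    // Hypothetical list of elided French articles and particles.
    // This list is an assumption, not taken from the thread.
    private static final Set<String> ARTICLES =
        new HashSet<>(Arrays.asList("l", "d", "j", "m", "t", "s", "n", "c", "qu"));

    /**
     * Returns the token without a leading elided article.
     * Tokens whose prefix is not a known article are returned unchanged.
     */
    public static String stripElision(String token) {
        int apos = token.indexOf('\'');
        // Require at least one character on each side of the apostrophe.
        if (apos > 0 && apos < token.length() - 1) {
            String prefix = token.substring(0, apos).toLowerCase(Locale.ROOT);
            if (ARTICLES.contains(prefix)) {
                return token.substring(apos + 1);
            }
        }
        return token;
    }

    public static void main(String[] args) {
        System.out.println(stripElision("l'avion"));
        System.out.println(stripElision("aujourd'hui"));
    }
}
```

In a real analyzer this step would sit after tokenization as a token filter; here it is a standalone function so the behaviour can be checked without any Lucene dependency.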