Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 29954 invoked from network); 13 Jan 2011 04:06:33 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 13 Jan 2011 04:06:33 -0000 Received: (qmail 93270 invoked by uid 500); 13 Jan 2011 04:06:32 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 92746 invoked by uid 500); 13 Jan 2011 04:06:27 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 92738 invoked by uid 99); 13 Jan 2011 04:06:26 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 13 Jan 2011 04:06:26 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=10.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,RFC_ABUSE_POST,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of rcmuir@gmail.com designates 209.85.214.48 as permitted sender) Received: from [209.85.214.48] (HELO mail-bw0-f48.google.com) (209.85.214.48) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 13 Jan 2011 04:06:21 +0000 Received: by bwz8 with SMTP id 8so1329794bwz.35 for ; Wed, 12 Jan 2011 20:05:59 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:in-reply-to:references:from:date :message-id:subject:to:content-type:content-transfer-encoding; bh=uj59HhqLsxl4LAIgO2t+m8IlB72wXZ4C8iwdS5PTp1k=; b=O61aZl0JwXUsRDo+9dtT71umOUJzJV30EFTxizLAbKzvrZOZS9N0LMtDa34oFVONck bJYR73jPpobt7sDO/QLuiApXMMlvfa25lR72aPw7M/FCFD/sNh/KqMjXNyIZDi57iM30 ijwRUHT9wgmnvmwAfXQ3uoVNk8kOpfbHATvno= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; b=E7e9PVMJ6SdBjVKpLwzx4nV0OgroiVGe24D4v+g56J3Bpy5urI/9TS2H/G7XpeEQh5 1LS3b/rOkzI/KkhiB1M7N7IC5ZnKdjh2oJfNrO2MaSbSwpP3qiuqXpxuwpY4vAiyCtNb QUeFrgfQbrFjq1alv0mXmlZJ0JFQ8XPtV+St8= Received: by 10.204.113.9 with SMTP id y9mr1357771bkp.201.1294891559329; Wed, 12 Jan 2011 20:05:59 -0800 (PST) MIME-Version: 1.0 Received: by 10.204.80.85 with HTTP; Wed, 12 Jan 2011 20:05:39 -0800 (PST) In-Reply-To: <4D2E73A3.2020408@member.fsf.org> References: <4D2E73A3.2020408@member.fsf.org> From: Robert Muir Date: Wed, 12 Jan 2011 23:05:39 -0500 Message-ID: Subject: Re: "or" as a search term To: java-user@lucene.apache.org Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable On Wed, Jan 12, 2011 at 10:38 PM, Benoit Mercier wrote: > Hi, > > I am happily using Lucene for several years to offer French lexical analy= sis > tools to university researchers. =C2=A0 Today, one of them decided to ana= lyze the > use of the French word "or" (meaning "gold" in French) in one of my corpu= s > powered by Lucene... =C2=A0And, as you probably already guessed, no resul= ts... > What analyzer are you using? By default, StandardAnalyzer and StopAnalyzer uses a set of english stopwords. For french, this list is probably not appropriate. If you look at the javadocs, you can pass in your own set of stopwords... for lexical analysis maybe this should be an empty set. --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org