From java-user-return-22489-apmail-lucene-java-user-archive=lucene.apache.org@lucene.apache.org Thu Aug 03 14:21:51 2006 Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 24381 invoked from network); 3 Aug 2006 14:21:51 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 3 Aug 2006 14:21:51 -0000 Received: (qmail 81260 invoked by uid 500); 3 Aug 2006 14:21:44 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 81230 invoked by uid 500); 3 Aug 2006 14:21:44 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 81218 invoked by uid 99); 3 Aug 2006 14:21:43 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 03 Aug 2006 07:21:43 -0700 X-ASF-Spam-Status: No, hits=0.5 required=10.0 tests=DNS_FROM_RFC_ABUSE,HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (asf.osuosl.org: domain of leandro.saad@gmail.com designates 72.14.214.196 as permitted sender) Received: from [72.14.214.196] (HELO hu-out-0102.google.com) (72.14.214.196) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 03 Aug 2006 07:21:42 -0700 Received: by hu-out-0102.google.com with SMTP id 32so1274974huf for ; Thu, 03 Aug 2006 07:21:21 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=BT4XvAHFHhQdp8d6KrDzKJEsuIzHSj7QeeOREvWcjkK3sgpmRb4NeDkxTd8kUA2IFWU4WPlTlPZRZgXQfpxt0S3IBZKMt/TtT+dxpGgglpISfZzBDAY6a5P7Te469cLx4in+TGEexpwupdWAp0s9gee9s5vMm1fQv6s71ZwAqQk= Received: by 10.78.175.14 with SMTP id x14mr753314hue; Thu, 03 Aug 2006 07:21:21 -0700 (PDT) Received: by 10.78.68.11 with HTTP; Thu, 3 Aug 2006 07:21:20 -0700 (PDT) Message-ID: <27bc81750608030721r61aa377xa55b102a063f8382@mail.gmail.com> Date: Thu, 3 Aug 2006 11:21:20 -0300 From: "Leandro Saad" To: java-user@lucene.apache.org Subject: Re: Search with accents In-Reply-To: MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_6987_8930386.1154614880906" References: <8220B8D6B56A9A4E8480489F38E42529E17CDC@rw-msg-02.broadvision.com> <27bc81750608021032p5e5cbed4q4ebd41e0af4aadf2@mail.gmail.com> X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N ------=_Part_6987_8930386.1154614880906 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: quoted-printable Content-Disposition: inline I'm using StandardAnalyser all over, so, yes, portuguese stopwords won't be eliminated --=20 Leandro Rodrigo Saad Cruz CTO - InterBusiness Technologies db.apache.org/ojb guara-framework.sf.net xingu.sf.net On 8/2/06, Eduardo S. Cordeiro wrote: > > But was your index created with BrazilianAnalyzer? Because otherwise > you wouldn't have portuguese stopwords eliminated, like "e", "ou", > etc. > > 2006/8/2, Leandro Saad : > > Hi Eduardo. I'm using the StandardAnalyser and I can search for words > with > > accents. In my case "sa=FAde" > > > > -- > > Leandro Rodrigo Saad Cruz > > CTO - InterBusiness Technologies > > db.apache.org/ojb > > guara-framework.sf.net > > xingu.sf.net > > > > On 8/1/06, Eduardo S. Cordeiro wrote: > > > > > > Yes...here's how I create my QueryParser: > > > > > > QueryParser parser =3D new QueryParser("text", new BrazilianAnalyzer(= )); > > > > > > 2006/8/1, Zhang, Lisheng : > > > > Hi, > > > > > > > > Have you used the same BrazilianAnalyzer when > > > > searching? > > > > > > > > Best regards, Lisheng > > > > > > > > -----Original Message----- > > > > From: Eduardo S. Cordeiro [mailto:escordeiro@gmail.com] > > > > Sent: Tuesday, August 01, 2006 1:40 PM > > > > To: java-user@lucene.apache.org > > > > Subject: Search with accents > > > > > > > > > > > > Hello there, > > > > > > > > I have a brazilian portuguese index, which has been analyzed with > > > > BrazilianAnalyzer. When searching words with accents, however, > they're > > > > not found -- for instance, if the index contains some text with the > > > > word "ma=E7=E3" and I search for that very word, I get no hits, but= if I > > > > search "maca" (which is another portuguese word) then the document > > > > containing "ma=E7=E3" is found. > > > > > > > > I've seen posts in the archive indicating that I should use > > > > ISOLatin1AccentFilter to handle this, but I don't quite see how: > > > > should I leave indexation as it is and use this filter only for > search > > > > queries or should I apply it in both cases? > > > > > > > > Thank you, > > > > Eduardo Cordeiro > > > > > > > > > --------------------------------------------------------------------- > > > > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > > > > For additional commands, e-mail: java-user-help@lucene.apache.org > > > > > > > > > > > > > > > > ------=_Part_6987_8930386.1154614880906--