Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 58884 invoked from network); 6 Aug 2008 12:56:51 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 6 Aug 2008 12:56:51 -0000 Received: (qmail 61024 invoked by uid 500); 6 Aug 2008 12:56:43 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 60988 invoked by uid 500); 6 Aug 2008 12:56:42 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 60977 invoked by uid 99); 6 Aug 2008 12:56:42 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 06 Aug 2008 05:56:42 -0700 X-ASF-Spam-Status: No, hits=2.6 required=10.0 tests=DNS_FROM_OPENWHOIS,SPF_HELO_PASS,SPF_PASS,WHOIS_MYPRIVREG X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of lists@nabble.com designates 216.139.236.158 as permitted sender) Received: from [216.139.236.158] (HELO kuber.nabble.com) (216.139.236.158) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 06 Aug 2008 12:55:46 +0000 Received: from isper.nabble.com ([192.168.236.156]) by kuber.nabble.com with esmtp (Exim 4.63) (envelope-from ) id 1KQiYn-0000cD-74 for java-user@lucene.apache.org; Wed, 06 Aug 2008 05:56:13 -0700 Message-ID: <18850615.post@talk.nabble.com> Date: Wed, 6 Aug 2008 05:56:13 -0700 (PDT) From: Christophe from paris To: java-user@lucene.apache.org Subject: Re: search with accent not match In-Reply-To: <489998A6.7040005@gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Nabble-From: zlink_fr@yahoo.fr References: <18848522.post@talk.nabble.com> <489998A6.7040005@gmail.com> X-Virus-Checked: Checked by ClamAV on apache.org Actualy in my FrenchAnalyser=20 i have : TokenStream result =3D new StandardTokenizer(reader); result =3D new StandardFilter(result); result =3D new StopFilter(result, stoptable); result =3D new FrenchStemFilter(result, excltable); result =3D new LowerCaseFilter(result); I can use ISOLatin1AccentFilter in this Class for indexing ans search ? And it is the case where ? markrmiller wrote: >=20 > Check out org.apache.lucene.analysis.ISOLatin1AccentFilter >=20 > It will strip diacritics - just be sure to use it at index time and=20 > query time to get what you want. Also, you will no longer be able to=20 > differentiate between the two in your searching (rarely that important=20 > in my opinion, but others certainly disagree). >=20 > - Mark >=20 > Christophe from paris wrote: >> Hello >> >> I'm use FrenchAnalyzer for index=20 >> >> IndexWriter writer =3D new IndexWriter(pathOfIndex, new FrenchAnalyzer()= , >> true); >> Document =3D new Document(); >> doc.add(new >> Field("TXT_CHARACT_VALUE",word.toLowerCase(),Field.Store.YES,Field.Index= .TOKENIZED)); >> writer.addDocument(doc); >> >> And search >> >> IndexReader reader =3D IndexReader.open(pathOfIndex);=09=09=09 >> Searcher searcher =3D new IndexSearcher(reader); >> Analyzer analyzer =3D new FrenchAnalyzer();=09=09=09=09=09=09 >> QueryParser parser =3D new QueryParser(field, analyzer);=09=09=09=09=09 >> Query query =3D parser.parse(motRecherche); >> Hits hits =3D searcher.search(query); >> >> in my document i have the word "lumiere" and "lumi=C3=A8re" >> >> when i search lumi=C3=A8re only document match lumi=C3=A8re but "lumiere= " is not >> return >> >> and if search "lumiere" the result is lumiere, lumieres ,lumi=C3=A9re,lu= mi=C3=A9res >> but not lumi=C3=A8re >> >> for a total match i must search "lumiere OR limi=C3=A8re" >> but is not the best solution=20 >> =20 >=20 >=20 > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org >=20 >=20 >=20 --=20 View this message in context: http://www.nabble.com/search-with-accent-not-= match-tp18848522p18850615.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org