From: anandsarwade
To: java-user@lucene.apache.org
Subject: Lucene search fails for japanese characters in URL
Date: Wed, 17 Sep 2008 07:41:34 -0700 (PDT)

Hi,

I am facing the problem below. Please help me with it.

I have integrated the CJK Analyzer for Japanese characters. I am able to save Japanese double-byte characters in a MySQL database in UTF-8 format without issues, and I can see that the data is getting indexed.

Now, when I search for the indexed Japanese characters using the URL below, it returns empty results:

http://xml.demo.myaol.jp:8082/portal/gallery-search?first=1&max=100&cap=言語

I noticed that the above URL gets converted to the following URL, in which the search string is URL-encoded (percent-encoded):

http://xml.demo.myaol.jp:8082/portal/gallery-search?first=1&max=100&cap=%E8%A8%80%E8%AA%9E

This does not match the existing Lucene index and hence returns empty results.

How do I solve this Lucene search issue with Japanese words in URLs? Is there any way to convert such characters back to Japanese words?

Any help or suggestions in this regard are highly appreciated.

Thanks in advance.

Regards,
Anand
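
For illustration, here is a minimal sketch of turning the percent-encoded `cap` value from the URL above back into Japanese text before it is handed to the search code. It assumes the bytes are UTF-8, which matches the encoded value shown in the post. The class name `CapParamDecoder` and the method `decodeUtf8` are hypothetical and are not part of Lucene or of the application described. If the application runs inside a servlet container, the container may already decode query parameters, possibly with a different charset; the post does not say, so treat that as something to check rather than a diagnosis.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLDecoder;

public class CapParamDecoder {

    // Hypothetical helper: decodes a percent-encoded query value,
    // interpreting the decoded bytes as UTF-8 (an assumption).
    static String decodeUtf8(String raw) {
        try {
            return URLDecoder.decode(raw, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            // Every Java platform is required to support UTF-8.
            throw new IllegalStateException("UTF-8 not supported", e);
        }
    }

    public static void main(String[] args) {
        // The encoded value of the "cap" parameter from the post.
        String raw = "%E8%A8%80%E8%AA%9E";
        String decoded = decodeUtf8(raw);

        // Compare against the two expected characters (U+8A00, U+8A9E)
        // instead of printing them, since console encodings vary.
        System.out.println(decoded.equals("\u8A00\u8A9E")); // prints "true"
    }
}
```

The decoded string would then be passed through the same analyzer at query time that was used at index time, so that the query terms are tokenized the same way as the indexed text.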