lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jimi Hullegård <jimi.hulleg...@mogul.com>
Subject RE: Lucene search fails for japanese characters in URL
Date Wed, 17 Sep 2008 14:49:36 GMT
What webserver are you using? For example, with Tomcat, it could be because of the setting
URIEncoding in server.xml.

http://tomcat.apache.org/tomcat-5.5-doc/config/http.html

/Jimi

mogul | jimi hullegård | system developer | hudiksvallsgatan 4, 113 30 stockholm sweden |
+46 8 506 66 172 | +46 765 27 19 55 | jimi.hullegard@mogul.com | www.mogul.com


> -----Original Message-----
> From: anandsarwade [mailto:anand.sarwade@corp.aol.com]
> Sent: den 17 september 2008 16:42
> To: java-user@lucene.apache.org
> Subject: Lucene search fails for japanese characters in URL
>
>
> Hi ,
>
> I am facing below problem. Please help me in this.
>
> I have integrated CJK Analyzer for Japanese characters. I am
> able to save
> japanese double byte characters in mysql database in UTF-8
> format without
> issues. I could that data is getted indexed. Now when i
> search the Japanese
> characters which were indexed using the URL below , returns
> empty results.
>
> http://xml.demo.myaol.jp:8082/portal/gallery-search?first=1&ma
> x=100&cap=言語
>
> Noticed that the above url gets converted to the following
> URL having some
> HTML encoded strings in search.
>
> http://xml.demo.myaol.jp:8082/portal/gallery-search?first=1&ma
> x=100&cap=%E8%A8%80%E8%AA%9E
>
> This does not match with the existing lucene indexes
> henceforth returns
> empty results.  How do i solve this lucene search issue
> having japanese
> words in URLs.? Is there any way to convert such characters
> back to Japanese
> words???
>
> Any help/suggestions in this regards is highly appreciated.
>
> Thanks in Advance.
>
> Regards,
> Anand
>
> --
> View this message in context:
> http://www.nabble.com/Lucene-search-fails-for-japanese-charact
> ers-in-URL-tp19533647p19533647.html
> Sent from the Lucene - Java Users mailing list archive at Nabble.com.
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>
Mime
View raw message