lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From anandsarwade <>
Subject Lucene search fails for japanese characters in URL
Date Wed, 17 Sep 2008 14:41:34 GMT

Hi ,

I am facing below problem. Please help me in this.

I have integrated CJK Analyzer for Japanese characters. I am able to save 
japanese double byte characters in mysql database in UTF-8 format without
issues. I could that data is getted indexed. Now when i search the Japanese
characters which were indexed using the URL below , returns empty results.言語 
Noticed that the above url gets converted to the following URL having some
HTML encoded strings in search.
This does not match with the existing lucene indexes henceforth returns
empty results.  How do i solve this lucene search issue having japanese
words in URLs.? Is there any way to convert such characters back to Japanese

Any help/suggestions in this regards is highly appreciated.

Thanks in Advance.


View this message in context:
Sent from the Lucene - Java Users mailing list archive at

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message