lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Maximilian Hütter ...@blue-elephant-systems.com>
Subject Re: Searching combined English-Japanese index
Date Mon, 01 Oct 2007 15:46:32 GMT
Yonik Seeley schrieb:
> On 10/1/07, Maximilian Hütter <mh@blue-elephant-systems.com> wrote:
>> When I search using an English term, I get results but the Japanese is
>> not encoded correctly in the response. (although it is UTF-8 encoded)
> 
> One quick thing to try is the python writer (wt=python) to see the
> actual unicode values of what you are getting back (since the python
> writer automatically escapes non-ascii).  That can help rule out
> incorrect charset handling by clients.
> 
> -Yonik
> 
Thanks for the tip, it turns out that the unicode values are wrong... I
mean the browser displays correctly what is send. But I don't know how
solr gets these values.

For example python output is:

'key':'honshu_server_ovo:application_List VPO NT Templates_integrated',
	 'backend':'honshu_server',
	 'service':'ovoconfig',
	 'objectclass':'ovo:application',
	 'objecttype':'integrated',
	 'name':'List VPO NT Templates',
	 'label':u'VPO
\u00e3\u0083\u0086\u00e3\u0083\u00b3\u00e3\u0083\u0097\u00e3\u0083\u00ac\u00e3\u0083\u00bc\u00e3\u0083\u0088',
	 'path':'',
	 'context':'',
	 'revision':'',
	 'description':'',
	 'ovo:application_name':'List VPO NT Templates'},

But in Limo the doc looks like this:

key  	 	honshu_server_ovo:application_List VPO NT Templates_integrated
backend 	honshu_server
service 	ovoconfig
objectclass 	ovo:application
objecttype 	integrated
name 	 	List VPO NT Templates
label 	 	VPO テンプレート
path 	 	 	
context 	 	
revision 	 	
description 	 	
ovo:application_name 	List VPO NT Templates

I hope you can view the japanese katakana in the label field.

But somehow this is changed to completely different unicode characters
in the search result.

Max

-- 
Maximilian Hütter
blue elephant systems GmbH
Wollgrasweg 49
D-70599 Stuttgart

Tel            :  (+49) 0711 - 45 10 17 578
Fax            :  (+49) 0711 - 45 10 17 573
e-mail         :  max.huetter@blue-elephant-systems.com
Sitz           :  Stuttgart, Amtsgericht Stuttgart, HRB 24106
Geschäftsführer:  Joachim Hörnle, Thomas Gentsch, Holger Dietrich

Mime
View raw message