lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Junte Zhang <>
Subject RE: multi language search engine in solr
Date Mon, 11 Sep 2017 16:32:48 GMT
Having the language already separated makes it a lot easier. 

You could add the language suffix (e.g. 3 letter with ISO 639-2B
per field where you have the different languages. Or else you could have copied an entire
field to their language-analyzed fields, and hope that would be good enough for matching.

I think Malay should be very similar to Indonesian (
However, you could extend this by adding your own dictionary (keywords) and stopwords (if
that is desirable).


-----Original Message-----
From: Mugeesh Husain [] 
Sent: Monday, September 11, 2017 3:46 AM
Subject: Re: multi language search engine in solr

Thank you rick for your response.

The document document have sepearte of the lanaguage instead of mix of Arabic, English, Bengali,
Hindi, Malay.

I coul not find any tokenizer for Malay, can you suggest me if you know please.

Sent from:

View raw message