lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From radarghost <radargh...@yahoo.com>
Subject Re: foreign characters equivalent in solr search
Date Thu, 19 Feb 2009 07:36:41 GMT

thanks

we will try that and post the results here but it seems we may get problem
with highlight function.



Ahmet Arslan wrote:
> 
> I think best way to do this is to modify
> org.apache.lucene.index.memory.SynonymTokenFilter and employ this filter
> index time.
> 
> if token.termBuffer() has one those (á, à, â, ä, ã, å) characters you will
> replace it with its equvalent ascii character (a). Then you will inject
> this new Token as a Synonym.
> 
> I don't know is it the best way but it will give you what you want.
> 
> --- On Wed, 2/18/09, radarghost <radarghost@yahoo.com> wrote:
> 
>> From: radarghost <radarghost@yahoo.com>
>> Subject: foreign characters equivalent in solr search
>> To: solr-user@lucene.apache.org
>> Date: Wednesday, February 18, 2009, 4:28 PM
>> we are using solr 1.2 and dont want to upgrade to 1.3 till
>> official release
>> for Debian.
>> i want solr to search for equivalent of a foreign chracter
>> for getting
>> better results
>> 
>> in example:
>> 
>> if a user searches for Tiesto which is indexed in this
>> format Tiësto in our
>> solr. we want solr also return result
>> return search result for á, à, â, ä, ã, å where they
>> are in word but that
>> word has been searched with normal a
>> e for ë, i for ï, o for ö, and so on
>> 
>> any solution?
>> 
>> hope i could tell what i need with my poor English
>> 
>> thanks
>> 
>> 
>> -- 
>> View this message in context:
>> http://www.nabble.com/foreign-characters-equivalent-in-solr-search-tp22079912p22079912.html
>> Sent from the Solr - User mailing list archive at
>> Nabble.com.
> 
> 
> 
> 
> 

-- 
View this message in context: http://www.nabble.com/foreign-characters-equivalent-in-solr-search-tp22079912p22095325.html
Sent from the Solr - User mailing list archive at Nabble.com.


Mime
View raw message