lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jack Krupansky <jack.krupan...@gmail.com>
Subject Re: Arabic analyser
Date Mon, 09 Nov 2015 15:47:45 GMT
Use an index-time (but not query time) synonym filter with a rule like:

Abd Allah,Abdallah

This will index the combined word in addition to the separate words.

-- Jack Krupansky

On Mon, Nov 9, 2015 at 4:48 AM, Mahmoud Almokadem <prog.mahmoud@gmail.com>
wrote:

> Hello,
>
> We are indexing Arabic content and facing a problem for tokenizing multi
> terms phrases like 'عبد الله' 'Abd Allah', so users will search for
> 'عبدالله' 'Abdallah' without space and need to get the results of 'عبد
> الله' with space. We are using StandardTokenizer.
>
>
> Is there any configurations to handle this case?
>
> Thank you,
> Mahmoud
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message