lucene-java-user mailing list archives

From suriya prakash <suriy...@gmail.com>
Subject Email id tokenizer (actual email id & multiple terms)
Date Tue, 20 Dec 2016 14:15:06 GMT
Hi,

I am using StandardAnalyzer and want to tokenize the email id
"lucene@gmail.com" as "lucene", "gmail", "com", and "lucene@gmail.com" in a
single pass.
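
To make the desired output concrete, here is a minimal sketch in plain Java. It does not use any Lucene API; the class name EmailTokenSketch and the method expand are hypothetical and only illustrate which terms I would like to see for one email id.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only: plain Java, not a Lucene Tokenizer or TokenFilter.
class EmailTokenSketch {

    // Returns the parts of the address split on '@' and '.',
    // followed by the unsplit address itself.
    static List<String> expand(String email) {
        List<String> tokens = new ArrayList<>();
        for (String part : email.split("[@.]")) {
            if (!part.isEmpty()) {
                tokens.add(part);
            }
        }
        tokens.add(email); // keep the actual email id as its own term
        return tokens;
    }

    public static void main(String[] args) {
        // prints [lucene, gmail, com, lucene@gmail.com]
        System.out.println(expand("lucene@gmail.com"));
    }
}
```

In a real analyzer this expansion would presumably have to happen inside the token stream itself, which is the part I am asking about.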

I have already changed the JFlex grammar to split an email id into separate
words (lucene, gmail, com). But then we need to use a phrase search to match
the full address, which I expect will not be efficient. So I want to index
both the actual email id and the split words.

Can you please help me achieve this? Or let me know whether phrase search
is efficient enough for this case.


Regards,
Suriya
