lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Geet Gangwar <>
Subject Custom Tokenizer/Analyzer
Date Thu, 20 Feb 2014 09:46:11 GMT

I have a requirement to write a custom tokenizer using Lucene framework.

My requirement is it should have capabilities to match multiple words as
one token. for example. When user passes String as International Business
machine logo or IBM logo it should return International Business Machine as
one token and logo as one token.

Please help me as how can I approach this ...



  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message