lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From maxSchlein <>
Subject Controlling what is indexed / normalizing our index
Date Mon, 15 Feb 2010 21:28:04 GMT

We have a list of keywords with aliases (Example:  keyword = "ms access"
aliases = "microsoft access", "msaccess", "m.s. access"  )

We would like to intercept the aliases prior to them being indexed, and have
the keyword indexed instead.  We can do this with a CustomFilter for single
word aliases.  (Example: in filter token = "access", we change value to
"msaccess").  Our problem is when the token equals microsoft, we need to
find out if the next token is access or not, that is, does it match one of
our aliases.

Has anyone had an issue like?  Any and all help is appreciated.  Thanx.
View this message in context:
Sent from the Lucene - Java Users mailing list archive at

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message