lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From SBS <jturn...@uow.edu.au>
Subject Enabling indexing of hyphenated terms sans the hyphen
Date Mon, 19 Sep 2011 20:27:08 GMT
We use StandardTokenizer and this works well but we also need to include
terms in our index which consist of hyphenated terms with the hyphen
removed.  So, for example, if the text being indexed contains "self-induced"
we need the terms "self", "induced" and "selfinduced" to be indexed.

How would I go about implementing this?  We use Lucene Java 3.2.

Thanks,

-sbs

--
View this message in context: http://lucene.472066.n3.nabble.com/Enabling-indexing-of-hyphenated-terms-sans-the-hyphen-tp3350008p3350008.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message