Hi guys!
Is there a way to apply a threshold to the terms I want to store in the
index (before they are indexed)? I need to store only the terms with the
highest frequencies. I have done this with term vectors and a cutoff ratio
that discards the least frequent terms, but all of this, of course, happens
at retrieval time, reading from the index.
I know it makes no sense to calculate the frequencies of terms before they
are stored, but I guess there could be some way to work around it?
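
To make the idea concrete, here is a minimal sketch of the kind of workaround I have in mind: a two-pass approach that counts terms first and filters the tokens before they would be handed to the indexer. This is plain Java, not Lucene API; the class name TermCutoff, its methods, and the keepRatio parameter are all hypothetical, and documents are assumed to be already tokenized into lists of strings.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical helper, not part of Lucene: two-pass frequency cutoff.
public class TermCutoff {

    // Pass 1: count how often each term occurs across all documents.
    static Map<String, Integer> countTerms(List<List<String>> docs) {
        Map<String, Integer> counts = new HashMap<>();
        for (List<String> doc : docs) {
            for (String term : doc) {
                counts.merge(term, 1, Integer::sum);
            }
        }
        return counts;
    }

    // Keep the most frequent terms: the top keepRatio fraction of the
    // distinct terms (rounded up). Ties are broken alphabetically.
    static Set<String> selectFrequentTerms(Map<String, Integer> counts, double keepRatio) {
        List<Map.Entry<String, Integer>> entries = new ArrayList<>(counts.entrySet());
        entries.sort(Map.Entry.<String, Integer>comparingByValue().reversed()
                .thenComparing(Map.Entry.<String, Integer>comparingByKey()));
        int keep = (int) Math.ceil(entries.size() * keepRatio);
        Set<String> kept = new HashSet<>();
        for (int i = 0; i < keep && i < entries.size(); i++) {
            kept.add(entries.get(i).getKey());
        }
        return kept;
    }

    // Pass 2: drop every token that did not survive the cutoff. The
    // returned list is what would then be passed on for indexing.
    static List<String> filterTokens(List<String> doc, Set<String> kept) {
        List<String> out = new ArrayList<>();
        for (String term : doc) {
            if (kept.contains(term)) {
                out.add(term);
            }
        }
        return out;
    }
}
```

The obvious cost is that every document has to be read twice, once to count and once to index, which is exactly what I was hoping to avoid.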
All help appreciated!
Thank you!
--
View this message in context: http://lucene.472066.n3.nabble.com/Applying-term-frequency-thresholds-on-indexing-time-tp839449p839449.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.