lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sharon Tam <>
Subject Indexing Term Frequency Vectors
Date Thu, 28 Mar 2013 19:25:28 GMT
I believe that when Lucene indexes documents, it generates counts for a
term by counting how many times the term appears in a particular document.
Instead of having Lucene do the counting, I want to do my own counting and
feed a term-frequency vector representation of a document directly into the
indexer which will take my counts and proceed to do the other processing
such as generating inverse document frequency.  My term-frequencies may not
all be integers.  Is there a way to do this?

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message