lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Grant Ingersoll <>
Subject Re: vector model usage
Date Tue, 01 Jun 2010 17:35:19 GMT

On May 31, 2010, at 6:25 AM, Dionisis Koumouras wrote:

> Hi all,
> I'm new to lucene but have used it succesfully for a few simple tasks.
> I am experimenting with the vector space representation of documents and
> have managed to store and retrieve TermFreqVector objects.
> The question is whether it is possible to directly add vector space
> representations of documents to an index. I can't find any way to create a
> document field from a TermFreqVector object.

The Field constructor can take in a TokenStream (i.e. a preanalyzed stream) which you could
easily back with a TermFreqVector.

> This is the use case behind the question: retrieve some documents from the
> index, cluster them, and store the vector space representations of the
> clusters back to the index.
> Dionisis

Grant Ingersoll

Search the Lucene ecosystem using Solr/Lucene:

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message