lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Grant Ingersoll <gsing...@apache.org>
Subject Re: vector model usage
Date Tue, 01 Jun 2010 17:35:19 GMT

On May 31, 2010, at 6:25 AM, Dionisis Koumouras wrote:

> Hi all,
> I'm new to lucene but have used it succesfully for a few simple tasks.
> 
> I am experimenting with the vector space representation of documents and
> have managed to store and retrieve TermFreqVector objects.
> 
> The question is whether it is possible to directly add vector space
> representations of documents to an index. I can't find any way to create a
> document field from a TermFreqVector object.

The Field constructor can take in a TokenStream (i.e. a preanalyzed stream) which you could
easily back with a TermFreqVector.

> 
> This is the use case behind the question: retrieve some documents from the
> index, cluster them, and store the vector space representations of the
> clusters back to the index.
> 
> Dionisis

--------------------------
Grant Ingersoll
http://www.lucidimagination.com/

Search the Lucene ecosystem using Solr/Lucene: http://www.lucidimagination.com/search


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message