lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Grant Ingersoll <>
Subject Re: Retrieving the term vectors of a document in Nutch
Date Mon, 08 Jun 2009 12:59:30 GMT
I'd ask on the mailing list.  While  
Lucene can do all of these things, it is not clear how Nutch exposes,  
if at all, any of this information.  You should be able to get results  

Note, however, that Term Vecs must be created during indexing by  
creating the Field properly.  You could likely modify the Nutch code  
where it creates the Lucene Document and Fields to add in Term Vector  


On Jun 7, 2009, at 8:58 PM, House Less wrote:

> In retrospect, pardon my stupidity: surely it cannot be right that  
> the term frequency vector for a page is not present within Nutch,  
> for it needs this to compute the score for a page given a query. I  
> would appreciate it if you would tell me where I may find it given a  
> document number. Thank you.
> ---------------------------------------------------------------------
> To unsubscribe, e-mail:
> For additional commands, e-mail:

Grant Ingersoll

Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids)  
using Solr/Lucene:

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message