Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 77464 invoked from network); 8 Jun 2009 12:59:55 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 8 Jun 2009 12:59:55 -0000 Received: (qmail 55740 invoked by uid 500); 8 Jun 2009 13:00:05 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 55662 invoked by uid 500); 8 Jun 2009 13:00:04 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 55652 invoked by uid 99); 8 Jun 2009 13:00:04 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 08 Jun 2009 13:00:04 +0000 X-ASF-Spam-Status: No, hits=1.2 required=10.0 tests=SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [208.97.132.202] (HELO spunkymail-a4.g.dreamhost.com) (208.97.132.202) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 08 Jun 2009 12:59:54 +0000 Received: from [192.168.0.105] (adsl-074-229-189-244.sip.rmo.bellsouth.net [74.229.189.244]) by spunkymail-a4.g.dreamhost.com (Postfix) with ESMTP id 294543BA05 for ; Mon, 8 Jun 2009 05:59:32 -0700 (PDT) Message-Id: <6228ED4B-A9EE-4D61-BF0A-7C186471BE1D@apache.org> From: Grant Ingersoll To: java-user@lucene.apache.org In-Reply-To: <101538.27115.qm@web45308.mail.sp1.yahoo.com> Content-Type: text/plain; charset=US-ASCII; format=flowed; delsp=yes Content-Transfer-Encoding: 7bit Mime-Version: 1.0 (Apple Message framework v935.3) Subject: Re: Retrieving the term vectors of a document in Nutch Date: Mon, 8 Jun 2009 08:59:30 -0400 References: <780692.37408.qm@web45302.mail.sp1.yahoo.com> <101538.27115.qm@web45308.mail.sp1.yahoo.com> X-Mailer: Apple Mail (2.935.3) X-Virus-Checked: Checked by ClamAV on apache.org I'd ask on the nutch-user@lucene.apache.org mailing list. While Lucene can do all of these things, it is not clear how Nutch exposes, if at all, any of this information. You should be able to get results there. Note, however, that Term Vecs must be created during indexing by creating the Field properly. You could likely modify the Nutch code where it creates the Lucene Document and Fields to add in Term Vector capabilities. -Grant On Jun 7, 2009, at 8:58 PM, House Less wrote: > > In retrospect, pardon my stupidity: surely it cannot be right that > the term frequency vector for a page is not present within Nutch, > for it needs this to compute the score for a page given a query. I > would appreciate it if you would tell me where I may find it given a > document number. Thank you. > > > > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > -------------------------- Grant Ingersoll http://www.lucidimagination.com/ Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids) using Solr/Lucene: http://www.lucidimagination.com/search --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org