From: Venkateshprasanna
To: java-user@lucene.apache.org
Subject: Re: Extracting data from Lucene index files
Date: Mon, 25 Dec 2006 20:07:46 -0800 (PST)
Message-ID: <8050926.post@talk.nabble.com>

Thanks a lot Doron, it
worked fine and thanks for your tip as well!

Prasanna

Using term vectors means passing over the terms too many times, i.e.:

- loop on terms
- - loop on docs of a term
- - - loop on terms of a doc

Would something like this be better:

    // assumes tenum is already positioned on a term, e.g. from reader.terms(someTerm)
    do {
      System.out.println(tenum.term() + " appears in " + tenum.docFreq() + " docs!");
      TermDocs td = reader.termDocs(tenum.term());
      // td is not positioned on a doc until next() has been called once
      while (td.next()) {
        System.out.println("  In doc id: " + td.doc() + " it appears: " + td.freq() + " times");
      }
      td.close();
    } while (tenum.next());

Also, you can skip ahead faster to a certain doc (id) or a certain term using the skipTo() methods.

Doron

--
View this message in context: http://www.nabble.com/Extracting-data-from-Lucene-index-files-tf2813318.html#a8050926
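
For anyone who wants to see the loop order without a Lucene index at hand, here is a minimal sketch of the same terms-then-docs walk over a toy in-memory postings map. Everything in it is an assumption made for illustration: the class PostingsWalk and its methods add, dump and firstDocAtOrAfter are hypothetical names and are not part of Lucene, and the whitespace tokenizer stands in for a real analyzer. It only shows that one pass over the terms, with one pass over each term's docs, yields every (term, doc, frequency) triple, and that a sorted doc list lets you jump to the first doc id at or after a target, which is roughly what skipTo() is for.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Toy stand-in for an inverted index. Hypothetical class, NOT Lucene's API. */
final class PostingsWalk {
    // term -> (doc id -> number of times the term occurs in that doc); both maps are sorted
    private final TreeMap<String, TreeMap<Integer, Integer>> postings = new TreeMap<>();

    /** Index one document given as whitespace-separated terms. */
    void add(int docId, String text) {
        for (String term : text.split("\\s+")) {
            if (term.isEmpty()) {
                continue; // split() yields an empty token for leading whitespace
            }
            postings.computeIfAbsent(term, t -> new TreeMap<>())
                    .merge(docId, 1, Integer::sum);
        }
    }

    /** Outer loop over terms, inner loop over each term's docs. */
    List<String> dump() {
        List<String> lines = new ArrayList<>();
        for (Map.Entry<String, TreeMap<Integer, Integer>> term : postings.entrySet()) {
            lines.add(term.getKey() + " appears in " + term.getValue().size() + " docs");
            for (Map.Entry<Integer, Integer> doc : term.getValue().entrySet()) {
                lines.add("  doc " + doc.getKey() + ": " + doc.getValue() + " times");
            }
        }
        return lines;
    }

    /** First doc id >= target that contains the term, or -1 if there is none. */
    int firstDocAtOrAfter(String term, int target) {
        TreeMap<Integer, Integer> docs = postings.get(term);
        if (docs == null) {
            return -1;
        }
        Integer id = docs.ceilingKey(target);
        return id == null ? -1 : id;
    }
}
```

To try it, create a PostingsWalk, call add(docId, text) for a few documents, and print the lines returned by dump(). Each term is visited once and each of its docs once, so there is no third nested loop over the terms of a doc as there would be with term vectors.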