Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@apache.org Received: (qmail 83139 invoked from network); 17 Apr 2003 18:55:37 -0000 Received: from exchange.sun.com (192.18.33.10) by daedalus.apache.org with SMTP; 17 Apr 2003 18:55:37 -0000 Received: (qmail 27892 invoked by uid 97); 17 Apr 2003 18:57:35 -0000 Delivered-To: qmlist-jakarta-archive-lucene-user@nagoya.betaversion.org Received: (qmail 27885 invoked from network); 17 Apr 2003 18:57:35 -0000 Received: from daedalus.apache.org (HELO apache.org) (208.185.179.12) by nagoya.betaversion.org with SMTP; 17 Apr 2003 18:57:35 -0000 Received: (qmail 81636 invoked by uid 500); 17 Apr 2003 18:55:20 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 81587 invoked from network); 17 Apr 2003 18:55:19 -0000 Received: from mailgw1.fraunhofer.de (153.96.1.62) by daedalus.apache.org with SMTP; 17 Apr 2003 18:55:19 -0000 Received: from mailgw1.fraunhofer.de (localhost [127.0.0.1]) by mailgw1.fraunhofer.de (8.12.9/8.12.9) with ESMTP id h3HItLEL002119 for ; Thu, 17 Apr 2003 20:55:21 +0200 (MEST) Received: from heinrichschiff.igd.fhg.de (host6924.igd.fhg.de [146.140.28.12] (may be forged)) by mailgw1.fraunhofer.de (8.12.9/8.12.9) with ESMTP id h3HItKfe002116 for ; Thu, 17 Apr 2003 20:55:21 +0200 (MEST) Received: from pcara (PCARA-ds.igd.fhg.de [146.140.9.73]) by heinrichschiff.igd.fhg.de (8.9.3+Sun/8.9.3) with SMTP id UAA29052 for ; Thu, 17 Apr 2003 20:48:37 +0200 (MEST) From: "Gregor Heinrich" To: "'Lucene Users List'" Subject: Reading out term frequencies document-wise. Date: Thu, 17 Apr 2003 20:55:15 +0200 Message-ID: <004901c30512$e2d5f9d0$49098c92@pcara> MIME-Version: 1.0 Content-Type: text/plain; charset="US-ASCII" Content-Transfer-Encoding: 7bit X-Priority: 3 (Normal) X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook CWS, Build 9.0.2416 (9.0.2911.0) Importance: Normal In-Reply-To: <20030416164310.38647.qmail@web12707.mail.yahoo.com> X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2600.0000 X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N Hello all, after creating a Lucene index, for some applications it's useful to access term frequencies. Is there any solution to read out term frequencies in an efficient way, something like a IndexReader.docTerms(int docId)? This completely subverts the reverse-index paradigm but is useful, for instance to create a term-document matrix in sparse format (Harwell Boeing, say). A more general question. Is there any current activity to create term-vector support for Lucene? Maybe it's possible to do a joint activity. Thanks, gregor --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org