Return-Path: Delivered-To: apmail-lucene-general-archive@www.apache.org Received: (qmail 49168 invoked from network); 8 Jul 2008 01:01:35 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 8 Jul 2008 01:01:35 -0000 Received: (qmail 63436 invoked by uid 500); 8 Jul 2008 01:01:35 -0000 Delivered-To: apmail-lucene-general-archive@lucene.apache.org Received: (qmail 63413 invoked by uid 500); 8 Jul 2008 01:01:35 -0000 Mailing-List: contact general-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@lucene.apache.org Delivered-To: mailing list general@lucene.apache.org Received: (qmail 63402 invoked by uid 99); 8 Jul 2008 01:01:35 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 07 Jul 2008 18:01:35 -0700 X-ASF-Spam-Status: No, hits=2.0 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of blazingwolf7@gmail.com designates 209.85.146.178 as permitted sender) Received: from [209.85.146.178] (HELO wa-out-1112.google.com) (209.85.146.178) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 08 Jul 2008 01:00:43 +0000 Received: by wa-out-1112.google.com with SMTP id l24so1544595waf.11 for ; Mon, 07 Jul 2008 18:01:05 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:reply-to :to:subject:in-reply-to:mime-version:content-type:references; bh=3g8T+9uTHCKRhOUlQTWvfIYsT43/OjcuTR8vRZkKt98=; b=W5Ztx/PkT0DMZeX/fnTJTgZvnTvytk7C0Sf0D7hyhpStfFVbq31+kGhE3cEbURDPw9 +nmR918opvI4nzrRzWD/q+esIPHqi7rOUGXK4+bXVAf9pXJ5f2AfXjlncK1rESlbh31N 2PD1LSN1qcDgt14DLUFUNxmLlJcfYgPnSshFU= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:reply-to:to:subject:in-reply-to:mime-version :content-type:references; b=EaO2tf4hrbbjCQBZ8m3ZpXeBFUn5wiv6gmJi7Muu5gv6aiPqscTooV5ImsoTGSa+y8 NDiRN3PWKUtieXlUlysAbp+alnCXZktcHoAvAbhUUw4SsqOuQdps/WxWC5B5NSg1LTkJ O9MisVc1Iw7Egs774/KuFIwu5RRPN06BZUdV8= Received: by 10.114.159.17 with SMTP id h17mr7134859wae.227.1215478865196; Mon, 07 Jul 2008 18:01:05 -0700 (PDT) Received: by 10.114.77.7 with HTTP; Mon, 7 Jul 2008 18:01:05 -0700 (PDT) Message-ID: Date: Tue, 8 Jul 2008 09:01:05 +0800 From: "lee gary" Reply-To: blazingwolf7@yahoo.com To: general@lucene.apache.org Subject: Alternatives for opening Index In-Reply-To: MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_22491_28530203.1215478865161" References: X-Virus-Checked: Checked by ClamAV on apache.org ------=_Part_22491_28530203.1215478865161 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline Hi, I am currently working on retrieving url and contentLength of each document found during the search. I want to retrieve it during the calculation of score so that I can influence the score in some other way. I am trying to retrieve the url and use it as filter. The main problem is I don't want to use a reader to continuously retrieve the url for each document located. TermDocs termDocs = reader.termDocs(); TermEnum termEnum = reader.terms (new Term (field, "")); do{ Term term = termEnum.term(); }while(termEnum.next()); I am using this code to retrieve the field containing the url but it is tokenized. Is there anyway to untokenized it or is there a better way to do this? Also, I try to retrieve the contentLength, but the results return are null. Why is that? I opened the index using Luke and the contentLength is there but when I try to get it using this way, the results is null. Can anyone help me with both of these problems? Any help will be appreciated. Thanks ------=_Part_22491_28530203.1215478865161--