Message-ID: <3d2ce8cb0703161426g30875940m391d4f298e401560@mail.gmail.com>
Date: Fri, 16 Mar 2007 14:26:46 -0700
From: "Mike Klaas"
To: java-dev@lucene.apache.org
Subject: Re: Resolving term vector even when not stored?

On 3/15/07, karl wettin wrote:
> I propose changing the current IndexReader.getTermFreqVector/s
> code so that it /always/ returns the vector space model of a document,
> even when the fields are set to Field.TermVector.NO.
>
> Is that crazy? It could be really slow, but apart from that... And if it
> is cached, then that information is known by inspecting the fields.
> People don't go fetching term vectors without knowing what they are
> doing, do they?

The highlighting contrib code does this: it attempts to retrieve the
term vector, catches IllegalArgumentException, and falls back to
re-analysis of the data.

I'm not sure if that is crazy, but that is what is currently implemented.

-Mike
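
The fallback described above (try the stored term vector, and re-analyze the field text if none is available) could be sketched roughly as follows. This is only an illustration of the pattern, not the highlighter's actual code: VectorSource, storedTermVector, storedText, and TermVectorFallback are hypothetical names made up for this sketch and are not part of Lucene's API, and the "analysis" here is just lower-casing and splitting on non-alphanumeric characters.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical stand-in for an index reader; NOT Lucene's IndexReader API.
interface VectorSource {
    // Returns term -> frequency for the field, or throws
    // IllegalArgumentException if no term vector was stored for it.
    Map<String, Integer> storedTermVector(int docId, String field);

    // Returns the raw stored text of the field.
    String storedText(int docId, String field);
}

class TermVectorFallback {
    // Simplified re-analysis (an assumption of this sketch): lower-case the
    // text and split on anything that is not a letter or digit.
    static Map<String, Integer> reanalyze(String text) {
        Map<String, Integer> freqs = new LinkedHashMap<>();
        for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!token.isEmpty()) {
                freqs.merge(token, 1, Integer::sum);
            }
        }
        return freqs;
    }

    // Try the stored term vector first; fall back to re-analysis on failure.
    static Map<String, Integer> termFreqs(VectorSource source, int docId, String field) {
        try {
            return source.storedTermVector(docId, field);
        } catch (IllegalArgumentException e) {
            // No vector stored for this field: recompute it from the text.
            return reanalyze(source.storedText(docId, field));
        }
    }

    public static void main(String[] args) {
        // A source that never has term vectors, so the fallback path is taken.
        VectorSource noVectors = new VectorSource() {
            public Map<String, Integer> storedTermVector(int docId, String field) {
                throw new IllegalArgumentException("no term vector stored for " + field);
            }
            public String storedText(int docId, String field) {
                return "To be or not to be";
            }
        };
        System.out.println(termFreqs(noVectors, 0, "body"));
    }
}
```

The caller gets term frequencies either way; the cost of the fallback is the re-analysis, which is the "could be really slow" concern raised in the quoted proposal.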