Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 84735 invoked from network); 31 May 2010 19:32:09 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 31 May 2010 19:32:09 -0000 Received: (qmail 97065 invoked by uid 500); 31 May 2010 19:32:07 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 97013 invoked by uid 500); 31 May 2010 19:32:07 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 97005 invoked by uid 99); 31 May 2010 19:32:07 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 31 May 2010 19:32:07 +0000 X-ASF-Spam-Status: No, hits=0.7 required=10.0 tests=RCVD_IN_DNSWL_NONE,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [209.85.160.176] (HELO mail-gy0-f176.google.com) (209.85.160.176) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 31 May 2010 19:32:00 +0000 Received: by gyf1 with SMTP id 1so3725347gyf.35 for ; Mon, 31 May 2010 12:31:39 -0700 (PDT) MIME-Version: 1.0 Received: by 10.151.4.1 with SMTP id g1mr5309777ybi.175.1275334299714; Mon, 31 May 2010 12:31:39 -0700 (PDT) Received: by 10.151.11.20 with HTTP; Mon, 31 May 2010 12:31:39 -0700 (PDT) In-Reply-To: <4C038409.6070907@getopt.org> References: <002701cb009e$f0a0e4a0$d1e2ade0$@thetaphi.de> <4C038409.6070907@getopt.org> Date: Mon, 31 May 2010 15:31:39 -0400 Message-ID: Subject: Re: Question about Field.setOmitTermFreqAndPositions(true) From: Michael McCandless To: java-user@lucene.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org TermVectors are not used for searching; they just store each doc, inverted. They allow you to retrieve all terms (and optionally their positions/offsets) for a given document. But this entails a seek, per-document, so it's fairly costly. Highlighters use term vectors because they are a good way to map a given term back to the start/end offset in the original text; without them you usually have to re-analyze the text (though, you could also do highlighting client-side, eg use JS to locate all surface forms for a given term, and highlight them, for HTML; or ask Acrobat Reader to similarly highlight terms). Mike On Mon, May 31, 2010 at 5:40 AM, Andrzej Bialecki wrote: > On 2010-05-31 10:54, Uwe Schindler wrote: >> No. > > See also LUCENE-2048 (nice round number ;) ). > > > -- > Best regards, > Andrzej Bialecki =A0 =A0 <>< > =A0___. ___ ___ ___ _ _ =A0 __________________________________ > [__ || __|__/|__||\/| =A0Information Retrieval, Semantic Web > ___|||__|| =A0\| =A0|| =A0| =A0Embedded Unix, System Integration > http://www.sigram.com =A0Contact: info at sigram dot com > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org