Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8BBCF9DA3 for ; Tue, 25 Oct 2011 07:48:10 +0000 (UTC) Received: (qmail 98342 invoked by uid 500); 25 Oct 2011 07:48:08 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 98116 invoked by uid 500); 25 Oct 2011 07:48:08 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 98108 invoked by uid 99); 25 Oct 2011 07:48:07 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 25 Oct 2011 07:48:07 +0000 X-ASF-Spam-Status: No, hits=2.1 required=5.0 tests=FREEMAIL_FROM,FREEMAIL_REPLYTO,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of simon.willnauer@googlemail.com designates 74.125.82.176 as permitted sender) Received: from [74.125.82.176] (HELO mail-wy0-f176.google.com) (74.125.82.176) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 25 Oct 2011 07:48:02 +0000 Received: by wyf19 with SMTP id 19so254685wyf.35 for ; Tue, 25 Oct 2011 00:47:41 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlemail.com; s=gamma; h=mime-version:reply-to:in-reply-to:references:date:message-id :subject:from:to:content-type:content-transfer-encoding; bh=YmaktjZmVZTrmk2eLNzqEGto1vnKqkTOU6bFQT10ZO4=; b=ruJ00vatuLXoD67gQh4LuitdSB2IMMUYfReop/jIipV1exD6pVTdyhyPPKt4ezzIxk 1eoQFCfoU5yOCamTRF6LXgGhTt/r7+f0ysqvm8RJCbxhb/1gKlP3asTKG8dNoemsf8QV bYxBrRqogRrj5rKf5YGDhRd8gnvXIEpDi0bDI= MIME-Version: 1.0 Received: by 10.227.202.143 with SMTP id fe15mr1983284wbb.25.1319528861379; Tue, 25 Oct 2011 00:47:41 -0700 (PDT) Received: by 10.180.88.8 with HTTP; Tue, 25 Oct 2011 00:47:41 -0700 (PDT) Reply-To: simon.willnauer@gmail.com In-Reply-To: References: Date: Tue, 25 Oct 2011 09:47:41 +0200 Message-ID: Subject: Re: reusing the term-frequency count while indexing From: Simon Willnauer To: java-user@lucene.apache.org Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable On Tue, Oct 25, 2011 at 5:08 AM, prasenjit mukherjee wrote: > Thats exactly I was trying to avoid :( > > I can afford to do that during indexing time, but it will be > time-consuming to do that at search time. hu? I don't understand, if you provide the terms at indexing time lucene keeps track of the term frequency etc. why would you want to do this at search time? simon > > On Mon, Oct 24, 2011 at 1:05 PM, Simon Willnauer > wrote: >> so you are saying you got (uniqueTerm, freq) tuples and you want to >> make lucene use this directly? I think the easiest way is to write a >> simple tokenFilter that emit the term X times where X is the term >> frequency. There is no easy way to pass these tuples to lucene >> directly. >> >> simon >> >> On Mon, Oct 24, 2011 at 3:28 AM, prasenjit mukherjee >> wrote: >>> Can you tell me how I can feed the lucene index by using the term >>> frequency directly ? >>> >>> Actually I am getting the documents along with their term-frequency >>> and don't want to write any additional code to expand them. >>> >>> >>> On 10/23/11, ppp c wrote: >>>> Of curse, it can be reused. >>>> But from my point of view, it's meaningless, since the analysis proces= s has >>>> to be performed to collect such as prox, offset, or syno, payload and = so on. >>>> >>>> On Sun, Oct 23, 2011 at 11:22 PM, prasenjit mukherjee >>>> wrote: >>>> >>>>> I already have the term-frequency-count for all the terms =C2=A0in a >>>>> document. Is there a way I can re-use that info while indexing. I >>>>> would like to use solr for this. >>>>> >>>>> --------------------------------------------------------------------- >>>>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org >>>>> For additional commands, e-mail: java-user-help@lucene.apache.org >>>>> >>>>> >>>> >>> >>> -- >>> Sent from my mobile device >>> >>> --------------------------------------------------------------------- >>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org >>> For additional commands, e-mail: java-user-help@lucene.apache.org >>> >>> >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org >> For additional commands, e-mail: java-user-help@lucene.apache.org >> >> > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org