Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 65642 invoked from network); 17 Mar 2009 11:10:38 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 17 Mar 2009 11:10:38 -0000 Received: (qmail 12719 invoked by uid 500); 17 Mar 2009 11:10:31 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 12690 invoked by uid 500); 17 Mar 2009 11:10:31 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 12679 invoked by uid 99); 17 Mar 2009 11:10:31 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 17 Mar 2009 04:10:31 -0700 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of ian.lea@gmail.com designates 209.85.218.174 as permitted sender) Received: from [209.85.218.174] (HELO mail-bw0-f174.google.com) (209.85.218.174) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 17 Mar 2009 11:10:24 +0000 Received: by bwz22 with SMTP id 22so6962bwz.5 for ; Tue, 17 Mar 2009 04:10:03 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:content-type :content-transfer-encoding; bh=5zUJ+81uVkUFFteAJCSMtyxW1E9xaKi/VbYtr5wNSNw=; b=vqAKZU1tNXYkViHmN3bzPIEB9OeF2WCobV06tf0T+tNBMRDgAzhCOPdI356orbXT2m MXuO5Bi3aeUaftj1mZLtGdLxWesQ7i0GFKXurUGx6z+EOz3PchpKr5FeJjFwiquc6X3P 5J69oVdu+OAhYD/KWMeahiyrZaE4Eq8RjcLzg= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:content-transfer-encoding; b=qHbrkiK5hbkMfugZ8ojwE9VfIWBJNem6pAzyG2Trl7Ij0698k7uXelFKDKDgB7+cbr 8YG8/NoUuOHPDxOr9RwTfyx4Vwgga48sAcnd+1w+82HKdGfnVy5NlgK67hGRUfStzo+k jjAryJhJKLKOKVA4O9kGUHyHMHV3nabrcZ3VM= MIME-Version: 1.0 Received: by 10.204.31.75 with SMTP id x11mr1972931bkc.1.1237288203117; Tue, 17 Mar 2009 04:10:03 -0700 (PDT) In-Reply-To: <66B3083E-06E6-4CA0-B491-3198BB315425@mikemccandless.com> References: <1237198328.57959.ezmlm@lucene.apache.org> <49BE292D.6050809@gmail.com> <4e7841490903170242h1cb23b1bta01ae4c662e99a4e@mail.gmail.com> <66B3083E-06E6-4CA0-B491-3198BB315425@mikemccandless.com> Date: Tue, 17 Mar 2009 11:10:03 +0000 Message-ID: <8c4e68610903170410t6157cd12u15c5bfb98cdea33@mail.gmail.com> Subject: Re: number of hits of pages containing two terms From: Ian Lea To: java-user@lucene.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org This is all getting very complicated! Adrian - have you looked any further into why your original two term query was too slow? My experience is that simple queries are usually extremely fast. Standard questions: have you warmed up the searcher? How large is the index? How many occurrences of your first or second terms? Anything odd about them? See also http://wiki.apache.org/lucene-java/ImproveSearchingSpeed -- Ian. On Tue, Mar 17, 2009 at 11:00 AM, Michael McCandless wrote: > > Adrian Dimulescu wrote: > >> Thank you. >> >> I suppose the solution for this is to not create an index but to store >> co-occurence frequencies at Analyzer level. > > I don't understand how this would address the "docFreq does > not reflect deletions". > > You can use the shingles analyzer (under contrib/analyzers) > to create and index bigrams. =A0(But the docFreq would still not > reflect deletions). > >> Adrian. >> >> On Mon, Mar 16, 2009 at 11:37 AM, Michael McCandless < >> lucene@mikemccandless.com> wrote: >> >>> >>> Be careful: docFreq does not take deletions into account. >>> > > Mike > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org