Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 52668 invoked from network); 23 Nov 2008 22:26:53 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 23 Nov 2008 22:26:53 -0000 Received: (qmail 15551 invoked by uid 500); 23 Nov 2008 22:26:56 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 15508 invoked by uid 500); 23 Nov 2008 22:26:56 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 15497 invoked by uid 99); 23 Nov 2008 22:26:56 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 23 Nov 2008 14:26:56 -0800 X-ASF-Spam-Status: No, hits=1.2 required=10.0 tests=SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [217.146.183.157] (HELO n9a.bullet.ukl.yahoo.com) (217.146.183.157) by apache.org (qpsmtpd/0.29) with SMTP; Sun, 23 Nov 2008 22:25:28 +0000 Received: from [217.12.4.214] by n9.bullet.ukl.yahoo.com with NNFMP; 23 Nov 2008 22:26:11 -0000 Received: from [87.248.110.115] by t1.bullet.ukl.yahoo.com with NNFMP; 23 Nov 2008 22:26:11 -0000 Received: from [127.0.0.1] by omp220.mail.ukl.yahoo.com with NNFMP; 23 Nov 2008 22:26:13 -0000 X-Yahoo-Newman-Id: 738751.95201.bm@omp220.mail.ukl.yahoo.com Received: (qmail 38067 invoked from network); 23 Nov 2008 22:26:11 -0000 DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.co.uk; h=Received:X-YMail-OSG:X-Yahoo-Newman-Property:Message-ID:Date:From:User-Agent:MIME-Version:To:Subject:References:In-Reply-To:Content-Type:Content-Transfer-Encoding; b=QbiadRD1LDfV0duTlLVXxh9KwnEhAr/B0LUlCiuyzVAiJaRMrqqPgJcR6Caqp7pCFo1bOc9JVtBYwePSRkAor8JCCFhfDvHv7weN0lRtiVlJsMmWP3ULQbE0rlX+PocFhrTIUuAPbuxiwybpbjm78r2vRayyX+UtPDF9SrHRh38= ; Received: from unknown (HELO ?192.168.2.50?) (markharw00d@194.106.34.5 with plain) by smtp132.mail.ukl.yahoo.com with SMTP; 23 Nov 2008 22:26:11 -0000 X-YMail-OSG: 3VfPxDAVM1n76zi.N8Mg39sHDiwjisZfQjmY_bZEmfBaqZUQ3_fLOlw7bWfBeoJIOz2GLh2so9S9xnIfjeUBD.0BOE..yDI7sYtSnU77qJunTdlecT_OCju3fKwELxdH8KROww8XMv4RIq4V.kG.DP_vU3Z4fsUxSk3_91VEUr1hQDlyTdeiD5tlcVlo X-Yahoo-Newman-Property: ymail-3 Message-ID: <4929D882.9070009@yahoo.co.uk> Date: Sun, 23 Nov 2008 22:26:10 +0000 From: markharw00d User-Agent: Thunderbird 2.0.0.17 (Windows/20080914) MIME-Version: 1.0 To: java-user@lucene.apache.org Subject: Re: [ot] a reverse lucene References: <4928D4A8.20500@holsman.net> <49299B73.5090309@getopt.org> <4929B752.8090207@holsman.net> In-Reply-To: <4929B752.8090207@holsman.net> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org If you index the queries consider also that they can potentially be indexed in an optimised form. For example, take a phrase query for "Alonso Smith". You need only index one of these terms - an incoming document must contain both terms to be considered a match. If you chose to index this query on the rare term "Alonso" you would get far fewer requests to run this query than if you chose to index the comparitively more common "Smith". Basically any query with mandatory terms can be "index optimised" to record only the rarest mandatory term (rarity typically being measured by using a look-up on some background index). Cheers, Mark Ian Holsman wrote: > Thanks for all the suggestions guys.. > This is great! > > > Andrzej Bialecki wrote: >> Ian Holsman wrote: >>> Hi. apologies for the off-topic question. >>> >>> I was wondering if anyone knew of a open source solution (or a >>> pointer to the algorithms) >>> that do the reverse of lucene. >>> By that I mean store a whole lot of queries, and run them against a >>> document to see which queries match it. (with a score etc) >>> >>> I can see the case for this would be a news-article and several >>> people writing queries to get alerted if it matched a certain >>> condition. >> >> >> http://www.seas.upenn.edu/~svilen/publications/subscribe.pdf >> >> >> > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > ------------------------------------------------------------------------ > > > No virus found in this incoming message. > Checked by AVG - http://www.avg.com > Version: 8.0.175 / Virus Database: 270.9.9/1806 - Release Date: 11/22/2008 6:59 PM > > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org