From: Tim Jones
To: 'Lucene Users List'
Subject: Re: Using lucene as search engine to replace sql queries
Date: Thu, 18 Mar 2004 10:20:03 -0600
Message-ID: <4810C017B52BD4118EEA00B0D020B0ED1408543D@gamma.hoovers.com>

It seems like you could do it in Lucene:

- the first table (Human/Man/Dutch) would be considered Terms in Lucene (each record would correspond to a Term)
- the second table would be turned into a Document for each unique relationId in the table (for example, one document would be the relationship id 1 example you give).
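To make the mapping concrete, here is a rough sketch in plain Java. It is not Lucene's API; it only models the idea with standard collections: each relation id becomes one document, each name is a term, and a query returns only the relations that carry every requested term. The class name RelationIndex, its methods, and the second sample relation (a Belgian woman) are made up for illustration.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Plain-Java model of the mapping described above; this is NOT Lucene's API.
// Each relation id is one "document"; each name (Human, Man, Dutch) is a "term".
class RelationIndex {
    // term -> sorted set of relation ids whose document contains that term
    private final Map<String, TreeSet<Integer>> postings = new HashMap<>();
    // relation id -> stored extra data (e.g. a description) returned with hits
    private final Map<Integer, String> stored = new HashMap<>();

    // Index one relation: store its description and post its id under each name.
    void add(int relationId, String description, String... names) {
        stored.put(relationId, description);
        for (String name : names) {
            postings.computeIfAbsent(name, k -> new TreeSet<>()).add(relationId);
        }
    }

    // Exact AND query: a relation matches only if it carries every requested term.
    List<Integer> queryAll(String... names) {
        if (names.length == 0) {
            return Collections.emptyList();
        }
        TreeSet<Integer> result = null;
        for (String name : names) {
            Set<Integer> ids = postings.getOrDefault(name, new TreeSet<>());
            if (result == null) {
                result = new TreeSet<>(ids);   // start from the first term's postings
            } else {
                result.retainAll(ids);         // intersect with each further term
            }
        }
        return new ArrayList<>(result);
    }

    // Look up the stored data for a hit.
    String description(int relationId) {
        return stored.get(relationId);
    }
}
```

With relation 1 added as Human/Man/Dutch and a hypothetical relation 2 as Human/Woman/Belgian, queryAll("Human", "Dutch") would return only relation 1, which is the "no fuzzy stuff" behaviour asked for. In Lucene itself the same effect would come from indexing the names untokenized and requiring every term in the query.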
> -----Original Message-----
> From: Martin van den Bemt [mailto:mllist@mvdb.net]
> Sent: Thursday, March 18, 2004 10:04 AM
> To: lucene-user@jakarta.apache.org
> Subject: Using lucene as search engine to replace sql queries
>
> Hi everyone,
>
> We have a database where one table contains an id and a name
> (like 1,Human 2,Man 3,Dutch).
> The second table contains an id, a relationId and a reference to the
> previous table.
> So if a relation with id 1 is Human, is a Man and is Dutch, the table
> will contain
> 1, 1, 1
> 2, 1, 2
> 3, 1, 3
> You probably get the picture.
>
> We want to be able to, e.g., query all humans that are Dutch (we don't
> care if it is a man or woman). Using sql can be quite horrific for
> these kinds of queries.
>
> I thought that maybe Lucene could do the indexing for us instead of the
> sql database.
>
> It is not uncommon that the table with names contains about 3 million
> entries, and the relationship table can be a multiple of 3 million.
>
> The questions are:
>
> - Is Lucene capable of handling huge amounts of data?
> - The result always must be EXACT. So no fuzzy stuff. If it has the
>   keywords in the index, show it, else never show it (so a query for
>   Human, Man, Dutch should not return any people from Belgium). (I
>   assume this should be possible)
> - Besides the index information we want to be able to store some extra
>   data (like a description) that can be used to create an object we can
>   use in our system.
> - Is Lucene the way to go for this use scenario?
>
> Thanx for your help.
>
> --
> Mvgr,
> Martin

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org