Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@apache.org Received: (qmail 91308 invoked from network); 28 May 2003 18:00:50 -0000 Received: from exchange.sun.com (192.18.33.10) by daedalus.apache.org with SMTP; 28 May 2003 18:00:50 -0000 Received: (qmail 20117 invoked by uid 97); 28 May 2003 18:03:04 -0000 Delivered-To: qmlist-jakarta-archive-lucene-user@nagoya.betaversion.org Received: (qmail 20110 invoked from network); 28 May 2003 18:03:03 -0000 Received: from daedalus.apache.org (HELO apache.org) (208.185.179.12) by nagoya.betaversion.org with SMTP; 28 May 2003 18:03:03 -0000 Received: (qmail 90775 invoked by uid 500); 28 May 2003 18:00:43 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 90692 invoked from network); 28 May 2003 18:00:42 -0000 Received: from mail12.atl.registeredsite.com (64.224.219.86) by daedalus.apache.org with SMTP; 28 May 2003 18:00:42 -0000 Received: from netmail.mail.registeredsite.com ([216.122.69.17]) by mail12.atl.registeredsite.com (8.12.8/8.12.6) with ESMTP id h4SH0oqe014936 (version=TLSv1/SSLv3 cipher=EDH-RSA-DES-CBC3-SHA bits=168 verify=NOT) for ; Wed, 28 May 2003 13:00:56 -0400 Received: (qmail 85521 invoked by uid 89); 28 May 2003 18:01:37 -0000 Received: from unknown (HELO ehatchersolutions.com) (128.143.103.27) by netmail.registeredsite.com with SMTP; 28 May 2003 18:01:37 -0000 Date: Wed, 28 May 2003 14:00:35 -0400 Subject: Re: Wildcard workaround Content-Type: text/plain; charset=US-ASCII; format=flowed Mime-Version: 1.0 (Apple Message framework v552) From: Erik Hatcher To: "Lucene Users List" Content-Transfer-Encoding: 7bit In-Reply-To: <3ED4DCBA.60104@sundayta.com> Message-Id: <48445F0C-9136-11D7-B239-000393A564E6@ehatchersolutions.com> X-Mailer: Apple Mail (2.552) X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N The problem with that solution is the same as what the other thread about OutOfMemory is discussing with wildcard queries. Just prefixing something with a fixed query to 'hack' a wildcard query could lead to performance/memory issues. I recommend indexing the file extension (or mime type) as a separate field. Erik On Wednesday, May 28, 2003, at 11:58 AM, David Warnock wrote: > Aviran, > >> You can also index the file names with a leading character. For >> instance >> index "file1.exe" will be indexed as "_file1.exe" and always add the >> leading character to the search term. >> So if the user input is "*.exe" your query should be "_*.exe" and if >> the >> user input "fi*" you'll change it to "_fi*" > > Now that is a much neater solution than mine. Does this cause problems > with unbalanced indexes or anything as all terms begin with the same > character? > > Dave > -- > David Warnock, Sundayta Ltd. http://www.sundayta.com > iDocSys for Document Management. VisibleResults for Fundraising. > Development and Hosting of Web Applications and Sites. > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org > For additional commands, e-mail: lucene-user-help@jakarta.apache.org > --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org