Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@apache.org Received: (qmail 44875 invoked from network); 28 May 2003 14:34:25 -0000 Received: from exchange.sun.com (192.18.33.10) by daedalus.apache.org with SMTP; 28 May 2003 14:34:25 -0000 Received: (qmail 13626 invoked by uid 97); 28 May 2003 14:36:38 -0000 Delivered-To: qmlist-jakarta-archive-lucene-user@nagoya.betaversion.org Received: (qmail 13619 invoked from network); 28 May 2003 14:36:37 -0000 Received: from daedalus.apache.org (HELO apache.org) (208.185.179.12) by nagoya.betaversion.org with SMTP; 28 May 2003 14:36:37 -0000 Received: (qmail 41000 invoked by uid 500); 28 May 2003 14:33:21 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 40935 invoked from network); 28 May 2003 14:33:20 -0000 Received: from mailer.syr.edu (128.230.18.29) by daedalus.apache.org with SMTP; 28 May 2003 14:33:20 -0000 Received: from syr.edu (syru38-212.syr.edu) by mailer.syr.edu (LSMTP for Windows NT v1.1b) with SMTP id <0.05384F20@mailer.syr.edu>; Wed, 28 May 2003 10:33:22 -0400 Message-ID: <3ED4C8C4.4030606@syr.edu> Date: Wed, 28 May 2003 10:33:40 -0400 From: Steve Rowe User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.0; en-US; rv:1.0.2) Gecko/20021120 Netscape/7.01 X-Accept-Language: en-us, en MIME-Version: 1.0 To: Lucene Users List Subject: Re: Wildcard workaround References: <336674949053D511BF4D00A0C9B3D3649EAF0D@SONIA> Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N Andrei, If this sort of thing is important enough, you could implement a customized analyzer which would reverse all terms and store them in a separate field (in addition to the a non-reversed field). This will double your index size, of course. Then, when searching, suffix query terms (those beginning with '*') could be reversed and wildcard-searched on the reverse-term field. Hope it helps, Steve Rowe Andrei Melis wrote: > Hello all, > > As far as I have understood, lucene does not allow search queries starting > with wildcards. > I have a file database indexed by content and also by filename. It would be > nice if the user could perform a usual search like "*.ext". > > Anybody tried a workaround for this issue ? ( this is needed only for the > name of the file, for the rest of the terms the rules are fine with me) > > I assume that this will produce a decrease of search speed... is this > correct ? > > > 10q, > Andrei > -- Steve Rowe Software Engineer Center for Natural Language Processing School of Information Studies Syracuse University www.cnlp.org --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org