Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 80858 invoked from network); 14 Apr 2006 15:20:31 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 14 Apr 2006 15:20:31 -0000 Received: (qmail 52460 invoked by uid 500); 14 Apr 2006 15:20:21 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 52409 invoked by uid 500); 14 Apr 2006 15:20:21 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 52379 invoked by uid 99); 14 Apr 2006 15:20:21 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 14 Apr 2006 08:20:21 -0700 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (asf.osuosl.org: domain of findmath@gmail.com designates 64.233.184.238 as permitted sender) Received: from [64.233.184.238] (HELO wproxy.gmail.com) (64.233.184.238) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 14 Apr 2006 08:20:20 -0700 Received: by wproxy.gmail.com with SMTP id 55so567391wri for ; Fri, 14 Apr 2006 08:20:00 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=K073BZoT8BbjVNLKX2xLZOZ0JIjBB3XTdpkQcXd66Feg8aEDHS9q0Fi9GeOicrY3nfu2NfDVKWSSfZwLDkaNEgOY3vkxyUGi0/FjOHptTZ8fpFW9iHZVLqoRX2vjLAIWP0Y8Fq5lnHavkVV/mgwJGRgJ5WhFwoU769+OP1N/4b4= Received: by 10.65.126.6 with SMTP id d6mr984328qbn; Fri, 14 Apr 2006 08:19:59 -0700 (PDT) Received: by 10.65.248.6 with HTTP; Fri, 14 Apr 2006 08:19:59 -0700 (PDT) Message-ID: Date: Fri, 14 Apr 2006 10:19:59 -0500 From: "Rajesh Munavalli" To: java-user@lucene.apache.org Subject: Re: Syntax help In-Reply-To: MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_28266_7717975.1145027999643" References: X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N ------=_Part_28266_7717975.1145027999643 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Content-Disposition: inline On 4/14/06, Bill Snyder wrote: > > AHA! I am using the Search tab and have enteres the query : > > location:Z:\install\logs\archive.log.D20060406.T141958 > > the query details says the query was parsed to > > location:z > > so if I escape the colon I see the new parsed query as > > location:"z installlogsarchive.log.d20060406.t141958" > > So, lucence does not store the file path exactly?! It converts it all > lower > case! Is there some property I should turn on? In the StandardAnalyzer, the LowerCaseFilter converts everything into lower case. You can skip that step. Plus, it is not storing the backslash. Should I be escaping these in the > index before storing them? It seems so. Yes -Bill On 4/14/06, Bill Snyder wrote: > > Oh, cool. Look at that. A neat tool made with thinlets. I had not heard o= f > this...I'll see if it helps me figure out whats going on. > > --Bill > > > On 4/14/06, Rajesh Munavalli wrote: > > > > It would be helpful to download Luke (http://www.getopt.org/luke/) and > > analyze whats getting indexed. Have you tried that? > > > > On 4/14/06, Bill Snyder < wsnyder6@gmail.com> wrote: > > > > > > Hello, > > > > > > We am using Lucene to facilitate searching of our applications log > > files. > > > I > > > am noticing some inconsistencies in result sets when searching on > > certain > > > fields. > > > > > > One field we index is the file path. I am using a simple query like > > > "location:Z:\logs\someLogFile.log". However, I can never get path > > searches > > > like this to come back with any results. Tried escaping the > > backslashes > > > and > > > colon. Nothing seems to work. I missing something here in my syntax? > > > > > > We also index the file name. However, on file names that have mixed > > case > > > or > > > multiple extensions (logfile.D20060303.T234234 ) I cannot get results > > > either. > > > Weird. > > > > > > I haven't worked with Lucene very long, so I expect I am missing > > something > > > simple here. > > > > > > If you need more info, let me know! > > > Many Thanks! > > > > > > --Bill > > > > > > > > > > > ------=_Part_28266_7717975.1145027999643--