Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 12675 invoked from network); 20 Jun 2007 11:53:55 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 20 Jun 2007 11:53:55 -0000 Received: (qmail 14157 invoked by uid 500); 20 Jun 2007 11:53:49 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 14109 invoked by uid 500); 20 Jun 2007 11:53:48 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 14098 invoked by uid 99); 20 Jun 2007 11:53:48 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 20 Jun 2007 04:53:48 -0700 X-ASF-Spam-Status: No, hits=2.0 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (herse.apache.org: domain of rsk.sen@gmail.com designates 209.85.146.181 as permitted sender) Received: from [209.85.146.181] (HELO wa-out-1112.google.com) (209.85.146.181) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 20 Jun 2007 04:53:44 -0700 Received: by wa-out-1112.google.com with SMTP id j40so87019wah for ; Wed, 20 Jun 2007 04:53:24 -0700 (PDT) DKIM-Signature: a=rsa-sha1; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=t0novFixz+ZIi2T+vfw/vCfGOwcTvVYZBgs2ImQUMyfhU0ybrnicy4TW7uMKneVOh+GPZKhPuP3/jp8KhuRpkBEdtkhKNv+fgD8Pw8qWwA3dioqwjjQNHJQfFjEiwQMlXCZSxi6x3Q/TGoUHPJqmg1jh2N9J0JqK7dsU1w+EeSY= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=NiM6bJGzY9RO7a8blSszI0vfVZ2sBEBRFyKRnXdUz0P9C1bPlvjravnWkYnR4QGM0HTsPIW+KSo9LS/11q0xeFg+EA/f7KX5nXyXhPr370OAIROW5e/ByZNK9R+FG3nH5JRXMNBKig1uUznQTwmu9c+US6pD4VdV78KXRmxRfIw= Received: by 10.115.108.1 with SMTP id k1mr228258wam.1182340403884; Wed, 20 Jun 2007 04:53:23 -0700 (PDT) Received: by 10.114.94.15 with HTTP; Wed, 20 Jun 2007 04:53:23 -0700 (PDT) Message-ID: <4628d2a90706200453v7bbc3151p6b9621463404d1e7@mail.gmail.com> Date: Wed, 20 Jun 2007 17:23:23 +0530 From: "SK R" To: java-user@lucene.apache.org Subject: Re: zero termfreq for some search strings with special characters In-Reply-To: MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_95961_31895218.1182340403863" References: <4628d2a90706200224w40a4d1a7pf91ac585aae6ea99@mail.gmail.com> X-Virus-Checked: Checked by ClamAV on apache.org ------=_Part_95961_31895218.1182340403863 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Content-Disposition: inline Hi, Thanks for your reply. But how do I get termfreq of that term("emp-id")? Does Lucene have any other way to handle this? I appreciate any solution regarding this problem. Regards SenthilKumaran On 6/20/07, Liu_Andy2@emc.com wrote: > > You are right! > "emp-id" will be separated to two terms CONTENT:"emp" CONTENT:"id" by > standard tokenizer for indexing and searching. But direct writing term > (CONTENT:"emp-id") will not. > > Andy > > -----Original Message----- > From: SK R [mailto:rsk.sen@gmail.com] > Sent: Wednesday, June 20, 2007 5:24 PM > To: java-user@lucene.apache.org > Subject: zero termfreq for some search strings with special characters > > Hi, > I'm using standard tokenizer for both indexing and searching > process.Myindexed value is like "emp-id Aq234 kaith creating document > for search". > I can get search results for the query CONTENT:"emp-id" by using > hits = > indexSearcher.search(*query*). > But if I try to get termfrequency of that term (CONTENT:"emp-id") by > using indexreader.termdocs(new Term("CONTENT","emp-id")).freq() , 0 > results > returned. > I think because of the analyzer I can get result in 1st case but > absence > of analyzer I can't get result in 2nd case (term freq). Is it right? > How do i get correct term frequency for that term? > > > Thanks & Regards > RSK > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > > ------=_Part_95961_31895218.1182340403863--