Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 22415 invoked from network); 6 Dec 2010 13:42:52 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 6 Dec 2010 13:42:52 -0000 Received: (qmail 71987 invoked by uid 500); 6 Dec 2010 13:42:50 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 71796 invoked by uid 500); 6 Dec 2010 13:42:49 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Delivered-To: moderator for java-user@lucene.apache.org Received: (qmail 14321 invoked by uid 99); 6 Dec 2010 12:54:22 -0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) From: Ranjit Kumar To: "java-user@lucene.apache.org" Subject: lucene3.0.2: getting incorrect no. of occurrence in file Thread-Topic: lucene3.0.2: getting incorrect no. of occurrence in file Thread-Index: AcuVROWCkK6Ji6sBRzaCPbeaGaxyHw== Date: Mon, 6 Dec 2010 12:55:45 +0000 Message-ID: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: yes X-MS-TNEF-Correlator: x-cr-hashedpuzzle: K/Q= ADfl BQ63 BTOH CILn CKPT EprP FEfk F2gV GER3 Haok Iyyv JWuj Jot8 KC84 Kd6c;1;agBhAHYAYQAtAHUAcwBlAHIAQABsAHUAYwBlAG4AZQAuAGEAcABhAGMAaABlAC4AbwByAGcA;Sosha1_v1;7;{B2BD3BCA-01A2-49A8-AB2B-AC218CA668C4};cgBhAG4AagBpAHQALgBrAHUAbQBhAHIAQABvAHQAcwBzAG8AbAB1AHQAaQBvAG4AcwAuAGMAbwBtAA==;Mon, 06 Dec 2010 12:55:45 GMT;bAB1AGMAZQBuAGUAMwAuADAALgAyADoAIABnAGUAdAB0AGkAbgBnACAAaQBuAGMAbwByAHIAZQBjAHQAIABuAG8ALgAgAG8AZgAgAG8AYwBjAHUAcgByAGUAbgBjAGUAIABpAG4AIABmAGkAbABlAA== x-cr-puzzleid: {B2BD3BCA-01A2-49A8-AB2B-AC218CA668C4} Content-Type: multipart/related; boundary="_004_AD21521D44870544AC66F4FDFC0A474C0FC8B68CMAILotssolution_"; type="multipart/alternative" MIME-Version: 1.0 --_004_AD21521D44870544AC66F4FDFC0A474C0FC8B68CMAILotssolution_ Content-Type: multipart/alternative; boundary="_000_AD21521D44870544AC66F4FDFC0A474C0FC8B68CMAILotssolution_" --_000_AD21521D44870544AC66F4FDFC0A474C0FC8B68CMAILotssolution_ Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Hi, I am facing same problem with lucen3.0.2 search. I am using StandardAnalyz= er to create index. IndexWriter writer =3D new IndexWriter(FSDirectory.open(INDEX_DIR), new Sta= ndardAnalyzer(Version.LUCENE_CURRENT), true, new IndexWriter.MaxFieldLength= (1000000)); on the other hand for search I am using same analyser. In case of term quer= y(ie; query having single word) result is fine and gives correct no of occu= rrence in the file searched. When using phrase query (ie; multiTerm query like sql server) it gives fi= le that contents sql server (no. of docoments) but do not gives correct no= of occurrence in each document. If you have any solution plz.. help me out. Thanks & Regards, Ranjit Kumar Associate Software Engineer [cid:image002.jpg@01CB7089.C0069B40] US: +1 408.540.0001 UK: +44 208.099.1660 India: +91 124.474.8100 | +91 124.410.1350 FAX: +1 408.516.9050 http://www.otssolutions.com =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D Pr= ivate, Confidential and Privileged. This e-mail and any files and attachmen= ts transmitted with it are confidential and/or privileged. They are intende= d solely for the use of the intended recipient. The content of this e-mail = and any file or attachment transmitted with it may have been changed or alt= ered without the consent of the author. If you are not the intended recipie= nt, please note that any review, dissemination, disclosure, alteration, pri= nting, circulation or Transmission of this e-mail and/or any file or attach= ment transmitted with it, is prohibited and may be unlawful. If you have re= ceived this e-mail or any file or attachment transmitted with it in error p= lease notify OTS Solutions at info@otssolutions.com =3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D --_000_AD21521D44870544AC66F4FDFC0A474C0FC8B68CMAILotssolution_ Content-Type: text/html; charset="us-ascii" Content-Transfer-Encoding: quoted-printable

Hi,

I am facing same problem = with lucen3.0.2  search. I am using StandardAnalyzer to create index.

IndexWriter writer =3D new IndexWriter(FSDirectory.open(INDEX_DIR), new St= andardAnalyzer(Version.LUCENE_CURRENT), true, new IndexWriter.MaxFieldLengt= h(1000000));

on the o= ther hand for search I am using same analyser. In case of term query(ie; qu= ery having single word) result is fine and gives correct no of occurrence i= n the file searched.

When usi= ng  phrase query (ie;  multiTerm query like sql server) it gives file that contents  sql server (no. of = docoments) but do not gives correct no of occurrence in each document.

If you have any solution plz.. help me out<= /b>.

 

 

Thanks & Regards,

Ranjit Kumar        &= nbsp;           &nbs= p; 

Associate Software Engineer

 

3D"cid:image002.jpg@01CB7089.C0069B=

 

US:       +1 408.540.0001

UK:       +44 208.099.1660=

India:   +91 124.474.8100 | +91 124.410.1350

FAX:=   &n= bsp; +1 408.516.9050

http://www.otssolutions.com

 

=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D Pr= ivate, Confidential and Privileged. This e-mail and any files and attachmen= ts transmitted with it are confidential and/or privileged. They are intende= d solely for the use of the intended recipient. The content of this e-mail and any = file or attachment transmitted with it may have been changed or altered wit= hout the consent of the author. If you are not the intended recipient, plea= se note that any review, dissemination, disclosure, alteration, printing, circulation or Transmission of this e-ma= il and/or any file or attachment transmitted with it, is prohibited and may= be unlawful. If you have received this e-mail or any file or attachment tr= ansmitted with it in error please notify OTS Solutions at info@otssolutions.com =3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D --_000_AD21521D44870544AC66F4FDFC0A474C0FC8B68CMAILotssolution_-- --_004_AD21521D44870544AC66F4FDFC0A474C0FC8B68CMAILotssolution_--