Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 81868 invoked from network); 8 Aug 2005 11:34:52 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 8 Aug 2005 11:34:52 -0000 Received: (qmail 53949 invoked by uid 500); 8 Aug 2005 11:34:49 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 53726 invoked by uid 500); 8 Aug 2005 11:34:48 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 53713 invoked by uid 99); 8 Aug 2005 11:34:48 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 08 Aug 2005 04:34:48 -0700 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: pass (asf.osuosl.org: local policy) Received: from [203.199.26.74] (HELO daakghar.controlnet.co.in) (203.199.26.74) by apache.org (qpsmtpd/0.29) with SMTP; Mon, 08 Aug 2005 04:35:10 -0700 Received: from kartik1 ([192.168.4.1]) by dakiya.controlnet.co.in (Netscape Messaging Server 4.15) with ESMTP id IKWJRR00.NJX for ; Mon, 8 Aug 2005 17:25:03 +0530 From: "Karthik N S" To: Subject: RE: Reply Split Search Word Date: Mon, 8 Aug 2005 17:14:01 +0530 Message-ID: MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit X-Priority: 3 (Normal) X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook IMO, Build 9.0.2416 (9.0.2910.0) In-Reply-To: Importance: Normal X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2600.0000 X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N Hi Erik I would like to reformat the Question slightly , Words without double Quotes may also be present in the String. Also I have to apply the STOP - Analyzer to filter out common English words appearing within. Do u mind giving me a bit of src hint for the same... [ I am googled out of ideas ] with regards karthik -----Original Message----- From: Erik Hatcher [mailto:erik@ehatchersolutions.com] Sent: Monday, August 08, 2005 4:23 PM To: java-user@lucene.apache.org Subject: Re: Reply Split Search Word To have an analyzer split that string into 1-5 as you have listed will require you write a custom Analyzer to tokenize with double quotes in mind like that. Erik On Aug 8, 2005, at 12:06 AM, Karthik N S wrote: > Hi > > Luceners > > Apologies..... > > As I have already replied,Using Analysis I have tried on all Analyzers > (including Standard Analyzer) > But not able to achive the required COMPLETS WORD Split. > > My I/p String would be a lengthy one as below > > String sKey = "\"" + "Dough Cutting" + "\"" + " " + "Otis > Gospodnetic" + > " " + > "\"" + "Erik Hatcher" + "\"" + " " + "Authors of > " + "\"" + > "Lucene In Action" + "\""; > > The required split of complete words should return > > 1) "Dough Cutting" > 2) Otis Gospodnetic > 3) "Erik Hatcher" > 4) Authors of > 5) "Lucene In Action" > > Plz Note :- Words with "\"" are complete split words.... > > I am shure some Analyzer code inside Lucene is handling this task. > > > som how can one achive this task.. > > with regards > Karthik > > -----Original Message----- > From: Mordo, Aviran (EXP N-NANNATEK) [mailto:aviran.mordo@lmco.com] > Sent: Friday, August 05, 2005 7:58 PM > To: java-user@lucene.apache.org > Subject: RE: Split Search Word > > > The StandardAnalyzer should work just fine with it, It will break the > search string to 5 search terms. > > HTH > > Aviran > http://www.aviransplace.com > > _____ > > From: Karthik N S [mailto:karthik@controlnet.co.in] > Sent: Friday, August 05, 2005 1:57 AM > To: LUCENE > Subject: Split Search Word > > > > Hi Luceners > > Apologies..... > > I have along Search String as given below... > > > > SearchWord = "\"" + "Dough Cutting" + "\"" + " " + "Otis > Gospodnetic" + " " + "\"" + "Erik Hatcher" + "\"" + " " + > "Authors of " + "\"" + "Lucene In Action" > +"\""; > > And prior to searching the Index ,I need the Words to be Split. > > SearchWord = > > 1) "\"" + "Dough Cutting" + "\"" > 2) "Otis Gospodnetic" > 3) "\"" + "Erik Hatcher" + "\"" > 4) "Authors of " > 5) "\"" +"Lucene In Action" +"\"" > > I am shure some Analyzer within Lucene is performin the task. > So some body please Tell me Howto > > [ I already used Analysis/Paralysis code to check ,but no help ] > > > > > WITH WARM REGARDS > HAVE A NICE DAY > [ N.S.KARTHIK] > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org