From: "Natarajan.T"
To: "'Lucene Users List'"
Subject: RE: Search Help in word doc
Date: Tue, 19 Oct 2004 15:39:50 +0530

Ok, Thanks a lot...

-----Original Message-----
From: Cocula Remi [mailto:rcocula@sopragroup.com]
Sent: Tuesday, October 19, 2004 3:14 PM
To: Lucene Users List
Subject: RE: Search Help in word doc

In my case, at search time. But the best approach is probably to do it at indexing time.

-----Original Message-----
From: Natarajan.T [mailto:natarajant@crimsonlogic.co.in]
Sent: Tuesday, October 19, 2004 11:41
To: 'Lucene Users List'
Subject: RE: Search Help in word doc

Are you doing this in the indexing part or the search part?

-----Original Message-----
From: Cocula Remi [mailto:rcocula@sopragroup.com]
Sent: Tuesday, October 19, 2004 2:37 PM
To: Lucene Users List
Subject: RE: Search Help in word doc

This sample code changes undesired characters into underscores.

    Document doc = ....
    char[] cs = doc.get("content").toCharArray();
    StringBuffer sb = new StringBuffer();
    // Copy printable characters as-is; replace each ISO control character with " _ "
    for (int j = 0; j < cs.length; j++) {
        if (!Character.isISOControl(cs[j])) {
            sb.append(cs[j]);
        } else {
            sb.append(" _ ");
        }
    }
    System.out.println(sb.toString());

-----Original Message-----
From: Natarajan.T [mailto:natarajant@crimsonlogic.co.in]
Sent: Tuesday, October 19, 2004 11:06
To: 'Lucene Users List'
Subject: RE: Search Help in word doc

Hi Remi,

Thanks for your response. Please send me the jar name with sample code.

Thanks,
Natarajan.

-----Original Message-----
From: Cocula Remi [mailto:rcocula@sopragroup.com]
Sent: Tuesday, October 19, 2004 2:26 PM
To: Lucene Users List
Subject: RE: Search Help in word doc

I have seen that. I use the Character.isISOControl() function to identify and remove these characters.

-----Original Message-----
From: Natarajan.T [mailto:natarajant@crimsonlogic.co.in]
Sent: Tuesday, October 19, 2004 10:37
To: lucene-user@jakarta.apache.org
Subject: Search Help in word doc

Hi FFI,

I am indexing multiple document types (Word, Excel, HTML, PPT, PDF). At indexing time there is no problem.

The content (description) in my search results shows small boxes. This happens only with Word documents.

I think this is caused by special characters such as bullets and symbols.

How can I rectify this problem?

Regards,
Natarajan.

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org
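
The approach discussed in the thread above (filtering with Character.isISOControl(), preferably at indexing time) could be packaged as a small reusable helper. The following is only a minimal sketch under stated assumptions: the class name ControlCharCleaner and the method clean are hypothetical and not part of Lucene, and keeping tab, line feed and carriage return while replacing other control characters with a single space (rather than " _ ") is a choice made here for illustration, not something the thread prescribes.

```java
// Hypothetical helper, not part of Lucene: strips ISO control characters
// from extracted text before it is indexed or displayed.
final class ControlCharCleaner {

    private ControlCharCleaner() {
        // Static utility class; not meant to be instantiated.
    }

    /**
     * Returns a copy of the text in which every character reported by
     * Character.isISOControl is replaced by a single space. Tab, line feed
     * and carriage return are kept (an assumption of this sketch).
     * A null input yields an empty string.
     */
    static String clean(String text) {
        if (text == null) {
            return "";
        }
        StringBuilder sb = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            boolean isLayoutWhitespace = c == '\t' || c == '\n' || c == '\r';
            if (isLayoutWhitespace || !Character.isISOControl(c)) {
                sb.append(c);   // ordinary character: keep unchanged
            } else {
                sb.append(' '); // control character: replace with a space
            }
        }
        return sb.toString();
    }
}
```

With such a helper, the text extracted from a Word document would be passed through ControlCharCleaner.clean(...) before the field is added to the index, or, as in the search-time variant from the thread, the stored value returned by doc.get("content") would be cleaned just before it is displayed.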