Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@www.apache.org Received: (qmail 97068 invoked from network); 29 Mar 2004 06:43:48 -0000 Received: from daedalus.apache.org (HELO mail.apache.org) (208.185.179.12) by minotaur-2.apache.org with SMTP; 29 Mar 2004 06:43:48 -0000 Received: (qmail 49152 invoked by uid 500); 29 Mar 2004 06:43:21 -0000 Delivered-To: apmail-jakarta-lucene-user-archive@jakarta.apache.org Received: (qmail 49128 invoked by uid 500); 29 Mar 2004 06:43:21 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 49098 invoked from network); 29 Mar 2004 06:43:19 -0000 Received: from unknown (HELO idlewild.ccnep.com.np) (202.51.64.130) by daedalus.apache.org with SMTP; 29 Mar 2004 06:43:19 -0000 Received: from chandan ([202.51.64.153]) by idlewild.ccnep.com.np (8.12.5/8.12.5) with SMTP id i2T7U0ZW018846 for ; Mon, 29 Mar 2004 13:15:10 +0545 Message-ID: <005901c41559$223f8f80$2403a8c0@chandan> From: "Chandan Tamrakar" To: "Lucene Users List" References: <4067B907.1010908@newsmonster.org> Subject: PDF indexing with CJKAnalyzer Date: Mon, 29 Mar 2004 12:28:21 +0545 MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit X-Priority: 3 X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook Express 5.50.4922.1500 X-MimeOLE: Produced By Microsoft MimeOLE V5.50.4925.2800 X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N Im using a pdfbox library for indexing a PDF documents , but it doesnt not extract japanese characters. I have also tried a latest release of PDFBox as suggested on mailing list but it doenst work well unfortunately. Have anyone tried indexing a PDF document with other than english characters ? Pls. suggest thnaks, --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org