Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@apache.org Received: (qmail 20283 invoked from network); 14 Oct 2002 09:32:36 -0000 Received: from unknown (HELO nagoya.betaversion.org) (192.18.49.131) by daedalus.apache.org with SMTP; 14 Oct 2002 09:32:36 -0000 Received: (qmail 10788 invoked by uid 97); 14 Oct 2002 09:33:33 -0000 Delivered-To: qmlist-jakarta-archive-lucene-user@jakarta.apache.org Received: (qmail 10634 invoked by uid 97); 14 Oct 2002 09:33:30 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 10584 invoked by uid 98); 14 Oct 2002 09:33:28 -0000 X-Antivirus: nagoya (v4218 created Aug 14 2002) Message-ID: From: Vinod Bhagat To: 'Lucene Users List' Subject: Extracting Complete Text from PDF using Lucene and JPEDAL!!!! Date: Mon, 14 Oct 2002 05:26:32 -0400 MIME-Version: 1.0 X-Mailer: Internet Mail Service (5.5.2653.19) Content-Type: text/plain X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N Dear People I am using Lucene and one of the requirement is to index PDF. I am using JPEDAL's API to extract text from PDF. Till now i manage to get the text of the first page, I am using the ExtractTextObject.java class to do the above. But i want to extract the complete text of the PDF file. Have anyone done this and possible could guide me towards it. Appritiate for your positive and quick reply. Cheers Vin. -- To unsubscribe, e-mail: For additional commands, e-mail: