Return-Path: X-Original-To: apmail-pdfbox-users-archive@www.apache.org Delivered-To: apmail-pdfbox-users-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 9CF429713 for ; Tue, 6 Mar 2012 07:35:02 +0000 (UTC) Received: (qmail 32134 invoked by uid 500); 6 Mar 2012 07:35:02 -0000 Delivered-To: apmail-pdfbox-users-archive@pdfbox.apache.org Received: (qmail 31884 invoked by uid 500); 6 Mar 2012 07:34:57 -0000 Mailing-List: contact users-help@pdfbox.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@pdfbox.apache.org Delivered-To: mailing list users@pdfbox.apache.org Received: (qmail 31872 invoked by uid 99); 6 Mar 2012 07:34:57 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 06 Mar 2012 07:34:57 +0000 X-ASF-Spam-Status: No, hits=4.7 required=5.0 tests=FREEMAIL_FORGED_REPLYTO,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [98.139.213.139] (HELO nm27-vm0.bullet.mail.bf1.yahoo.com) (98.139.213.139) by apache.org (qpsmtpd/0.29) with SMTP; Tue, 06 Mar 2012 07:34:47 +0000 Received: from [98.139.212.146] by nm27.bullet.mail.bf1.yahoo.com with NNFMP; 06 Mar 2012 07:34:26 -0000 Received: from [98.139.212.204] by tm3.bullet.mail.bf1.yahoo.com with NNFMP; 06 Mar 2012 07:34:26 -0000 Received: from [127.0.0.1] by omp1013.mail.bf1.yahoo.com with NNFMP; 06 Mar 2012 07:34:26 -0000 X-Yahoo-Newman-Property: ymail-3 X-Yahoo-Newman-Id: 685736.21001.bm@omp1013.mail.bf1.yahoo.com Received: (qmail 53045 invoked by uid 60001); 6 Mar 2012 07:34:26 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s1024; t=1331019266; bh=uYSHKncYR2kLq9OMl3qeubC7wDU3l9zMJoLzzUTGB0U=; h=X-YMail-OSG:Received:X-Mailer:References:Message-ID:Date:From:Reply-To:Subject:To:In-Reply-To:MIME-Version:Content-Type; b=yl/UoAsTMKnzvnWU+Qmegej6INksJHI7a33vfoQpUHTc3CKf0rn01lTkJriTIE8C5c7/q2v2Jad04ARa/pjzuMh2GphWYmOxP6C2A0w8UIwRCnJR/IhPqzY1lmE1kB5xozSBPTnc+HiIsXiZiUkdbR0Ev8rlHXQsiS6Ds5W6Zo4= DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; h=X-YMail-OSG:Received:X-Mailer:References:Message-ID:Date:From:Reply-To:Subject:To:In-Reply-To:MIME-Version:Content-Type; b=WjgPgkAKLrZLOrMm+H/DDq0ZfGMVE+G3ZoWLsq484INVZR2hrdffItzfWxVJSW0tHM0cYXBPALph1cqvTIWIFzplog000u5E3zT1wgzNwzQRIuW8FB6HUAcFM16GmhgVyKi5uNu59u+iKt8ECFWKA9oTzR0mJd/NcFqUFEO2Wgg=; X-YMail-OSG: mdUi7xMVM1nuum5Eh9l0fq2HPAYZFyfN2yOXM3b793K.j7P cBWi85Hqy42yzWhwhCvJEkLKVW6BMjBA7xAUOpTjYwYyoGqLRYepbvs9w7n6 uhDAc4nub6Pfwf9ZKoiCAAyi3ZU25JPrfrLZ4jdU1zROCRd09ZdPjKM9QNGo el9DTkyTp6wqylOr_qiUFBoG9w5fbUXcNK7pQ9P8_1CzUbL.uYd9a47I7Pwm OY9zJz3siw_TQfRAnyTUalJHiWhPulUaPizoOEHhV_3z8uCyVeh5mTTPxmy1 OUiMI05LknN3k5CS4fMUUB9ZjWWxcrV4nH5XEyUESU0BKzkmWW97jNrCXMuk OsesFB71HLu5.37pjciwGD5HSY9cwHeQiDrz5DimAf2JPeYXG9tNc2PlzwHz 3EUsU.q_ZfmCWnZIhqj_KBKx.629B2LVIah3uJ56gWsGIR7MRqQ1eh7otacv S5mPBAFFJI4xL1qBNRLl16E_dRCbw.ncfbkCNQwMfGjI4F2sGq9CyNA7njU6 bMJlVIVYaNfCp9RUZlbApwXAFf9l0fJAmch2Avd08pCopA2gAR2gz6L.DyA7 Rp0vzg9UfWRZRPLNV_vHL0_KDSxtgXRBgMzJPWJ_OIuH2vvv_ZuMFtcidjsb wEN5UssInnEHnfIf35r4LfRxSFk1D5yRKzDLqQ97SvifteNbeMOPG Received: from [81.246.37.187] by web162003.mail.bf1.yahoo.com via HTTP; Mon, 05 Mar 2012 23:34:26 PST X-Mailer: YahooMailWebService/0.8.116.338427 References: <1331018982.48821.YahooMailNeo@web162003.mail.bf1.yahoo.com> Message-ID: <1331019266.39709.YahooMailNeo@web162003.mail.bf1.yahoo.com> Date: Mon, 5 Mar 2012 23:34:26 -0800 (PST) From: Shriram Reply-To: Shriram Subject: Extracting text between two bookmarks using Apache PdfBox To: "users@pdfbox.apache.org" In-Reply-To: <1331018982.48821.YahooMailNeo@web162003.mail.bf1.yahoo.com> MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="1705842018-214823291-1331019266=:39709" X-Virus-Checked: Checked by ClamAV on apache.org --1705842018-214823291-1331019266=:39709 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable I am using Apache PDFBox to read a PDF document which has a hierarchy, whic= h is defined by the bookmarks. The hierarchy is in a tree form with content= s only at the leaf level. When I try to extract the text between two leaf l= evel bookmarks(using Stripper.setStartBookmark(), Stripper.setEndBookmark()= and Stripper.writeText()), I get the text in the whole page instead. In sh= ort, my problem is similar to that mentioned in=A0http://www.java-forums.or= g/advanced-java/51032-pdox-1-6-0-extract-text-between-2-bookmarks-same-page= -sos.html=0A=0AIs there a way to extract the contents between two bookmarks= ? If so, what should be the change in my code? --1705842018-214823291-1331019266=:39709--