Return-Path: X-Original-To: apmail-pdfbox-users-archive@www.apache.org Delivered-To: apmail-pdfbox-users-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C047118686 for ; Fri, 4 Dec 2015 20:07:21 +0000 (UTC) Received: (qmail 12556 invoked by uid 500); 4 Dec 2015 20:07:21 -0000 Delivered-To: apmail-pdfbox-users-archive@pdfbox.apache.org Received: (qmail 12530 invoked by uid 500); 4 Dec 2015 20:07:21 -0000 Mailing-List: contact users-help@pdfbox.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@pdfbox.apache.org Delivered-To: mailing list users@pdfbox.apache.org Received: (qmail 12509 invoked by uid 99); 4 Dec 2015 20:07:21 -0000 Received: from Unknown (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 04 Dec 2015 20:07:21 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id B1BCEC128A for ; Fri, 4 Dec 2015 20:07:20 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 3.445 X-Spam-Level: *** X-Spam-Status: No, score=3.445 tagged_above=-999 required=6.31 tests=[HTML_MESSAGE=3, KAM_LAZY_DOMAIN_SECURITY=1, RCVD_IN_MSPIKE_H2=-0.001, RP_MATCHES_RCVD=-0.554] autolearn=disabled Received: from mx1-us-east.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id XBpuS0nEeaYX for ; Fri, 4 Dec 2015 20:07:19 +0000 (UTC) Received: from mailout11.t-online.de (mailout11.t-online.de [194.25.134.85]) by mx1-us-east.apache.org (ASF Mail Server at mx1-us-east.apache.org) with ESMTPS id 97DE642B80 for ; Fri, 4 Dec 2015 20:07:19 +0000 (UTC) Received: from fwd30.aul.t-online.de (fwd30.aul.t-online.de [172.20.26.135]) by mailout11.t-online.de (Postfix) with SMTP id 53EF31D4F89 for ; Fri, 4 Dec 2015 21:07:12 +0100 (CET) Received: from [192.168.2.104] (Srg10OZBwh+5REotHe96YLuMnZscSeGMlUDs8mtGI4lZFMMR7EyYKcEjXTrwXZ9gIg@[217.231.149.46]) by fwd30.t-online.de with (TLSv1.2:ECDHE-RSA-AES256-SHA encrypted) esmtp id 1a4wdJ-0zjTk00; Fri, 4 Dec 2015 21:07:09 +0100 Subject: Re: processPages bug? To: users@pdfbox.apache.org References: <5661ED9E.50803@t-online.de> From: Tilman Hausherr Message-ID: <5661F274.8070305@t-online.de> Date: Fri, 4 Dec 2015 21:07:16 +0100 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:38.0) Gecko/20100101 Thunderbird/38.4.0 MIME-Version: 1.0 In-Reply-To: Content-Type: multipart/alternative; boundary="------------080407070808000908080603" X-ID: Srg10OZBwh+5REotHe96YLuMnZscSeGMlUDs8mtGI4lZFMMR7EyYKcEjXTrwXZ9gIg X-TOI-MSGID: e852880c-f0cd-4cc0-9f9a-c30ed09a498c --------------080407070808000908080603 Content-Type: text/plain; charset=windows-1252; format=flowed Content-Transfer-Encoding: 7bit Am 04.12.2015 um 20:56 schrieb britt fitch: > Awesome, thanks. That takes care of #1 & 2. > > For #3, is the check on currentPageNo necessary? > Right now processPage must be called from processPages or nothing > happens. > This has a negative effect for cases like mine where I want to > override processTextPosition and handle different pages or even if you > only want to extract data from particular pages. You can set the start and endpage through the setters setStartPage() and setEndPage(). That's the official way to do it. Tilman --------------080407070808000908080603--