Return-Path: Delivered-To: apmail-pdfbox-users-archive@www.apache.org Received: (qmail 86969 invoked from network); 3 Nov 2010 00:53:17 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 3 Nov 2010 00:53:17 -0000 Received: (qmail 82608 invoked by uid 500); 3 Nov 2010 00:53:48 -0000 Delivered-To: apmail-pdfbox-users-archive@pdfbox.apache.org Received: (qmail 82587 invoked by uid 500); 3 Nov 2010 00:53:48 -0000 Mailing-List: contact users-help@pdfbox.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@pdfbox.apache.org Delivered-To: mailing list users@pdfbox.apache.org Received: (qmail 82578 invoked by uid 99); 3 Nov 2010 00:53:48 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 03 Nov 2010 00:53:48 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of mgravitt@me.com designates 17.148.16.103 as permitted sender) Received: from [17.148.16.103] (HELO asmtpout028.mac.com) (17.148.16.103) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 03 Nov 2010 00:53:40 +0000 MIME-version: 1.0 Content-type: text/plain; charset=iso-8859-1 Received: from [192.168.0.195] (cpe-174-097-189-102.nc.res.rr.com [174.97.189.102]) by asmtp028.mac.com (Sun Java(tm) System Messaging Server 6.3-7.04 (built Sep 26 2008; 64bit)) with ESMTPSA id <0LBA002WZAFY8L20@asmtp028.mac.com> for users@pdfbox.apache.org; Tue, 02 Nov 2010 17:52:48 -0700 (PDT) X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 spamscore=0 ipscore=0 suspectscore=1 phishscore=0 bulkscore=0 adultscore=0 classifier=spam adjust=0 reason=mlx engine=6.0.2-1004200000 definitions=main-1011020248 X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:5.2.15,1.0.148,0.0.0000 definitions=2010-11-02_12:2010-11-02,2010-11-02,1970-01-01 signatures=0 Subject: Re: IOException with PDFParser From: Max Gravitt In-reply-to: <4CD0AEDA.2050706@lehmi.de> Date: Tue, 02 Nov 2010 20:52:40 -0400 Content-transfer-encoding: quoted-printable Message-id: <95ABCCE1-03D9-41F7-8B56-6447EE9DABA2@me.com> References: <4CD0AEDA.2050706@lehmi.de> To: users@pdfbox.apache.org X-Mailer: Apple Mail (2.1081) Hi, I should have clarified the question. I am using this version because I = am running the library on Google App Engine and this is the version that = is compatible. If I can't make this older version compatible with the = new PDFs, is there a way to retrofit the most recent version to Google = App engine? thanks! MG On Nov 2, 2010, at 8:37 PM, Andreas Lehmk=FChler wrote: > Hi, >=20 > Am 03.11.10 01:32, schrieb Max Gravitt: >> Hi, >> I recently started to attempt to parse faxes that are PDF'd and sent = via email. I continually get the below exception with these types of = files. Does anyone have thoughts on the root cause and if there is any = workaround? >> thanks, >> MG >>=20 >> IOException >> expected=3D'endobj' firstReadAttempt=3D'' secondReadAttempt=3D'' = org.pdfbox.io.PushBackInputStream@d2f5f1 >> org.pdfbox.pdfparser.PDFParser; parseObject; 502 >> org.pdfbox.pdfparser.PDFParser; parse; 176 >> org.pdfbox.pdmodel.PDDocument; load; 707 >> com.josiejune.documentdispatch.models.Document$DocumentParser; = getPDFContents; 245 > According to the stack trace you're using a quite old (non-apache) = version of pdfbox. I suggest to update to a more recent version from [1] >=20 > BR > Andreas Lehmk=FChler >=20 >=20 > [1] hhtp://pdfbox.apache.org/download.html