Return-Path: X-Original-To: apmail-pdfbox-users-archive@www.apache.org Delivered-To: apmail-pdfbox-users-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3302F10665 for ; Thu, 26 Sep 2013 15:37:59 +0000 (UTC) Received: (qmail 97735 invoked by uid 500); 26 Sep 2013 15:37:58 -0000 Delivered-To: apmail-pdfbox-users-archive@pdfbox.apache.org Received: (qmail 97613 invoked by uid 500); 26 Sep 2013 15:37:55 -0000 Mailing-List: contact users-help@pdfbox.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@pdfbox.apache.org Delivered-To: mailing list users@pdfbox.apache.org Received: (qmail 97477 invoked by uid 99); 26 Sep 2013 15:37:53 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 26 Sep 2013 15:37:53 +0000 X-ASF-Spam-Status: No, hits=2.8 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS,URI_HEX X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of darcy.dechene@gmail.com designates 209.85.220.52 as permitted sender) Received: from [209.85.220.52] (HELO mail-pa0-f52.google.com) (209.85.220.52) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 26 Sep 2013 15:37:45 +0000 Received: by mail-pa0-f52.google.com with SMTP id kl14so1472346pab.11 for ; Thu, 26 Sep 2013 08:37:23 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=aNlSOKxf+LsjZ+biO93FwEf3pv1Xvet6aBs1re907U0=; b=lwlZL7srjAK0VMlouU5fR3n9GpYzSUF9qTCAMexp/fFaw93K8Zj2cr7rAtAZn25LQI Hqp7ir0hkl5m6093yJY+PcQsNNqZi1kXLlgS555iF/01K2xV1XbxxycZ/sgkrvZvtFwm tfCowwoqa2j8Nplk5wt+UJj8UYZCM3X+F/pQneCs+EAqGM3UQGO3i08vVjmxIgmWvu7h +SJ8yBNSXnvd8671fygrn933cagM30uifUmfWp9v55VjfHc449qgVAPMm9aHkgr2wIJx ZbicaXOasFq7fCVczCpzKQ4R2liMsnNDQpEETXgvUKIEYHjUjul+mweoL/67Z4AuIjjI EUcg== MIME-Version: 1.0 X-Received: by 10.66.25.70 with SMTP id a6mr6133085pag.68.1380209842972; Thu, 26 Sep 2013 08:37:22 -0700 (PDT) Received: by 10.66.19.225 with HTTP; Thu, 26 Sep 2013 08:37:22 -0700 (PDT) Date: Thu, 26 Sep 2013 09:37:22 -0600 Message-ID: Subject: PDFToImage results in a missing image From: Darcy Dechene To: users@pdfbox.apache.org Content-Type: multipart/mixed; boundary=bcaec5299283681beb04e74b25a1 X-Virus-Checked: Checked by ClamAV on apache.org --bcaec5299283681beb04e74b25a1 Content-Type: multipart/alternative; boundary=bcaec5299283681be704e74b259f --bcaec5299283681be704e74b259f Content-Type: text/plain; charset=ISO-8859-1 Hi, I'm using PDFToImage to convert PDFs to PNGs (version 1.8.2). When running the conversion for a particular PDF, I get the below Exception and part of the PDF (an image) is missing in the resulting PNG. Looking at the Exception there is probably a TIFF (contained in the PDF) that is somewhat malformed. However, I'm wondering if there is some sort of work around that can be used, to deal with the problem? I found this realy old discussion (from 2008) with a similar issue: http://itext-general.2136553.n4.nabble.com/Invalid-TIFF-Invalid-code-encountered-while-decoding-2D-group-4-compressed-data-td2159262.html Exception: java.lang.RuntimeException: Invalid code encountered while decoding 2D group 4 compressed data. at org.apache.pdfbox.filter.TIFFFaxDecoder.decodeT6(TIFFFaxDecoder.java:1073) at org.apache.pdfbox.filter.CCITTFaxDecodeFilter.decode(CCITTFaxDecodeFilter.java:126) at org.apache.pdfbox.cos.COSStream.doDecode(COSStream.java:295) at org.apache.pdfbox.cos.COSStream.doDecode(COSStream.java:245) at org.apache.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:172) at org.apache.pdfbox.pdmodel.graphics.xobject.PDCcitt.getRGBImage(PDCcitt.java:155) at org.apache.pdfbox.util.operator.pagedrawer.Invoke.process(Invoke.java:83) at org.apache.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:554) at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:268) at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:235) at org.apache.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:215) at org.apache.pdfbox.pdfviewer.PageDrawer.drawPage(PageDrawer.java:125) at org.apache.pdfbox.pdmodel.PDPage.convertToImage(PDPage.java:781) at org.apache.pdfbox.util.PDFImageWriter.writeImage(PDFImageWriter.java:133) at org.apache.pdfbox.PDFToImage.main(PDFToImage.java:244) at org.apache.pdfbox.PDFBox.main(PDFBox.java:96) I've attached the PDF and PNG, but I'm not sure if they will come through. Thanks, Darcy --bcaec5299283681be704e74b259f Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
Hi,

I'm using=A0PDFToImage =A0to co= nvert PDFs to PNGs (version=A01.8.2). =A0 When running the conversion for a= particular PDF, I get the below Exception and part of the PDF (an image) i= s missing in the resulting PNG.

Looking at the Exception there is probably a TIFF (cont= ained in the PDF) that is somewhat malformed. However, I'm wondering if= there is some sort of work=A0around=A0that can be used, to deal with the p= roblem?

I found this realy old=A0discussion (from 2008) with a= =A0similar=A0issue:

Exception:
java.lang.RuntimeExcept= ion: Invalid code encountered while decoding 2D group 4 compressed data.
at org.apache.pdfbox.filter.TIFFFaxDecoder.decodeT6(TIFFFaxDecoder.= java:1073)
at org.apache.pdfbox.filter.CCITTFaxDecodeFilter.decode(CCITTFaxDecode= Filter.java:126)
at org.apache.pdfbox.cos.COSStream.doDecode(COSS= tream.java:295)
at org.apache.pdfbox.cos.COSStream.doDecode(COSSt= ream.java:245)
at org.apache.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:= 172)
at org.apache.pdfbox.pdmodel.graphics.xobject.PDCcitt.getRGB= Image(PDCcitt.java:155)
at org.apache.pdfbox.util.operator.pagedr= awer.Invoke.process(Invoke.java:83)
at org.apache.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEng= ine.java:554)
at org.apache.pdfbox.util.PDFStreamEngine.processSu= bStream(PDFStreamEngine.java:268)
at org.apache.pdfbox.util.PDFSt= reamEngine.processSubStream(PDFStreamEngine.java:235)
at org.apache.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngin= e.java:215)
at org.apache.pdfbox.pdfviewer.PageDrawer.drawPage(Pa= geDrawer.java:125)
at org.apache.pdfbox.pdmodel.PDPage.convertToI= mage(PDPage.java:781)
at org.apache.pdfbox.util.PDFImageWriter.writeImage(PDFImageWriter.jav= a:133)
at org.apache.pdfbox.PDFToImage.main(PDFToImage.java:244)<= /div>
at org.apache.pdfbox.PDFBox.main(PDFBox.java:96)

I've attached the PDF and PNG, but I'm not sure if t= hey will come through.

Thanks,
Darcy
--bcaec5299283681be704e74b259f-- --bcaec5299283681beb04e74b25a1--