Return-Path: X-Original-To: apmail-pdfbox-users-archive@www.apache.org Delivered-To: apmail-pdfbox-users-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 54A0E180C6 for ; Wed, 29 Apr 2015 17:43:25 +0000 (UTC) Received: (qmail 94280 invoked by uid 500); 29 Apr 2015 17:43:25 -0000 Delivered-To: apmail-pdfbox-users-archive@pdfbox.apache.org Received: (qmail 94254 invoked by uid 500); 29 Apr 2015 17:43:25 -0000 Mailing-List: contact users-help@pdfbox.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@pdfbox.apache.org Delivered-To: mailing list users@pdfbox.apache.org Received: (qmail 94243 invoked by uid 99); 29 Apr 2015 17:43:24 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 29 Apr 2015 17:43:24 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: message received from 54.76.25.247 which is an MX secondary for users@pdfbox.apache.org) Received: from [54.76.25.247] (HELO mx1-eu-west.apache.org) (54.76.25.247) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 29 Apr 2015 17:42:59 +0000 Received: from mailout10.t-online.de (mailout10.t-online.de [194.25.134.21]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTPS id DCD6F2AABF for ; Wed, 29 Apr 2015 17:42:57 +0000 (UTC) Received: from fwd09.aul.t-online.de (fwd09.aul.t-online.de [172.20.27.151]) by mailout10.t-online.de (Postfix) with SMTP id 6AC365FF1 for ; Wed, 29 Apr 2015 19:42:51 +0200 (CEST) Received: from [192.168.2.102] (rC-rtrZ-8hBfJm9q9xg8+M5RSYmQEKMO0UvLhIjJ8IhOVhGpVCrWg16cSiXpLz6QFE@[217.231.145.104]) by fwd09.t-online.de with (TLSv1.2:ECDHE-RSA-AES256-SHA encrypted) esmtp id 1YnW0N-41z6cC0; Wed, 29 Apr 2015 19:42:39 +0200 Message-ID: <5541180E.5050607@t-online.de> Date: Wed, 29 Apr 2015 19:42:38 +0200 From: Tilman Hausherr Reply-To: users@pdfbox.apache.org User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:31.0) Gecko/20100101 Thunderbird/31.6.0 MIME-Version: 1.0 To: users@pdfbox.apache.org Subject: Re: Picture not printed References: <553FB0E3.9030804@t-online.de> <5540059E.4040801@t-online.de> <554070E6.4030603@t-online.de> <5540FF9C.1090901@t-online.de> In-Reply-To: <5540FF9C.1090901@t-online.de> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit X-ID: rC-rtrZ-8hBfJm9q9xg8+M5RSYmQEKMO0UvLhIjJ8IhOVhGpVCrWg16cSiXpLz6QFE X-TOI-MSGID: 09ec193b-fb82-4032-8097-c245494aa471 X-Virus-Checked: Checked by ClamAV on apache.org Hi, Done. You'll find a new version here: https://repository.apache.org/content/groups/snapshots/org/apache/pdfbox/pdfbox-app/1.8.10-SNAPSHOT/ About the creator of that PDF - if it was done by a partner of your company, please contact the vendor (Datalab "Pantheon PDF export") and point them to https://issues.apache.org/jira/browse/PDFBOX-2501 http://bugs.ghostscript.com/show_bug.cgi?format=multiple&id=690734 Tilman Am 29.04.2015 um 17:58 schrieb Tilman Hausherr: > In the meantime the developer of twelvemonkeys answered, and pointed > me to the real reason: not the JPEG is malformed, it is the PDF. This > bug is described in > https://issues.apache.org/jira/browse/PDFBOX-2501 > but solved for 2.0 only. I'll apply for 1.8 and then come back to you. > > Btw, that file, is it somehow confidential? > > Tilman > > Am 29.04.2015 um 13:36 schrieb Gregor Kovač: >> Hi! >> >> I see. Great. >> >> Thanks for all your help. >> >> Best regards, >> Kovi >> >> 2015-04-29 7:49 GMT+02:00 Tilman Hausherr : >> >>> Hi, >>> >>> The plan was to release it when all, or at least the blocker issues are >>> solved: >>> >>> https://issues.apache.org/jira/issues/?jql=fixVersion%20%3D%202.0.0%20AND%20project%20%3D%20PDFBOX%20AND%20resolution%20%3D%20Unresolved%20ORDER%20BY%20updated%20DESC%2C%20key%20DESC%2C%20priority%20DESC >>> >>> >>> How I extracted the image - I changed the source code to save the file >>> before attempting to decode it. I inserted this code at the >>> beginning of >>> decode() in JPXFilter.java in the 1.8 version: >>> >>> ByteArrayOutputStream baos = new ByteArrayOutputStream(); >>> IOUtils.copy(compressedData, baos); >>> String filename = System.currentTimeMillis()+".jpg"; >>> FileOutputStream fos = new FileOutputStream(new >>> File("INSERT___DIRECTORY____",filename)); >>> fos.write(baos.toByteArray()); >>> fos.close(); >>> compressedData = new ByteArrayInputStream(baos.toByteArray()); >>> System.out.println("jpeg filename: " + filename); >>> >>> They might also be extracted by the ExtractImages tool of the 2.0 >>> version >>> (not by the 1.8 version, that one doesn't support inline images). >>> >>> Here is one of the JPEG files: >>> http://home.snafu.de/tilman/tmp/1430258221574.jpg >>> >>> I also opened an issue with twelvemonkeys, as it can't read the file >>> either with ImageIO.read(), and the developer may give some insight >>> about >>> that file. >>> https://github.com/haraldk/TwelveMonkeys/issues/132 >>> >>> Tilman >>> >>> >>> Am 29.04.2015 um 01:20 schrieb Gregor Kovač: >>> >>>> Hi! >>>> >>>> I tried version 2.0 and it does print the picture. Nice. >>>> Is ETA for 2.0 set? How did you extract images from PDF? >>>> >>>> Best regards, >>>> Kovi >>>> >>>> 2015-04-29 0:11 GMT+02:00 Tilman Hausherr : >>>> >>>> This is a problem with the reading of JPEG images in 1.8 that is >>>> fixed in >>>>> the unreleased 2.0 (where we use a different strategy to read JPEG >>>>> files), >>>>> but not in 1.8. I confirmed this by extracting one of the inline >>>>> images >>>>> that brings the problem. It can't be read with ImageIO.read(), not >>>>> with >>>>> the >>>>> sun version and not with the twelvemonkeys (which usuallly can >>>>> read all). >>>>> >>>>> Try the 2.0 version: >>>>> https://pdfbox.apache.org/downloads.html#scm >>>>> Note hat the API is different, see in the examples. >>>>> >>>>> Tilman >>>>> >>>>> >>>>> Am 28.04.2015 um 23:05 schrieb Gregor Kovač: >>>>> >>>>> Hi! >>>>>> At this link >>>>>> >>>>>> >>>>>> https://drive.google.com/file/d/0B-qoYfDTAZ8QN3V4ang4bEJIN3c/view?usp=sharing >>>>>> >>>>>> is original PDF >>>>>> At this link >>>>>> >>>>>> >>>>>> https://drive.google.com/file/d/0B-qoYfDTAZ8QcjRBMWRBZE50Snc/view?usp=sharing >>>>>> >>>>>> is what gets printed. >>>>>> >>>>>> Best regards, >>>>>> Kovi >>>>>> >>>>>> 2015-04-28 18:10 GMT+02:00 Tilman Hausherr : >>>>>> >>>>>> Hi, >>>>>> >>>>>>> PDF files are not attached in the mailing list. Can you upload >>>>>>> it to a >>>>>>> public place? >>>>>>> >>>>>>> Tilman >>>>>>> >>>>>>> >>>>>>> Am 28.04.2015 um 10:48 schrieb Gregor Kovač: >>>>>>> >>>>>>> Hi! >>>>>>> >>>>>>>> I've been using PDFBox 1.8.9 for printing PDF files. In the >>>>>>>> attachment >>>>>>>> (eSlog_380_1031000000002_012354698725.pdf) is a PDF file with one >>>>>>>> picture >>>>>>>> in upper left corner, that does not get printed out at all >>>>>>>> (attachment >>>>>>>> "Java Printing.pdf"). There is a blank space instead of picture. >>>>>>>> >>>>>>>> Why? >>>>>>>> >>>>>>>> This is the code I use for printing: >>>>>>>> private static void printWithPDFBox(File pdfFile, int copies) { >>>>>>>> boolean silentPrint = true; >>>>>>>> String printerName = null; >>>>>>>> String password = null; >>>>>>>> PDDocument document = null; >>>>>>>> try { >>>>>>>> document = PDDocument.load(pdfFile); >>>>>>>> >>>>>>>> if (document.isEncrypted()) { >>>>>>>> document.decrypt(password); >>>>>>>> } >>>>>>>> >>>>>>>> PrinterJob printJob = PrinterJob.getPrinterJob(); >>>>>>>> printJob.setJobName(pdfFile.getName()); >>>>>>>> >>>>>>>> if (printerName != null) { >>>>>>>> PrintService[] printService = >>>>>>>> PrinterJob.lookupPrintServices(); >>>>>>>> boolean printerFound = false; >>>>>>>> for (int i = 0; !printerFound && i < >>>>>>>> printService.length; >>>>>>>> i++) { >>>>>>>> if >>>>>>>> (printService[i].getName().indexOf(printerName) >>>>>>>> != >>>>>>>> -1) { >>>>>>>> printJob.setPrintService(printService[i]); >>>>>>>> printerFound = true; >>>>>>>> } >>>>>>>> } >>>>>>>> } >>>>>>>> >>>>>>>> printJob.setPageable(new PDPageable(document, >>>>>>>> printJob)); >>>>>>>> printJob.setCopies(copies); >>>>>>>> if (silentPrint || printJob.printDialog()) { >>>>>>>> printJob.print(); >>>>>>>> } >>>>>>>> } catch (Exception e) { >>>>>>>> e.printStackTrace(); >>>>>>>> } finally { >>>>>>>> if (document != null) { >>>>>>>> try { >>>>>>>> document.close(); >>>>>>>> } catch (IOException e) { >>>>>>>> e.printStackTrace(); >>>>>>>> } >>>>>>>> } >>>>>>>> } >>>>>>>> } >>>>>>>> >>>>>>>> Best regards, >>>>>>>> Kovi >>>>>>>> >>>>>>>> -- >>>>>>>> -~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~ >>>>>>>> | In A World Without Fences Who Needs Gates? | >>>>>>>> | Experience Linux. | >>>>>>>> -~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~ >>>>>>>> >>>>>>>> >>>>>>>> --------------------------------------------------------------------- >>>>>>>> >>>>>>>> To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org >>>>>>>> For additional commands, e-mail: users-help@pdfbox.apache.org >>>>>>>> >>>>>>>> >>>>>>>> --------------------------------------------------------------------- >>>>>>>> >>>>> To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org >>>>> For additional commands, e-mail: users-help@pdfbox.apache.org >>>>> >>>>> >>>>> >>> --------------------------------------------------------------------- >>> To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org >>> For additional commands, e-mail: users-help@pdfbox.apache.org >>> >>> >> > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org > For additional commands, e-mail: users-help@pdfbox.apache.org > --------------------------------------------------------------------- To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org For additional commands, e-mail: users-help@pdfbox.apache.org