Date: Tue, 1 Dec 2015 01:29:10 +0000 (UTC)
From: "Jim deVos (JIRA)"
To: dev@pdfbox.apache.org
Subject: [jira] [Created] (PDFBOX-3142) PDFMergerUtility generates result with blank pages for certain source files.

Jim deVos created PDFBOX-3142:
---------------------------------

Summary: PDFMergerUtility generates result with blank pages for certain source files.
Key: PDFBOX-3142
URL: https://issues.apache.org/jira/browse/PDFBOX-3142
Project: PDFBox
Issue Type: Bug
Components: Utilities
Affects Versions: 1.8.10
Environment: Ubuntu 14.04.3, java 1.8.0_66
Reporter: Jim deVos

My team uses PDFMergerUtility to attach cover pages to various pdfs. We recently tried utilizing a scratch file (e.g.
PDFMergerUtility.mergeDocumentsNonSeq()) to cut down on the amount of RAM we are using. This approach works for the majority of pdfs in our system, but some files cause the merger utility to generate a result pdf with a blank page. Specifically, the result pdf contains a blank page after the cover page, in place of the first page of the second document sent to the merger utility.

Whenever this problem occurs, we see the following line in our logs:

{{org.apache.pdfbox.pdfparser.NonSequentialPDFParser - Can't find the object 52 0 (origin offset 7187557)}}

I'll try to attach or link an example pdf soon, but currently I don't have permission to redistribute any files that exhibit the problem. In the meantime, here's a simple snippet that reproduces the problem:

{code}
// coverpagePdf, documentPdf, destinationPdf and TMP_FOLDER are defined elsewhere in the test class.
@Test
public void testMergeNonSeq() throws IOException, COSVisitorException {
    destinationPdf = new File(TMP_FOLDER, "result-nonseq.pdf");

    PDFMergerUtility ut = new PDFMergerUtility();
    // Scratch file used by the non-sequential merge instead of main memory.
    RandomAccess ram = new RandomAccessFile(File.createTempFile("mergeram", ".bin"), "rw");

    ut.addSource(coverpagePdf);
    ut.addSource(documentPdf);
    ut.setDestinationFileName(destinationPdf.getCanonicalPath());
    ut.mergeDocumentsNonSeq(ram);

    // The only automated way we have to tell that something went wrong
    // is to check the size of the result.
    assertThat("destination pdf should be larger than the original pdf",
            destinationPdf.length(), is(greaterThan(documentPdf.length())));
}
{code}
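
The warning quoted above suggests a second automated signal besides the file-size check: scanning captured log output for the parser's message. The following is a minimal sketch in plain Java, assuming the log is available as text. The class and method names are hypothetical and not part of PDFBox, and the pattern is based only on the single log line shown in this report.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical test helper (not part of PDFBox).
final class MissingObjectLogCheck {

    // Matches the warning text quoted in this report, e.g.
    // "Can't find the object 52 0 (origin offset 7187557)".
    // Assumption: other occurrences of the warning use the same wording.
    private static final Pattern MISSING_OBJECT = Pattern.compile(
            "Can't find the object (\\d+) (\\d+) \\(origin offset (\\d+)\\)");

    private MissingObjectLogCheck() {
    }

    // Returns "objectNumber generationNumber" for every matching warning in the log text.
    static List<String> findMissingObjects(String logText) {
        List<String> missing = new ArrayList<String>();
        Matcher m = MISSING_OBJECT.matcher(logText);
        while (m.find()) {
            missing.add(m.group(1) + " " + m.group(2));
        }
        return missing;
    }
}
```

A test could then assert that findMissingObjects(capturedLog) is empty after the merge. How the log output is captured depends on the logging setup and is left out here.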