pdfbox-dev mailing list archives

From "Ross Johnson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (PDFBOX-4283) Allowing Rectangles with additional elements
Date Wed, 01 Aug 2018 20:14:00 GMT

    [ https://issues.apache.org/jira/browse/PDFBOX-4283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16565914#comment-16565914 ]

Ross Johnson commented on PDFBOX-4283:

The problem PDF itself is confidential, and about 250 MB, but I've saved one of the problem
pages into a new PDDocument, which I've uploaded above. Adobe Acrobat Reader shows the page
as totally blank, and in fact all 4 of the pages that had this issue show as blank in Reader.
I believe that these pages are intended to be blank, but I may be wrong.

> Allowing Rectangles with additional elements
> --------------------------------------------
>                 Key: PDFBOX-4283
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4283
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: PDModel
>    Affects Versions: 2.0.11
>            Reporter: Ross Johnson
>            Priority: Minor
>         Attachments: weird-rectangle.pdf
> I've come across some pages in a large PDF that have some additional, non-numerical elements
> at the end of the MediaBox rectangle array, e.g.
> {code:java}
> /MediaBox [0 0 612 792 5 0 R 6 0 R]
> {code}
> Trying to read such a structure with PDPage.getMediaBox() throws an exception while
> constructing the PDRectangle at [this line|https://github.com/apache/pdfbox/blob/6f18d7c4bef4d23a22dcf14c804d737d43908deb/pdfbox/src/main/java/org/apache/pdfbox/pdmodel/common/PDRectangle.java#L131].
> I'm not sure if this strange case should be treated as a file issue, or if it should be
> supported by slicing / shortening the COSArray prior to trying to convert to floats. Acrobat
> Reader shows the pages without complaint.
> The original PDF was produced by Foxit PhantomPDF Printer Version
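
For illustration only, here is a minimal sketch of the "slicing / shortening" idea from the description above. It is not PDFBox code and not a proposed patch: a plain List<Object> stands in for the COSArray so the example runs without PDFBox on the classpath, and the names RectangleSlice and firstFourNumbers are hypothetical. It assumes that only the first four entries of the box array are meaningful and that anything after them can be ignored.

```java
import java.util.Arrays;
import java.util.List;

class RectangleSlice {

    // Hypothetical helper: read only the first four entries of a box array
    // and convert them to floats, ignoring any trailing entries.
    // The List<Object> is a stand-in for a COSArray (an assumption made so
    // that this sketch is self-contained).
    public static float[] firstFourNumbers(List<Object> boxArray) {
        if (boxArray.size() < 4) {
            throw new IllegalArgumentException("box array needs at least 4 entries");
        }
        float[] coords = new float[4];
        for (int i = 0; i < 4; i++) {
            Object entry = boxArray.get(i);
            if (!(entry instanceof Number)) {
                // A non-numeric value inside the first four slots is still an error.
                throw new IllegalArgumentException("entry " + i + " is not a number");
            }
            coords[i] = ((Number) entry).floatValue();
        }
        return coords;
    }

    public static void main(String[] args) {
        // Mirrors /MediaBox [0 0 612 792 5 0 R 6 0 R]; the two trailing
        // strings stand in for the unexpected indirect references.
        List<Object> mediaBox = Arrays.<Object>asList(0, 0, 612, 792, "5 0 R", "6 0 R");
        System.out.println(Arrays.toString(firstFourNumbers(mediaBox)));
        // prints [0.0, 0.0, 612.0, 792.0]
    }
}
```

The sketch still rejects arrays that are too short or that have a non-number among the first four entries, so only the trailing extras are tolerated. Whether that is the right behaviour for PDFBox is the open question raised in the issue.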
