pdfbox-dev mailing list archives

From "Tilman Hausherr (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (PDFBOX-4049) IllegalArgumentException: root cannot be null
Date Fri, 05 Jan 2018 17:15:00 GMT

    [ https://issues.apache.org/jira/browse/PDFBOX-4049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16313466#comment-16313466 ]

Tilman Hausherr commented on PDFBOX-4049:

Your PDF starts with 
Content-Disposition: form-data; name="fileData"; filename="514bf860-d583-494c-bb0b-68e8167bba53.pdf"
Content-Type: application/pdf
and ends with
which indicates a faulty MIME extraction tool. Did you receive this file from a client, or did
you extract it yourself? A coworker (great guy) wrote such a tool in Perl and cursed a lot.
Years later (he had quit) I rewrote such a tool myself in Java with additional features, and
I quickly understood his pain.

I'm asking because if you wrote that tool yourself, or if you can influence whoever wrote
it, the extraction could be fixed at the source and the whole problem would be moot.
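
If the extraction tool cannot be fixed, one possible workaround is to cut the multipart envelope off before parsing. The following is only a rough sketch under stated assumptions: it assumes the real PDF data lies between the first "%PDF-" marker and the last "%%EOF" marker in the file, and the class name PdfEnvelopeStripper and its strip method are hypothetical helpers, not part of PDFBox.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical helper (not a PDFBox class): removes bytes that a faulty
// MIME extractor left before the PDF header and after the final %%EOF.
final class PdfEnvelopeStripper {
    private static final byte[] HEADER = "%PDF-".getBytes(StandardCharsets.US_ASCII);
    private static final byte[] TRAILER = "%%EOF".getBytes(StandardCharsets.US_ASCII);

    private PdfEnvelopeStripper() {
    }

    // Returns the bytes from the first "%PDF-" through the last "%%EOF",
    // or null if either marker is missing or they appear in the wrong order.
    static byte[] strip(byte[] data) {
        int start = indexOf(data, HEADER);
        if (start < 0) {
            return null;
        }
        int end = lastIndexOf(data, TRAILER);
        if (end < start) {
            return null;
        }
        return Arrays.copyOfRange(data, start, end + TRAILER.length);
    }

    // First offset at which pattern occurs in data, or -1.
    private static int indexOf(byte[] data, byte[] pattern) {
        for (int i = 0; i <= data.length - pattern.length; i++) {
            if (matches(data, pattern, i)) {
                return i;
            }
        }
        return -1;
    }

    // Last offset at which pattern occurs in data, or -1.
    private static int lastIndexOf(byte[] data, byte[] pattern) {
        for (int i = data.length - pattern.length; i >= 0; i--) {
            if (matches(data, pattern, i)) {
                return i;
            }
        }
        return -1;
    }

    private static boolean matches(byte[] data, byte[] pattern, int offset) {
        for (int j = 0; j < pattern.length; j++) {
            if (data[offset + j] != pattern[j]) {
                return false;
            }
        }
        return true;
    }
}
```

The result, when not null, could then be handed to PDDocument.load(byte[]) in PDFBox 2.0.x. This only papers over the symptom; a file whose body was also damaged during extraction will still fail to parse.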

> IllegalArgumentException: root cannot be null
> ---------------------------------------------
>                 Key: PDFBOX-4049
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4049
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.8
>         Environment: Windows 10
>            Reporter: savan patel
>              Labels: regression
>         Attachments: 372d5dd7-d4b8-41b2-9f50-80c1353aee59.pdf
> I got a PDF for which PDFBox gives errors while parsing it.
> {code}
> Exception in thread "main" java.lang.IllegalArgumentException: root cannot be null
>         at org.apache.pdfbox.pdmodel.PDPageTree.<init>(PDPageTree.java:75)
>         at org.apache.pdfbox.pdmodel.PDDocumentCatalog.getPages(PDDocumentCatalog.java:129)
>         at org.apache.pdfbox.pdmodel.PDDocument.getNumberOfPages(PDDocument.java:1411)
> {code}
> This did not happen with 2.0.7.
