pdfbox-dev mailing list archives

From "John Logan (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (PDFBOX-3153) Direct JPEG extraction results in invalid images in 2.0.0 releases.
Date Fri, 04 Dec 2015 23:47:10 GMT

     [ https://issues.apache.org/jira/browse/PDFBOX-3153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

John Logan updated PDFBOX-3153:
    Attachment: parents.pdf

Here is a test file with which I can reproduce the issue.

> Direct JPEG extraction results in invalid images in 2.0.0 releases.
> -------------------------------------------------------------------
>                 Key: PDFBOX-3153
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3153
>             Project: PDFBox
>          Issue Type: Bug
>          Components: PDModel
>    Affects Versions: 2.0.0
>         Environment: Observed on both Linux and Mac
>            Reporter: John Logan
>              Labels: extraction, image
>         Attachments: parents.pdf
>
> When I run pdfbox-app ExtractImages on a PDF containing an image with a DeviceRGB colorspace,
> the resulting JPEG file is very large (5.3MB, while the source PDF is 320KB).
> I see this with the 2.0.0-RC2 release, and I also encounter the problem with a build from
> today's trunk.
> If I modify the code to force usage of ImageIO, a valid JPEG file results.
> The image extracts properly in the 1.8.10 version.
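
For anyone trying to reproduce this, a rough way to tell whether an extracted file is a usable JPEG is to check for the start-of-image marker (FF D8) and then ask ImageIO to decode it. The sketch below uses only the JDK; the class and method names (JpegSanityCheck, hasJpegSignature, decodes, encodeWithImageIO) are hypothetical and are not part of PDFBox. It does not show how ExtractImages writes its output, only one way to sanity-check the bytes it produced.

```java
import javax.imageio.ImageIO;
import java.awt.image.BufferedImage;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;

// Hypothetical helper, not part of PDFBox: sanity checks for extracted image bytes.
public class JpegSanityCheck {

    /** True if the data begins with the JPEG start-of-image marker FF D8. */
    public static boolean hasJpegSignature(byte[] data) {
        return data.length >= 2
                && (data[0] & 0xFF) == 0xFF
                && (data[1] & 0xFF) == 0xD8;
    }

    /** True if ImageIO finds a reader for the data and decodes it without an I/O error. */
    public static boolean decodes(byte[] data) {
        try {
            // ImageIO.read returns null when no registered reader recognizes the data.
            return ImageIO.read(new ByteArrayInputStream(data)) != null;
        } catch (IOException e) {
            return false;
        }
    }

    /** Encodes an image as JPEG through ImageIO, the path reported to give a valid file. */
    public static byte[] encodeWithImageIO(BufferedImage image) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        if (!ImageIO.write(image, "jpg", out)) {
            throw new IOException("no JPEG writer available");
        }
        return out.toByteArray();
    }
}
```

Reading the extracted file with java.nio.file.Files.readAllBytes and passing the result to both checks should show quickly whether the output is JPEG data at all or something else saved under a .jpg name.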

