pdfbox-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tilman Hausherr (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (PDFBOX-958) convertToImage mangles images which were in the PDF
Date Thu, 01 May 2014 12:51:15 GMT

    [ https://issues.apache.org/jira/browse/PDFBOX-958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13986546#comment-13986546

Tilman Hausherr commented on PDFBOX-958:

Is there anybody who can access the PDF of this issue, and if yes, what URL works? I get an
error. I suspect it has something to do with url-encoding of the "(R)".

> convertToImage mangles images which were in the PDF
> ---------------------------------------------------
>                 Key: PDFBOX-958
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-958
>             Project: PDFBox
>          Issue Type: Bug
>    Affects Versions: 1.2.1, 1.4.0, 1.5.0
>         Environment: RHEL5 and WinXP, java version "1.6.0_23"
>            Reporter: Eric Schwarzenbach
>            Assignee: Andreas Lehmkühler
>            Priority: Critical
>             Fix For: 1.6.0
>         Attachments: Image of Page 13.jpeg, Image of Page 13.png, Wrycan® Lorem Ipsum
> Of the PDFs we've tried running through PDFBox and generating page images, a number of
them (coming from disparate sources and method of creation) seem to produce images where an
image that was embedded in the page of the PDF shows somewhat mangled. It seems to be divided
by horizontal stripes, where some stripes look normal, others seem to have some kind of "smearing"
effect going on. See attached images and original PDF (image is of page 13).
> I marked this as critical as we are trying to use PDFBox in a project where page images
are crucial, and inability to produce reasonable looking page images is pretty much a deal
> The code we use to extract the images looks more or less like the following:
> 					BufferedImage image = page.convertToImage();
> 					SmartDeferredFileOutputStream outStream = new SmartDeferredFileOutputStream();
> 					String[] writerFormatNames = ImageIO.getWriterFormatNames();
> 					ImageIO.write(image, "jpeg", outStream);
> 					outStream.close()
> We've also tried specifying "png". In both "jpg" and "png" cases we get an image file
that is indeed the correct format, and both images look exactly the same. 

This message was sent by Atlassian JIRA

View raw message