pdfbox-users mailing list archives

From "moh.sushi" <moh.su...@googlemail.com>
Subject pdf => image ; inline images are not converted
Date Thu, 10 Jan 2013 10:27:43 GMT
Hello,

We are using PDFBox to convert PDF pages into images.
Some documents cannot be converted completely; the inline images are the
problem.

Extracts from the logs:

[WARN] 2013-01-10 10:45:26,491 [main] org.apache.pdfbox.util.operator.SetNonStrokingColor - Not supported colorspace Indexed within operator scn
[ERROR] 2013-01-10 10:45:26,507 [main] org.apache.pdfbox.pdmodel.graphics.xobject.PDPixelMap - java.lang.IndexOutOfBoundsException: Index: 12, Size: 12
java.lang.IndexOutOfBoundsException: Index: 12, Size: 12
    at java.util.ArrayList.RangeCheck(ArrayList.java:547)
    at java.util.ArrayList.get(ArrayList.java:322)
    at org.apache.pdfbox.io.RandomAccessBuffer.seek(RandomAccessBuffer.java:84)
    at org.apache.pdfbox.io.RandomAccessFileOutputStream.write(RandomAccessFileOutputStream.java:116)
    at org.apache.pdfbox.filter.RunLengthDecodeFilter.decode(RunLengthDecodeFilter.java:92)
    at org.apache.pdfbox.cos.COSStream.doDecode(COSStream.java:279)
    at org.apache.pdfbox.cos.COSStream.doDecode(COSStream.java:221)
    at org.apache.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:156)
    at org.apache.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:214)
    at org.apache.pdfbox.pdmodel.common.PDStream.getByteArray(PDStream.java:468)
    at org.apache.pdfbox.pdmodel.graphics.xobject.PDPixelMap.getRGBImage(PDPixelMap.java:185)
    at org.apache.pdfbox.util.operator.pagedrawer.Invoke.process(Invoke.java:83)
    at org.apache.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:554)
    at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:268)
    at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:235)
    at org.apache.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:215)
    at org.apache.pdfbox.pdfviewer.PageDrawer.drawPage(PageDrawer.java:125)
    at org.apache.pdfbox.pdmodel.PDPage.convertToImage(PDPage.java:757)
.....
[WARN] 2013-01-10 10:45:26,883 [main] org.apache.pdfbox.pdmodel.graphics.color.PDColorState - Unable to create the color instance [0.0, 0.0, 0.0] in color space Indexed{ }; guessing color ...
Interpretating as RGB
java.io.IOException: Not implemented
    at org.apache.pdfbox.pdmodel.graphics.color.PDIndexed.createColorSpace(PDIndexed.java:116)
    at org.apache.pdfbox.pdmodel.graphics.color.PDColorSpace.getJavaColorSpace(PDColorSpace.java:85)
    at org.apache.pdfbox.pdmodel.graphics.color.PDColorState.createColor(PDColorState.java:180)
....

The full debug log is available at: http://pastebin.com/iMJS7guS


The original PDF cannot be provided for now because it contains confidential data.

The converted image can be seen at http://postimage.org/image/kl0hfrk0b/

If more information is required, please let me know!
Output from "WriteDecodedDoc" is not possible because
"java.lang.IndexOutOfBoundsException: Index: 12, Size: 12" is thrown.
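
Since the trace above points at RunLengthDecodeFilter.decode, a standalone check of the raw stream bytes might help narrow things down. The following is only a minimal sketch of RunLengthDecode as the PDF specification describes it (a length byte of 0 to 127 copies the next length+1 bytes, 129 to 255 repeats the next byte 257-length times, 128 marks end of data). It does not use PDFBox at all; the class name RunLengthSketch and the method decode are hypothetical names chosen for illustration, and the sample bytes are made up.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;

public class RunLengthSketch {

    // Decodes RunLengthDecode data; throws if a run extends past the end of the input.
    public static byte[] decode(byte[] in) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        int i = 0;
        while (i < in.length) {
            int len = in[i++] & 0xFF;
            if (len == 128) {
                break; // end-of-data marker
            }
            if (len < 128) {
                // literal run: copy the next len + 1 bytes unchanged
                int count = len + 1;
                if (i + count > in.length) {
                    throw new IllegalArgumentException("truncated literal run before offset " + i);
                }
                out.write(in, i, count);
                i += count;
            } else {
                // repeated run: write the next byte 257 - len times
                if (i >= in.length) {
                    throw new IllegalArgumentException("missing repeat byte at offset " + i);
                }
                int count = 257 - len;
                byte value = in[i++];
                for (int k = 0; k < count; k++) {
                    out.write(value);
                }
            }
        }
        return out.toByteArray();
    }

    public static void main(String[] args) {
        // Made-up sample: literal "abc", then 'x' repeated three times, then end of data.
        byte[] encoded = {2, 'a', 'b', 'c', (byte) 254, 'x', (byte) 128};
        System.out.println(new String(decode(encoded), StandardCharsets.US_ASCII));
    }
}
```

If the raw bytes of the affected image stream decode cleanly with something like this, the data itself is probably intact and the problem is more likely on the PDFBox side; if it throws, the stream may be truncated or malformed.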

Would a screenshot of PDFDebugger be helpful? If so, which part?


What can I do to get the inline images converted as well?

Thank you!


Greetings,
Sascha
