pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From George Van Treeck <tre...@yahoo.com>
Subject Bug or known limitation?
Date Tue, 15 Dec 2009 04:06:11 GMT
I ran into the exception below when using an older 0.8 version. So, I did a build using HEAD
from subversion. And the exception persists. The following is output from a little web crawler
I wrote.

ERROR: Unable to load PDF document: http://www.polaroid.com/media/document/a932manualEN20091019.pdf
java.io.IOException: Unknown xobject subtype 'PS'
at org.apache.pdfbox.pdmodel.graphics.xobject.PDXObject.createXObject(PDXObject.java:165)
at org.apache.pdfbox.pdmodel.PDResources.getXObjects(PDResources.java:161)
at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:226)
at org.apache.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:206)
at org.apache.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:367)
at org.apache.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:291)
at org.apache.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:247)
at org.apache.pdfbox.util.PDFTextStripper.getText(PDFTextStripper.java:180)
at webcrawler.WebCrawler.getContent(WebCrawler.java:1444)

-George

Mime
View raw message