pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jaroslav Půbal <jaroslav.pu...@marbes.cz>
Subject Raw Image extraction - possible in PDFBox 1.x - impossible in 2.x
Date Thu, 27 Nov 2014 17:28:49 GMT
Hello,
i need extract RAW image from PDF. The image is EXIF tagged.
 
In PDFBox 1.x it was done with   
  PDXObjectImage.write2OutputStream(os);
 
In PDFBox 2.0.0 image extraction can be done with 
  ImageIO.write((PDImageXObject)image.getImage(), "jpg", os);
but this is not RAW data from PDF, this is complete image reencode, so EXIF is lost .
 
How to extract RAW image in PDFBox 2.0.0 ?
 
Thanks
Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message