pdfbox-users mailing list archives

From Kishore Babu <kb...@envistacorp.com>
Subject extracting text from image using pdfbox
Date Fri, 12 Oct 2012 13:47:40 GMT
Hi All,
Is it possible to extract text from an image (JPEG) using PDFBox, or is there any open source
Java code for this?

When I try to convert the PDF to text, the output is blank. I then converted the PDF into a JPEG
image. The image shows the text properly, but I am failing to extract it.

For normal PDF documents I can extract text fine using the standard process, but when the
PDF document is an image, I fail to extract the text that is present in the image.
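
To make the two cases concrete, here is a minimal sketch of how the blank-output case could be told apart from the normal case in code. It assumes the text has already been pulled out by whatever extraction step is in use and is passed in as a plain String; the class name ScannedPdfCheck and the method looksImageOnly are hypothetical names for illustration, not part of PDFBox. A blank result usually means the page has no text layer, in which case the text would have to come from an OCR step instead.

```java
public class ScannedPdfCheck {

    // Hypothetical helper: returns true when the extracted text contains
    // no visible characters (null, empty, or only spaces and line breaks).
    static boolean looksImageOnly(String extractedText) {
        return extractedText == null || extractedText.trim().isEmpty();
    }

    public static void main(String[] args) {
        // Sample strings standing in for real extraction results (assumed data).
        String fromNormalPdf = "Invoice 42\nTotal: 10.00\n";
        String fromScannedPdf = "\r\n \r\n";

        System.out.println("normal PDF looks image-only:  " + looksImageOnly(fromNormalPdf));
        System.out.println("scanned PDF looks image-only: " + looksImageOnly(fromScannedPdf));
    }
}
```

A check like this only flags the document; it does not recover any text from the image itself.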

Can anyone give directions on this, please?

Thanks in advance.



Kishore Babu | Developer
email: kbabu@envistacorp.com
office: 040.66417681