pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Kishore Babu <kb...@envistacorp.com>
Subject extracting text from image using pdfbox
Date Fri, 12 Oct 2012 13:47:40 GMT
Hi All,
Is it possible to extract text from an image (JPEG) using pdfbox or is there any open source
java code for this?

When I try to  convert pdf to text, it is showing blank output. Then I converted into JPEG
image. The image contains the text properly, which I am failing to extract.

For normal pdf documents I am extracting text nicely using the standard process but when the
pdf document is an image, I am failing to extract the text that is present in the image.

Can anyone give directions on this, please?

Thanks in advance.

Regards,


[cid:image011.jpg@01CDA8AE.3F12B960]<http://www.envistacorp.com/>






Kishore Babu I Developer
email: kbabu@envistacorp.com<mailto:kbabu@envistacorp.com>
office: 040.66417681
www.envistacorp.com<http://www.envistacorp.com>
Subscribe<http://pages.exacttarget.com/page.aspx?QS=472529ec60bdf32a36ac1f221ebc1706b66778f96833e451f77fe13f8e4cf0db>
to enVista's Newsletter!

[cid:image012.jpg@01CDA8AE.3F12B960]<http://www.facebook.com/home.php?#%21/pages/enVista/281494945342?ref=search>
  [cid:image013.jpg@01CDA8AE.3F12B960] <http://twitter.com/enVistaSpplyChn>    [cid:image014.jpg@01CDA8AE.3F12B960]
<http://www.linkedin.com/companies/envista>





[cid:image015.jpg@01CDA8AE.3F12B960]<http://www.inc.com/inc5000/profile/envista>








Mime
  • Unnamed multipart/related (inline, None, 0 bytes)
View raw message