pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Wu, Han" <w...@umdnj.edu>
Subject PDFBox does not support PDF forms made from Adobe Livecycle
Date Wed, 13 Jul 2011 14:36:15 GMT

I have tested PDFBox 1.6.0, the text extraction feature does not work with PDF forms designed
from Adobe Livecycle.

The output text file only has content like this:

"To view the full contents of this document, you need a later version
of the PDF viewer. You can upgrade to the latest version of Adobe
Reader from www.adobe.com/products/acrobat/readstep2.html<http://www.google.com/url?sa=D&q=www.adobe.com/products/acrobat/readstep2.html>

For further support, go to www.adobe.com/support/products/acrreader.html<http://www.google.com/url?sa=D&q=www.adobe.com/support/products/acrreader.html>"
I have tried all kinds of open source codes, all same problems, because I see that they do
not have decoding function for that.
Here is the explanation I found:

"PDFs created in LiveCycle Designer aren't the same as PDFs created in Acrobat or third-party
tools. They're really a different format
entirely. They contain an XDP (Acrobat XML Data Package File) payload within a PDF wrapper.

Please advise, thank you.

Wu, Han
Programmer Analyst
UMDNJ-ICPH   Dept. of Microbiology and Molecular Genetics  ICPH-E250R
Phone: 973-972-9005 (Tue, Thur)
UMDNJ-NJMS  Office of Research and Sponsored Programs  MSB-C690
Phone: 973-972-5237 (Mon, Wed)

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message