pdfbox-users mailing list archives

From Branden Visser <mrvis...@gmail.com>
Subject Re: is it possible to batch extract text from pdf files within a tree of folders within a zip file ?
Date Wed, 20 Apr 2016 20:20:30 GMT
PDFBox can extract the text from the PDF files for you. However,
unpacking the zip file, locating the PDF documents, saving the text in
a different format, and rezipping are things I believe you'll have to
handle with other libraries such as commons-compress [1].
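
As a rough illustration of the non-PDFBox part, here is a minimal
sketch that walks a zip, finds the .pdf entries and writes one .txt
file per PDF into the same folder structure under another directory.
It uses the JDK's java.util.zip rather than commons-compress, purely
to keep the example dependency-free. The names ZipPdfTextExtractor,
PdfTextExtractor and extractAll are made up for this example; the
actual text extraction is left as a pluggable callback so the sketch
does not depend on a particular PDFBox version.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Locale;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

public class ZipPdfTextExtractor {

    /** Hypothetical hook: turns the bytes of one PDF into plain text. */
    @FunctionalInterface
    public interface PdfTextExtractor {
        String extract(byte[] pdfBytes) throws IOException;
    }

    /**
     * Reads every .pdf entry in zipFile and writes its extracted text to
     * outputRoot, mirroring the folder structure inside the zip.
     * Returns the number of PDFs processed.
     */
    public static int extractAll(Path zipFile, Path outputRoot,
                                 PdfTextExtractor extractor) throws IOException {
        Path root = outputRoot.toAbsolutePath().normalize();
        int count = 0;
        try (ZipInputStream zip = new ZipInputStream(Files.newInputStream(zipFile))) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                String name = entry.getName();
                // Skip folders and anything that is not a PDF.
                if (entry.isDirectory()
                        || !name.toLowerCase(Locale.ROOT).endsWith(".pdf")) {
                    continue;
                }
                // Same relative path as in the zip, with .pdf swapped for .txt.
                String textName = name.substring(0, name.length() - 4) + ".txt";
                Path target = root.resolve(textName).normalize();
                // Refuse entries such as "../../x.pdf" that would leave the root.
                if (!target.startsWith(root)) {
                    throw new IOException("Entry escapes output directory: " + name);
                }
                Files.createDirectories(target.getParent());
                String text = extractor.extract(readAll(zip));
                Files.write(target, text.getBytes(StandardCharsets.UTF_8));
                count++;
            }
        }
        return count;
    }

    /** Reads the remaining bytes of the current zip entry. */
    private static byte[] readAll(InputStream in) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        byte[] chunk = new byte[8192];
        int n;
        while ((n = in.read(chunk)) != -1) {
            buffer.write(chunk, 0, n);
        }
        return buffer.toByteArray();
    }
}
```

With PDFBox 2.x on the classpath, the callback would typically load
the bytes with PDDocument.load(byte[]), pass the document to
new PDFTextStripper().getText(doc), and close the document afterwards.
Each PDF is buffered fully in memory here, so very large files may
need a different approach.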

Hope that helps.


[1] https://commons.apache.org/proper/commons-compress/

On Wed, Apr 20, 2016 at 12:51 PM, David Green <david@davidgreen.co.uk> wrote:
> . . . and save the text files in the same tree structure on another drive ?
> this seems a big ask
> --
> Regards
> David
