pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Andreas Lehmkuehler <andr...@lehmi.de>
Subject Re: Extracting local language (Sinhala Unicode) from a pdf
Date Sat, 22 Jun 2013 07:09:44 GMT

Am 20.06.2013 12:26, schrieb Supun Nakandala:
> Hi,
> I want to extract Sinhala (local language) from a pdf file. I am not
> familiar with pdfbox. I would like to know whether is this possible and how
> can I do it using pdfbox
I depends on the pdfs and the used kind of fonts. I suggest to give it a try.
There are some easy to use command line tools such as ExtractText, see [1]
for further details.

> Thank you.
> Regards Supun

Andreas Lehmkühler

[1] http://pdfbox.apache.org/commandline/

View raw message