pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hesham G." <heshamgne...@gmail.com>
Subject Re: Extracting text from Arabic PDF - Text appears reveresed
Date Wed, 12 Jan 2011 07:35:41 GMT
I have now seen that this was fixed before, by including the ICU4J library ... Which is now
automatically included in PDFBox 1.4 ... And I was wondering why PDFBox 1.4 size was that
big 

Thanks to the PDFBox guys.


Best regards ,
Hesham 

---------------------------------------------
Included message :


Hello ,

I am using PDFBox 1.4 to extract text from an Arabic PDF file. The problem with Arabic is
that it is a right to left language. The words are read reversed.
Can this be fixed ?

Here is a 1 page Arabic PDF to try it : 
http://www.4shared.com/document/k6Mrafej/arabic_book.html


Best regards ,
Hesham
Mime
  • Unnamed multipart/related (inline, None, 0 bytes)
View raw message