pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Vinayaka Dalwai <vinayaka.dal...@gmail.com>
Subject Keeping data format intact after converting pdf file into txt
Date Tue, 13 Nov 2018 10:44:02 GMT
Hi :) ,
I have been converting many pdf files into txt files, thanks to Apache pdf
box.
However, I have recently come across a pdf file which after converting into
txt file does not retain the format that was in pdf file. The data is
completely disintegrated from the tables and all the data appear vertically.
Is there any way i can retain the format and tables. Any help on this would
be much appreciated.

Thanks & Regards,
Vinayaka

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message