pdfbox-users mailing list archives

From Dane Bezuidenhout <dane.bezuidenh...@sprinthive.com>
Subject How to logically read text from a PDF table?
Date Tue, 18 Jul 2017 13:28:35 GMT
The available examples are clear on constructing a table, but there is
little info on reading one. I've investigated a few solutions to this,
but they feel "hacky" in that they rely on establishing column and
row regions to read text from.
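
To make the region-based approach concrete, here is a rough sketch of the idea, not anyone's canonical method. It uses only the JDK: the TableRegions class, its Fragment type, and the boundary arrays are all hypothetical stand-ins, and the fragment coordinates are assumed to have been extracted from the page already. In PDFBox itself, regions like these would typically be handed to PDFTextStripperByArea.

```java
import java.awt.geom.Rectangle2D;
import java.util.List;

// Hypothetical helper for illustration only; not part of PDFBox.
final class TableRegions {

    // A piece of text plus the page coordinates where it starts (assumed type).
    static final class Fragment {
        final String text;
        final double x;
        final double y;

        Fragment(String text, double x, double y) {
            this.text = text;
            this.x = x;
            this.y = y;
        }
    }

    // Builds one rectangle per cell from sorted column (x) and row (y) boundaries.
    static Rectangle2D[][] grid(double[] xEdges, double[] yEdges) {
        int rows = yEdges.length - 1;
        int cols = xEdges.length - 1;
        Rectangle2D[][] cells = new Rectangle2D[rows][cols];
        for (int r = 0; r < rows; r++) {
            for (int c = 0; c < cols; c++) {
                cells[r][c] = new Rectangle2D.Double(
                        xEdges[c], yEdges[r],
                        xEdges[c + 1] - xEdges[c], yEdges[r + 1] - yEdges[r]);
            }
        }
        return cells;
    }

    // Assigns each fragment to the cell whose rectangle contains its start point.
    // Fragments that fall outside every cell are ignored.
    static String[][] read(List<Fragment> fragments, double[] xEdges, double[] yEdges) {
        Rectangle2D[][] cells = grid(xEdges, yEdges);
        int rows = cells.length;
        int cols = xEdges.length - 1;
        String[][] table = new String[rows][cols];
        for (int r = 0; r < rows; r++) {
            for (int c = 0; c < cols; c++) {
                table[r][c] = "";
            }
        }
        for (Fragment f : fragments) {
            for (int r = 0; r < rows; r++) {
                for (int c = 0; c < cols; c++) {
                    if (cells[r][c].contains(f.x, f.y)) {
                        // Join multiple fragments in the same cell with a space.
                        table[r][c] = table[r][c].isEmpty()
                                ? f.text
                                : table[r][c] + " " + f.text;
                    }
                }
            }
        }
        return table;
    }
}
```

The weakness is visible in the signature: the caller has to know the row and column boundaries up front, so any change in the table layout breaks the extraction.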

Surely there is a canonical way to traverse the table elements of a
PDDocument and access table cells by row and column?

Any advice would be appreciated.

Dane Bezuidenhout
SprintHive <https://sprinthive.com/>

M: +27 82 562 7850

vCard <http://www.sprinthive.com/files/dane.vcf>
