pdfbox-users mailing list archives

From "Allison, Timothy B." <talli...@mitre.org>
Subject Extracting rotated text
Date Mon, 25 Sep 2017 17:31:27 GMT
Colleagues,
Does anyone have recommendations for extracting rotated text, such as the text in this document?
https://www.fsis.usda.gov/wps/wcm/connect/896bf55c-0d78-44a0-adfb-94f893eb0f72/GallagherEbelKause_74.pdf?MOD=AJPERES

Adobe DC produces reasonable text with "save as text". PDFBox's ExtractText (and Tika) instead
produce fragmented output like this:

FS
IS
L
is
te
ria
Li
st
er
ia
R
is
k
R
is
k
As
se
ss
m
en

Thank you!
