pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Rodrigo Caniçali <rodrigo.canic...@yahoo.com.br>
Subject WARNING: Did not found XRef object at specified startxref position
Date Fri, 01 Nov 2013 21:55:45 GMT

I found on a mailing list of 2012-jun-14 that this problem has been already discussed, but
here is pretty different.

I also get the warning "Did not found XRef object at specified startxref position xxx" when
executing the main function of org.apache.pdfbox.ExtractText class. However, some PDF texts
are ignored and are not printed on the output TXT file. These same texts are displayed by
Acrobat Reader and can be copyed by the user as texts from this program.

If the option "-nonSeq" is selected, then appears a "java.io.IOException: Error: Expected
a long type, actual=..." which stops the text extraction.

Please, is there any way to make it work?


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message