pdfbox-users mailing list archives

From Tilman Hausherr <THaush...@t-online.de>
Subject Re: PDFBOX - TIKA - PDF parsing error
Date Thu, 29 Jun 2017 19:22:43 GMT
On 29.06.2017 at 12:33, Daniel MendesDaSilva wrote:
>
> Daniel Mendes da Silva
> Senior Analyst Programmer
>
> From: Daniel MendesDaSilva
> Sent: 29 June 2017 12:25
> To: 'dev@pdfbox.apache.org'; 'users@pdfbox.apache.org'
> Subject: PDFBOX - TIKA - PDF parsing error
> Importance: High
>
> Hi,
>
> We're using PdfBox through Tika and we get an exception when parsing a 5 MB PDF file
> - I'm not able to attach to this mail.
>
> Any ideas why we have this error?
> Why is PDFBox trying to parse "-." as a number?
>
>
> Caused by: java.io.IOException: Error expected floating point number actual='-.'
> Caused by: java.lang.NumberFormatException: null

Most likely your PDF is malformed, even if Adobe Reader can display it. 
At some point in the file, PDFBox expects a floating point number and 
gets "-." instead, which contains no digits. Please upload the PDF to a 
file-sharing site and post the link.
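
To illustrate why "-." is rejected, here is a minimal sketch in plain Java. It is not PDFBox's actual parser; the class name PdfNumberCheck and the method isValidPdfNumber are made up for this example. It assumes the PDF syntax rule that a number is an optional sign followed by decimal digits with an optional leading, trailing, or embedded period, so at least one digit is required.

```java
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; not part of PDFBox or Tika.
class PdfNumberCheck {

    // Optional sign, then either digits with an optional period and
    // optional fraction digits ("12", "12.", "12.5"), or a period
    // followed by at least one digit (".5"). No exponent form.
    private static final Pattern PDF_NUMBER =
            Pattern.compile("[+-]?(?:\\d+\\.?\\d*|\\.\\d+)");

    static boolean isValidPdfNumber(String token) {
        return PDF_NUMBER.matcher(token).matches();
    }

    public static void main(String[] args) {
        String[] tokens = {"-.", "-.5", "12.", "+17", "1e5"};
        for (String token : tokens) {
            System.out.println("'" + token + "' valid: " + isValidPdfNumber(token));
        }

        // Java's own float parser rejects the token as well.
        try {
            Float.parseFloat("-.");
        } catch (NumberFormatException e) {
            System.out.println("Float.parseFloat rejected '-.'");
        }
    }
}
```

A token like "-." has a sign and a period but no digits, so a parser that expects a number at that position has nothing to convert. Some viewers may tolerate this, for example by treating it as zero, which would explain why the file still displays.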

Tilman



