pdfbox-users mailing list archives

From Jean Monchery <rmonch...@gmail.com>
Subject Re: PDFBOX - TIKA - PDF parsing error
Date Thu, 29 Jun 2017 19:38:17 GMT
Not sure, but if you're using Java 8, it allows _ as a separator in big
numeric literals, like 1_000_000_000. Hope that helps.
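
A minimal sketch of what the standard Java parser accepts, in case it helps narrow things down. The class and method names here (NumberParsingDemo, parsesAsFloat) are made up for illustration and are not part of PDFBox or Tika, and this is not a claim about how PDFBox itself parses numbers. It only shows that the underscore separator is a source-code literal feature, while Float.parseFloat rejects both "1_000" and "-." at runtime:

```java
// Hypothetical demo class; not PDFBox or Tika code.
class NumberParsingDemo {

    // Returns true if the JDK's Float.parseFloat accepts the string.
    static boolean parsesAsFloat(String s) {
        try {
            Float.parseFloat(s);
            return true;
        } catch (NumberFormatException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Underscores are allowed in numeric literals in source code.
        int literal = 1_000_000_000;
        System.out.println("literal = " + literal);

        // They are not accepted when parsing strings at runtime.
        System.out.println("\"1_000\" parses: " + parsesAsFloat("1_000"));

        // A sign and a decimal point with no digits is not a number.
        System.out.println("\"-.\" parses: " + parsesAsFloat("-."));

        // With at least one digit the same shape is fine.
        System.out.println("\"-.5\" parses: " + parsesAsFloat("-.5"));
    }
}
```

So if the token in the file really is "-." with no digits, the underscore feature would not explain it.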

On Thu, Jun 29, 2017 at 3:22 PM, Tilman Hausherr <THausherr@t-online.de>
wrote:

> On 29.06.2017 at 12:33, Daniel MendesDaSilva wrote:
>
>>
>> Daniel Mendes da Silva
>> Senior Analyst Programmer
>>
>> From: Daniel MendesDaSilva
>> Sent: 29 June 2017 12:25
>> To: 'dev@pdfbox.apache.org'; 'users@pdfbox.apache.org'
>> Subject: PDFBOX - TIKA - PDF parsing error
>> Importance: High
>>
>> Hi,
>>
>> We're using PDFBox through Tika and we get an exception when parsing a 5
>> MB PDF file. I'm not able to attach it to this mail.
>>
>> Any ideas why we get this error?
>> Why is PDFBox trying to parse "-." as a number?
>>
>>
>> Caused by: java.io.IOException: Error expected floating point number
>> actual='-.'
>> Caused by: java.lang.NumberFormatException: null
>>
>
> Most likely your PDF is incorrect, even if Adobe Reader can display it. At
> some place, PDFBox expects a floating point number and gets "-." instead.
> Please upload the PDF to a sharehoster.
>
> Tilman
>
>
