pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jean Monchery <rmonch...@gmail.com>
Subject Re: PDFBOX - TIKA - PDF parsing error
Date Fri, 30 Jun 2017 10:22:48 GMT
Your issue might be with page four if your parsing those numbers with
integer.parse or long.parse. Just a wild guess. Are you getting the error
when your using the load method or something?

On Jun 30, 2017 3:17 AM, "Daniel MendesDaSilva" <
Daniel.MendesDaSilva@arhs-spikeseed.com> wrote:

> Hi
>
> Here is the file
>
>
>
> *         http://www51.zippyshare.com/v/pvzQaQrh/file.html
>
>
>
>
>
> On 2017-06-29 21:22 (+0200), Tilman Hausherr <T...@t-online.de> wrote:
> > Am 29.06.2017 um 12:33 schrieb Daniel MendesDaSilva:>
> > >>
> > > Daniel Mendes da Silva>
> > > Senior Analyst Programmer>
> > >>
> > > From: Daniel MendesDaSilva>
> > > Sent: 29 June 2017 12:25>
> > > To: 'dev@pdfbox.apache.org'; 'users@pdfbox.apache.org'>
> > > Subject: PDFBOX - TIKA - PDF parsing error>
> > > Importance: High>
> > >>
> > > Hi,>
> > >>
> > > We're using PdfBox through Tika and we get an exception when parsing a
> 5 MB PDF file - I'm not able to attach to this mail.>
>
> > >>
> > > Any ideas why we have this error?>
> > > Why PDFBOX is trying to parse "-." as a number ?>
> > >>
> > >>
> > > Caused by: java.io.IOException: Error expected floating point number
> actual='-.'>
> > > Caused by: java.lang.NumberFormatException: null>
> >
> > Most likely your PDF is incorrect, even if Adobe Reader can display it. >
> > At some place, PDFBox expects a floating point number and gets "-." >
> > instead. Please upload the PDF to a sharehoster.>
> >
> > Tilman>
> >
> >
> > --------------------------------------------------------------------->
> > To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org>
> > For additional commands, e-mail: users
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message