pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Andrew Munn <and...@nmedia.net>
Subject Re: PDFTextStripper parsing numbers backwards?
Date Mon, 16 Mar 2015 19:03:20 GMT


On Mon, 16 Mar 2015, Andreas Lehmkuehler wrote:
> Am 16.03.2015 um 01:45 schrieb Andrew Munn:
> > I'm parsing this doc
> > http://www.topazdevelopment.com/tmp/15-10145.pdf
> > 
> > 
> > page 14:
> > I have a doc with figures like $2,000 and when I extract the text it comes
> > over as 00.000,2$
> > 
> Activating the sorting should do the trick
> 
> textStripper.setSortByPosition(true)
> 

That improved some things but numbers are still backwards.  $3,000.00 
comes out 00.000,3$

See page #15 of the doc  http://www.topazdevelopment.com/tmp/15-10145.pdf

-Andrew


---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: users-help@pdfbox.apache.org


Mime
View raw message