pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Moshe Liaks <ajli...@gmail.com>
Subject Re: PDFBox - Read pdf file line by line using C#.Net
Date Wed, 18 Feb 2009 10:06:42 GMT
Hi Jukka,

Is there any tutorial, or any other material to learn how to do it?
Thanks,
Aldo.



On Wed, Feb 18, 2009 at 11:33 AM, Jukka Zitting <jukka.zitting@gmail.com>wrote:

> Hi,
>
> On Mon, Feb 16, 2009 at 6:15 PM, Moshe Liaks <ajliaks@gmail.com> wrote:
> > I use the code below to read a pdf file.
> > The code is working fine. The problem is that I have to read the pdf
> > line by line and not like "one big string".
> > I have this need, because the text is a complex one, and I need to
> > apply some filters while reading each line from the original.
>
> You could subclass the PDFTextStripper class, and do your filtering in
> the writeLineSeparator() method after buffering all the text on that
> line.
>
> BR,
>
> Jukka Zitting
>



-- 
___________________________
Have a Plenty and Powerfull Life!
Eng. Moshe (Aldo) Liaks

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message