pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Pierre Dubillot <alexcou...@gmail.com>
Subject Re: Extraction problems with PDFTextStripperByArea
Date Thu, 23 Jul 2015 18:03:57 GMT
If you are talking about the text extractor, it's realy strange, because on
each page, column and lines are with the same position..
Or is it the mediabox x/y positions ?
Le 23 juil. 2015 6:11 PM, "Tilman Hausherr" <THausherr@t-online.de> a
écrit :

> Am 23.07.2015 um 11:17 schrieb Pierre Dubillot:
>
>> http://pastebin.com/txLgqb7R
>>
>> Here's what I do to split my original page. But, I don't know what I
>> should
>> update for the mediabox...
>>
>>
>
> I think that the problem isn't the media box, but the coordinates you use.
> The mediabox is different because you use it as a "window" on a huge page.
>
> Tilman
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org
> For additional commands, e-mail: users-help@pdfbox.apache.org
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message