pdfbox-users mailing list archives

From Tilman Hausherr <THaush...@t-online.de>
Subject Re: Extraction problems with PDFTextStripperByArea
Date Thu, 23 Jul 2015 18:24:54 GMT
On 23.07.2015 at 20:03, Pierre Dubillot wrote:
> If you are talking about the text extractor, it's really strange, because on
> each page the columns and lines are at the same position.
> Or is it the mediabox x/y positions?

Sorry, ignore what I've written. I need to think more.

Tilman

> On 23 Jul 2015 6:11 PM, "Tilman Hausherr" <THausherr@t-online.de> wrote:
>
>> On 23.07.2015 at 11:17, Pierre Dubillot wrote:
>>
>>> http://pastebin.com/txLgqb7R
>>>
>>> Here's what I do to split my original page, but I don't know what I
>>> should update for the mediabox...
>>>
>>>
>> I think that the problem isn't the media box, but the coordinates you use.
>> The mediabox is different because you use it as a "window" on a huge page.
>>
>> Tilman
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org
>> For additional commands, e-mail: users-help@pdfbox.apache.org
>>
>>



