pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jonas Karlsson <thejo...@gmail.com>
Subject Re: TextExtraction only working after uncompressing with pdftk
Date Mon, 28 Apr 2014 16:21:47 GMT
Hi Tilman,

I tried the 1.8.5-SNAPSHOT and get the same result as before. No text and

Apr 28, 2014 12:20:48 PM org.apache.pdfbox.pdfparser.NonSequentialPDFParser
validateStreamLength

SEVERE: The end of the stream doesn't point to the correct offset, using
workaround to read the stream

_jonas

On Mon, Apr 28, 2014 at 11:04 AM, Tilman Hausherr <THausherr@t-online.de>wrote:

> There was a (recently fixed) bug with the LZW decoder, please try the
> current snapshot and tell us what happens
> https://repository.apache.org/content/groups/snapshots/org/
> apache/pdfbox/pdfbox/1.8.5-SNAPSHOT/
>
> Tilman
>
> Am 28.04.2014 17:00, schrieb Jonas Karlsson:
>
>  java.io.StreamCorruptedException: Error: data is null
>>
>>   at org.apache.pdfbox.filter.LZWFilter.decode(LZWFilter.java:82)
>>
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message